Bowen Zhang

Cited by

	All	Since 2019
Citations	1391	1172
h-index	13	12
i10-index	14	13

280

140

210

20162017201820192020202120222023202411 85 113 145 165 211 261 262 127

Public access

View all

7 articles

1 article

available

not available

Based on funding mandates

Co-authors

Hanli WangDepartment of Computer Science and Technology, Tongji University, ShanghaiVerified email at tongji.edu.cn
Limin WangNanjing UniversityVerified email at nju.edu.cn
Fei ShaGoogle ResearchVerified email at feisha.org
Zhe WangGenAI@Adobe; UC IrvineVerified email at uci.edu
Yu QiaoProfessor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CASVerified email at siat.ac.cn
Hexiang (Frank) HuGoogle DeepmindVerified email at google.com
Yinfei YangAppleVerified email at apple.com
Xianzhi DuResearch Scientist, Apple AI/MLVerified email at apple.com
Yuanjun XiongAmazon Web ServicesVerified email at amazon.com
Zhe GanResearch Scientist, AppleVerified email at apple.com
Liangliang CaoApple IncVerified email at apple.com
Ruoming Pang (庞若鸣)Apple AI/MLVerified email at apple.com
Vihan JainGoogle IncVerified email at google.com
Eugene IeGoogleVerified email at google.com
Haoxuan YouColumbia UniversityVerified email at columbia.edu
Haotian ZhangResearch Scientist, AppleVerified email at apple.com
Zirui WangResearch Scientist, Apple AI/MLVerified email at apple.com
Shih-Fu ChangProfessor of Electrical Engineering and Computer Science, Columbia UniversityVerified email at columbia.edu
Melissa AilemMicrosoftVerified email at microsoft.com
Yue ZhaoUniversity of Texas at AustinVerified email at cs.utexas.edu

Bowen Zhang

Apple

Verified email at apple.com - Homepage

Computer Vision Action Recognition Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Real-time action recognition with enhanced motion vector CNNs B Zhang, L Wang, Z Wang, Y Qiao, H Wang CVPR, 2718-2726, 2016	503	2016
Real-time action recognition with deeply transferred motion vector cnns B Zhang, L Wang, Z Wang, Y Qiao, H Wang IEEE Transactions on Image Processing 27 (5), 2326-2339, 2018	173	2018
Cross-Modal and Hierarchical Modeling of Video and Text B Zhang, H Hu, F Sha Proceedings of the European Conference on Computer Vision (ECCV), 374-390, 2018	146	2018
Cuhk & ethz & siat submission to activitynet challenge 2016 Y Xiong, L Wang, Z Wang, B Zhang, H Song, W Li, D Lin, Y Qiao, ... CVPR'16 ActivityNet workshop, 2016	134	2016
Weakly supervised patchnets: Describing and aggregating local patches for scene recognition Z Wang, L Wang, Y Wang, B Zhang, Y Qiao IEEE Transactions on Image Processing 26 (4), 2028-2041, 2017	95	2017
Ferret: Refer and ground anything anywhere at any granularity H You, H Zhang, Z Gan, X Du, B Zhang, Z Wang, L Cao, SF Chang, ... arXiv preprint arXiv:2310.07704, 2023	66	2023
Cuhk & ethz & siat submission to activitynet challenge 2017 Y Zhao, B Zhang, Z Wu, S Yang, L Zhou, S Yan, L Wang, Y Xiong, D Lin, ... CVPR'17 ActivityNet workshop 8, 8, 2017	45	2017
Co-training Transformer with Videos and Images Improves Action Recognition B Zhang, J Yu, C Fifty, W Han, AM Dai, R Pang, F Sha arXiv preprint arXiv:2112.07175, 2021	42	2021
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus B Zhang, H Hu, J Lee, M Zhao, S Chammas, V Jain, E Ie, F Sha arXiv preprint arXiv:2011.09046, 2020	25	2020
Learning to Represent Image and Text with Denotation Graph B Zhang, H Hu, V Jain, E Ie, F Sha EMNLP'20, 823-839, 2020	21	2020
Topic Augmented Generator for Abstractive Summarization M Ailem, B Zhang, F Sha BayLearn, 2019	19	2019
MIC-TJU at MediaEval Violent Scenes Detection (VSD) 2014. B Zhang, Y Yi, H Wang, J Yu MediaEval, 2014	19	2014
Systematic Generalization on gSCAN: What is Nearly Solved and What is Next? L Qiu, H Hu, B Zhang, P Shaw, F Sha EMNLP'21, 2021	15	2021
From scarcity to efficiency: Improving clip training via visual-enriched captions Z Lai, H Zhang, B Zhang, W Wu, H Bai, A Timofeev, X Du, Z Gan, J Shan, ... arXiv preprint arXiv:2310.07699, 2023	12	2023
Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness L Cao, B Zhang, C Chen, Y Yang, X Du, W Zhang, Z Lu, Y Zheng arXiv preprint arXiv:2305.05095, 2023	9	2023
STAIR: Learning Sparse Text and Image Representation in Grounded Tokens C Chen, B Zhang, L Cao, J Shen, T Gunter, AM Jose, A Toshev, J Shlens, ... arXiv preprint arXiv:2301.13081, 2023	8	2023
A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images M Ailem, B Zhang, A Bellet, P Denis, F Sha EMNLP'18, 1478-1487, 2018	8	2018
Learning correlations for human action recognition in videos Y Yi, H Wang, B Zhang Multimedia Tools and Applications 76, 18891-18913, 2017	8	2017
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training B McKinzie, Z Gan, JP Fauconnier, S Dodge, B Zhang, P Dufter, D Shah, ... arXiv preprint arXiv:2403.09611, 2024	7	2024
Compressing LLMs: The Truth is Rarely Pure and Never Simple A Jaiswal, Z Gan, X Du, B Zhang, Z Wang, Y Yang arXiv preprint arXiv:2310.01382, 2023	7	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors