Bowen Zhang
Cited by
Cited by
Real-time action recognition with enhanced motion vector CNNs
B Zhang, L Wang, Z Wang, Y Qiao, H Wang
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Real-time action recognition with deeply transferred motion vector cnns
B Zhang, L Wang, Z Wang, Y Qiao, H Wang
IEEE Transactions on Image Processing 27 (5), 2326-2339, 2018
Cuhk & ethz & siat submission to activitynet challenge 2016
Y Xiong, L Wang, Z Wang, B Zhang, H Song, W Li, D Lin, Y Qiao, ...
CVPR'16 ActivityNet workshop, 2016
Weakly supervised patchnets: Describing and aggregating local patches for scene recognition
Z Wang, L Wang, Y Wang, B Zhang, Y Qiao
IEEE Transactions on Image Processing 26 (4), 2028-2041, 2017
Cross-Modal and Hierarchical Modeling of Video and Text
B Zhang, H Hu, F Sha
Proceedings of the European Conference on Computer Vision (ECCV), 374-390, 2018
Cuhk & ethz & siat submission to activitynet challenge 2017
Y Zhao, B Zhang, Z Wu, S Yang, L Zhou, S Yan, L Wang, Y Xiong, D Lin, ...
CVPR'17 ActivityNet workshop 8, 8, 2017
MIC-TJU at MediaEval Violent Scenes Detection (VSD) 2014.
B Zhang, Y Yi, H Wang, J Yu
MediaEval, 2014
Learning to Represent Image and Text with Denotation Graph
B Zhang, H Hu, V Jain, E Ie, F Sha
EMNLP'20, 823-839, 2020
Topic Augmented Generator for Abstractive Summarization
M Ailem, B Zhang, F Sha
arXiv preprint arXiv:1908.07026, 2019
A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images
M Ailem, B Zhang, A Bellet, P Denis, F Sha
EMNLP'18, 1478-1487, 2018
Learning correlations for human action recognition in videos
Y Yi, H Wang, B Zhang
Multimedia Tools and Applications 76 (18), 18891-18913, 2017
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus
B Zhang, H Hu, J Lee, M Zhao, S Chammas, V Jain, E Ie, F Sha
arXiv preprint arXiv:2011.09046, 2020
MIC-TJU in MediaEval 2015 affective impact of movies task
Y Yi, H Wang, B Zhang
Working Notes Proceedings of the Media-Eval Workshop, 2015
Systematic Generalization on gSCAN: What is Nearly Solved and What is Next?
L Qiu, H Hu, B Zhang, P Shaw, F Sha
EMNLP'21, 2021
Visual Storytelling via Predicting Anchor Word Embeddings in the Stories
B Zhang, H Hu, F Sha
ICCV'19 CLVL workshop, 2019
Online Action Detection in Streaming Videos with Time Buffers
B Zhang, H Chen, M Wang, Y Xiong
arXiv preprint arXiv:2010.03016, 2020
Visually Grounded Concept Composition
B Zhang, H Hu, L Qiu, P Shaw, F Sha
Findings of EMNLP'21, 2021
Co-training Transformer with Videos and Images Improves Action Recognition
B Zhang, J Yu, C Fifty, W Han, AM Dai, R Pang, F Sha
arXiv preprint arXiv:2112.07175, 2021
The system can't perform the operation now. Try again later.
Articles 1–18