Luowei Zhou
Research Scientist, Google Brain
Verified email at google.com
Title
Cited by
Year
Unified vision-language pre-training for image captioning and VQA
L Zhou, H Palangi, L Zhang, H Hu, J Corso, J Gao
Proceedings of the AAAI conference on artificial intelligence 34 (07), 13041 …, 2020
Cited by: 514, Year: 2020
Towards automatic learning of procedures from web instructional videos
L Zhou, C Xu, J Corso
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 425, Year: 2018
End-to-end dense video captioning with masked transformer
L Zhou, Y Zhou, JJ Corso, R Socher, C Xiong
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 424, Year: 2018
Less is more: ClipBERT for video-and-language learning via sparse sampling
J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 253, Year: 2021
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by: 179, Year: 2021
Grounded video description
L Zhou, Y Kalantidis, X Chen, JJ Corso, M Rohrbach
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Cited by: 159, Year: 2019
Watch what you just said: Image captioning with text-conditional attention
L Zhou, C Xu, P Koch, JJ Corso
Proceedings of the Thematic Workshops of ACM Multimedia 2017, 305-313, 2017
Cited by: 80, Year: 2017
Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction
L Zhou, N Louis, JJ Corso
British Machine Vision Conference, 2018
Cited by: 66, Year: 2018
BEVT: BERT pretraining of video transformers
R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, YG Jiang, L Zhou, L Yuan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 59, Year: 2022
Dense video captioning
Y Zhou, L Zhou, C Xiong, R Socher
US Patent 10,542,270, 2020
Cited by: 57, Year: 2020
Multiagent reinforcement learning with sparse interactions by negotiation and knowledge transfer
L Zhou, P Yang, C Chen, Y Gao
IEEE transactions on cybernetics 47 (5), 1238-1250, 2016
Cited by: 49, Year: 2016
VALUE: A multi-task benchmark for video-and-language understanding evaluation
L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ...
arXiv preprint arXiv:2106.04632, 2021
Cited by: 46, Year: 2021
RegionCLIP: Region-based language-image pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 42, Year: 2022
UC2: Universal cross-lingual cross-modal vision-and-language pre-training
M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 41, Year: 2021
Image caption generation with text-conditional semantic attention
L Zhou, C Xu, P Koch, JJ Corso
arXiv preprint arXiv:1606.04621 2, 2016
Cited by: 41, Year: 2016
A balanced heuristic mechanism for multirobot task allocation of intelligent warehouses
L Zhou, Y Shi, J Wang, P Yang
Mathematical Problems in Engineering 2014, 2014
Cited by: 26, Year: 2014
Cluster-Former: Clustering-based sparse transformer for question answering
S Wang, L Zhou, Z Gan, YC Chen, Y Fang, S Sun, Y Cheng, J Liu
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
Cited by: 25*, Year: 2021
CLIP-Event: Connecting text and images with event structures
M Li, R Xu, S Wang, L Zhou, X Lin, C Zhu, M Zeng, H Ji, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 23, Year: 2022
ProcNets: Learning to segment procedures in untrimmed and unconstrained videos
L Zhou, C Xu, JJ Corso
arXiv preprint arXiv:1703.09788 2 (6), 7, 2017
Cited by: 14*, Year: 2017
OmniVL: One foundation model for image-language and video-language tasks
J Wang, D Chen, Z Wu, C Luo, L Zhou, Y Zhao, Y Xie, C Liu, YG Jiang, ...
arXiv preprint arXiv:2209.07526, 2022
Cited by: 13, Year: 2022