Xiang Long
Xiang Long
Bekræftet mail på baidu.com
Citeret af
Citeret af
Attention clusters: Purely attention based local feature integration for video classification
X Long, C Gan, G De Melo, J Wu, X Liu, S Wen
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Multimodal keyless attention fusion for video classification
X Long, C Gan, G Melo, X Liu, Y Li, F Li, S Wen
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Video captioning with multi-faceted attention
X Long, C Gan, G De Melo
Transactions of the Association for Computational Linguistics 6, 173-184, 2018
Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding
F Li, C Gan, X Liu, Y Bian, X Long, Y Li, Z Li, J Zhou, S Wen
CVPR Workshop, 2017
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification
Y Bian, C Gan, L Xiao, F Li, X Long, Y Li, H Qi, J Zhou, S Wen, Y Lin
CVPR Workshop, 2017
Multi-Label Classification with Label Graph Superimposing
Y Wang, D He, F Li, X Long, Z Zhou, J Ma, S Wen
AAAI, 2020
Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification
R You, Z Guo, L Cui, X Long, Y Bao, S Wen
AAAI, 2020
PP-YOLO: An effective and efficient implementation of object detector
X Long, K Deng, G Wang, Y Zhang, Q Dang, Y Gao, H Shen, J Ren, ...
arXiv preprint arXiv:2007.12099, 2020
Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition
D He, F Li, Q Zhao, X Long, Y Fu, S Wen
CVPR Workshop, 2018
Deep concept-wise temporal convolutional networks for action localization
X Li, T Lin, X Liu, W Zuo, C Li, X Long, D He, F Li, S Wen, C Gan
Proceedings of the 28th ACM International Conference on Multimedia, 4004-4012, 2020
Graph-pcnn: Two stage human pose estimation with graph pose refinement
J Wang, X Long, Y Gao, E Ding, S Wen
European Conference on Computer Vision, 492-508, 2020
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning
P Chen, D Huang, D He, X Long, R Zeng, S Wen, M Tan, C Gan
arXiv preprint arXiv:2011.07949, 2020
PP-YOLOv2: A Practical Object Detector
X Huang, X Wang, W Lv, X Bai, X Long, K Deng, Q Dang, S Han, Q Liu, ...
arXiv preprint arXiv:2104.10419, 2021
Method and apparatus for classifying video
X Long, D He, F Li, CHI Zhizhen, Z Zhichao, X Zhao, P Wang, H Sun, ...
US Patent App. 16/830,895, 2021
Purely Attention Based Local Feature Integration for Video Classification
X Long, G De Melo, D He, F Li, Z Chi, S Wen, C Gan
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020
VSRNet: End-to-End Video Segment Retrieval with Text Query
X Sun, X Long, D He, S Wen, Z Lian
Pattern Recognition, 108027, 2021
Method, device, apparatus for predicting video coding complexity and storage medium
Z Zhichao, D He, F Li, X Zhao, X Li, CHI Zhizhen, X Long, H Sun
US Patent App. 16/797,911, 2020
Right on Target: Iterative Optimization for Dense Video Captioning — Submission to ActivityNet Challenge 2019 Task 3
X Long, X Zhao, T Lin, D He, F Li, S Wen
CVPR Workshop, 2019
Action Pyramid Networks for Proposal and Activity Detection: Submission to ActivityNet Challenge 2018 Task1 and Task2
X Liu, F Yang, X Li, J Yu, R Long, X Long, S Wen
CVPR Workshop, 2018
Multi-modal fusion network based on relation-aware pyramid network for temporal action localization
J Gao, T Lin, X Long, D He, F Li, X Li, S Wen, E Ding
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–20