Houdong Hu
Houdong Hu
Microsoft,Principal Engineering Manager
Verified email at
Cited by
Cited by
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
X Li, X Yin, C Li, P Zhang, X Hu, L Zhang, L Wang, H Hu, L Dong, F Wei, ...
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Stacked cross attention for image-text matching
KH Lee, X Chen, G Hua, H Hu, X He
Proceedings of the European conference on computer vision (ECCV), 201-216, 2018
Unified vision-language pre-training for image captioning and vqa
L Zhou, H Palangi, L Zhang, H Hu, J Corso, J Gao
Proceedings of the AAAI conference on artificial intelligence 34 (07), 13041 …, 2020
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Elevater: A benchmark and toolkit for evaluating language-augmented visual models
C Li, H Liu, L Li, P Zhang, J Aneja, J Yang, P Jin, H Hu, Z Liu, YJ Lee, ...
Advances in Neural Information Processing Systems 35, 9287-9301, 2022
Web-scale responsive visual search at bing
H Hu, Y Wang, L Yang, P Komlev, L Huang, X Chen, J Huang, Y Wu, ...
Proceedings of the 24th ACM SIGKDD international conference on knowledge …, 2018
Plasmonic dark field microscopy
H Hu, C Ma, Z Liu
Applied Physics Letters 96 (11), 2010
Product identification in image with multiple products
H Hu, L Huang
US Patent 10,902,051, 2021
Learning visual relation priors for image-text matching and image captioning with neural scene graph generators
KH Lee, H Palangi, X Chen, H Hu, J Gao
arXiv preprint arXiv:1909.09953, 2019
System and method for attribute-based visual search over a computer communication network
L Huang, M Merchant, H Hu, A Sacheti
US Patent 11,120,070, 2021
Image scene graph generation (sgg) benchmark
X Han, J Yang, H Hu, L Zhang, J Gao, P Zhang
arXiv preprint arXiv:2107.12604, 2021
Multi-modal visual search pipeline for web scale images
H Hu, Y Wang, L Yang, L Huang, X Chen, J Huang, Y Wu, AK Sacheti, ...
US Patent 11,074,289, 2021
Transforming audio content into images
L Huang, H Hu, C Su
US Patent 10,891,969, 2021
An universal image attractiveness ranking framework
N Ma, A Volkov, A Livshits, P Pietrusinski, H Hu, M Bolin
2019 IEEE winter conference on applications of computer vision (WACV), 657-665, 2019
Generating and applying an object-level relational index for images
K Wu, S Yiran, H Hu, S Sreepada, A Sacheti, MD Gupta, RR Gandhi, ...
US Patent 11,182,408, 2021
Stacked cross-modal matching
KH Lee, G Hua, X Chen, H Hu, H Xiaodong
US Patent 11,093,560, 2021
MMPTRACK: Large-scale densely annotated multi-camera multiple people tracking benchmark
X Han, Q You, C Wang, Z Zhang, P Chu, H Hu, J Wang, Z Liu
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023
Object detection from image content
A Sacheti, X Chen, H Hu, L Huang, J Huang, M Merchant
US Patent App. 15/900,606, 2019
Machine learning hyperparameter tuning tool
J Huang, H Hu, L Huang, X Chen, L Yang
US Patent App. 15/883,686, 2019
LijuanWang, Houdong Hu, Li Dong, FuruWei, et al. 2020. Oscar: Object-semantics aligned pre-training for vision-language tasks
X Li, X Yin, C Li, P Zhang, X Hu, L Zhang
European Conference on Computer Vision. Springer, 0
The system can't perform the operation now. Try again later.
Articles 1–20