Pano-avqa: Grounded audio-visual question answering on 360deg videos H Yun, Y Yu, W Yang, K Lee, G Kim Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 91 | 2021 |
Transitional adaptation of pretrained models for visual storytelling Y Yu, J Chung, H Yun, J Kim, G Kim Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 34 | 2021 |
Multimodal knowledge alignment with reinforcement learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, P Ammanabrolu, R Zellers, ... arXiv preprint arXiv:2205.12630, 2022 | 32 | 2022 |
Panoramic Vision Transformer for Saliency Detection in 360 Videos H Yun, S Lee, G Kim European Conference on Computer Vision, 422-439, 2022 | 20 | 2022 |
Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, R Zellers, P Ammanabrolu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 16 | 2023 |
Character grounding and re-identification in story of videos and text descriptions Y Yu, J Kim, H Yun, J Chung, G Kim Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 7 | 2020 |
Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation H Yun, J Na, G Kim Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 5 | 2023 |
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos H Yun, R Gao, I Ananthabhotla, A Kumar, J Donley, C Li, G Kim, VK Ithapu, ... European Conference on Computer Vision, 256-274, 2024 | 1 | 2024 |
A mobile robot generating video summaries of seniors' indoor activities CY Yang, H Yun, S Varadaraj, JY Hsu Proceedings of the 21st International Conference on Human-Computer …, 2019 | 1 | 2019 |