Follow
Tanmay Gupta
Tanmay Gupta
PRIOR @ Allen Institute for Artificial Intelligence
Verified email at allenai.org - Homepage
Title
Cited by
Cited by
Year
Completing 3d object shape from one depth image
J Rock, T Gupta, J Thorsen, JY Gwak, D Shin, D Hoiem
Proceedings of the IEEE conference on computer vision and pattern …, 2015
2002015
Visual programming: Compositional visual reasoning without training
T Gupta, A Kembhavi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1862023
No-frills human-object interaction detection: Factorization, layout encodings, and training techniques
T Gupta, A Schwing, D Hoiem
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
1352019
Contrastive learning for weakly supervised phrase grounding
T Gupta, A Vahdat, G Chechik, X Yang, J Kautz, D Hoiem
European Conference on Computer Vision, 752-768, 2020
1242020
Towards general purpose vision systems: An end-to-end task-agnostic vision-language architecture
T Gupta, A Kamath, A Kembhavi, D Hoiem
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
742022
Imagine this! scripts to compositions to videos
T Gupta, D Schwenk, A Farhadi, D Hoiem, A Kembhavi
Proceedings of the European conference on computer vision (ECCV), 598-613, 2018
712018
Visual semantic role labeling for video understanding
A Sadhu, T Gupta, M Yatskar, R Nevatia, A Kembhavi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
532021
Webly supervised concept expansion for general purpose vision models
A Kamath, C Clark, T Gupta, E Kolve, D Hoiem, A Kembhavi
European Conference on Computer Vision, 662-681, 2022
472022
Vico: Word embeddings from visual co-occurrences
T Gupta, A Schwing, D Hoiem
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
292019
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
T Gupta, K Shih, S Singh, D Hoiem
International Conference on Computer Vision (ICCV), 2017
232017
Learning curves for analysis of deep networks
D Hoiem, T Gupta, Z Li, M Shlapentokh-Rothman
International conference on machine learning, 4287-4296, 2021
212021
Grit: General robust image task benchmark
T Gupta, R Marten, A Kembhavi, D Hoiem
arXiv preprint arXiv:2204.13653, 2022
162022
Object 3dit: Language-guided 3d-aware image editing
O Michel, A Bhattad, E VanderBilt, R Krishna, A Kembhavi, T Gupta
Advances in Neural Information Processing Systems 36, 2024
82024
3dfs: Deformable dense depth fusion and segmentation for object reconstruction from a handheld camera
T Gupta, D Shin, N Sivagnanadasan, D Hoiem
arXiv preprint arXiv:1606.05002, 2016
62016
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
K Ehsani, T Gupta, R Hendrix, J Salvador, L Weihs, KH Zeng, KP Singh, ...
arXiv preprint arXiv:2312.02976, 2023
32023
Joint representation learning from images and text
A Vahdat, T Gupta, X Yang, J Kautz
US Patent App. 17/000,048, 2021
22021
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
Z Ma, W Huang, J Zhang, T Gupta, R Krishna
arXiv preprint arXiv:2403.11085, 2024
2024
Selective" Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning
T Srinivasan, J Hessel, T Gupta, BY Lin, Y Choi, J Thomason, KR Chandu
arXiv preprint arXiv:2402.15610, 2024
2024
Representations from vision and language
T Gupta
University of Illinois at Urbana-Champaign, 2020
2020
Policy Gradients+ Planning
R Kataria, Z Li, T Gupta
The system can't perform the operation now. Try again later.
Articles 1–20