Follow
Stephen Gould
Title
Cited by
Cited by
Year
Bottom-up and top-down attention for image captioning and visual question answering
P Anderson, X He, C Buehler, D Teney, M Johnson, S Gould, L Zhang
Proceedings of the IEEE conference on computer vision and pattern …, 2018
54602018
Spice: Semantic propositional image caption evaluation
P Anderson, B Fernando, M Johnson, S Gould
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
22412016
Vision-and-language navigation: Interpreting visually-grounded navigation instructions in real environments
P Anderson, Q Wu, D Teney, J Bruce, M Johnson, N Sünderhauf, I Reid, ...
Proceedings of the IEEE conference on computer vision and pattern …, 2018
14342018
Decomposing a scene into geometric and semantically consistent regions
S Gould, R Fulton, D Koller
2009 IEEE 12th international conference on computer vision, 1-8, 2009
9422009
Dynamic image networks for action recognition
H Bilen, B Fernando, E Gavves, A Vedaldi, S Gould
Proceedings of the IEEE conference on computer vision and pattern …, 2016
7022016
Single image depth estimation from predicted semantic labels
B Liu, S Gould, D Koller
2010 IEEE computer society conference on computer vision and pattern …, 2010
6152010
Self-supervised video representation learning with odd-one-out networks
B Fernando, H Bilen, E Gavves, S Gould
Proceedings of the IEEE conference on computer vision and pattern …, 2017
5432017
Multi-class segmentation with relative location prior
S Gould, J Rodgers, D Cohen, G Elidan, D Koller
International journal of computer vision 80, 300-316, 2008
5262008
Region-based segmentation and object detection
S Gould, T Gao, D Koller
Advances in neural information processing systems 22, 2009
2862009
Vln bert: A recurrent vision-and-language bert for navigation
Y Hong, Q Wu, Y Qi, C Rodriguez-Opazo, S Gould
Proceedings of the IEEE/CVF conference on Computer Vision and Pattern …, 2021
2742021
Bottom-up and top-down attention for image captioning and vqa
P Anderson, X He, C Buehler, D Teney, M Johnson, S Gould, L Zhang
arXiv preprint arXiv:1707.07998 2 (4), 8, 2017
2682017
Guided open vocabulary image captioning with constrained beam search
P Anderson, B Fernando, M Johnson, S Gould
arXiv preprint arXiv:1612.00576, 2016
2632016
Cascaded classification models: Combining models for holistic scene understanding
G Heitz, S Gould, A Saxena, D Koller
Advances in neural information processing systems 21, 2008
2302008
On differentiating parameterized argmin and argmax problems with application to bi-level optimization
S Gould, B Fernando, A Cherian, P Anderson, RS Cruz, E Guo
arXiv preprint arXiv:1607.05447, 2016
2272016
Built-in foreground/background prior for weakly-supervised semantic segmentation
F Saleh, MS Aliakbarian, M Salzmann, L Petersson, S Gould, JM Alvarez
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
2022016
Discriminative learning with latent variables for cluttered indoor scene understanding
H Wang, S Gould, D Koller
Computer Vision–ECCV 2010, 435-449, 2010
1992010
Projected subgradient methods for learning sparse gaussians
J Duchi, S Gould, D Koller
arXiv preprint arXiv:1206.3249, 2012
1832012
A signal propagation perspective for pruning neural networks at initialization
N Lee, T Ajanthan, S Gould, PHS Torr
arXiv preprint arXiv:1906.06307, 2019
1822019
Image retrieval on real-life images with pre-trained vision-and-language models
Z Liu, C Rodriguez-Opazo, D Teney, S Gould
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
1792021
Proposal-free temporal moment localization of a natural-language query in video using guided attention
C Rodriguez, E Marrese-Taylor, FS Saleh, H Li, S Gould
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020
1702020
The system can't perform the operation now. Try again later.
Articles 1–20