Follow
Linxi "Jim" Fan
Title
Cited by
Cited by
Year
Deep speech 2: End-to-end speech recognition in english and mandarin
D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ...
International conference on machine learning, 173-182, 2016
31912016
SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark
L Fan, Y Zhu, J Zhu, Z Liu, O Zeng, A Gupta, J Creus-Costa, S Savarese, ...
Conference of Robotic Learning, 2018
1532018
Deconstructing the ladder network architecture
M Pezeshki, L Fan, P Brakel, A Courville, Y Bengio
International conference on machine learning, 2368-2376, 2016
1252016
A comparison between deep neural nets and kernel acoustic models for speech recognition
Z Lu, D Quo, AB Garakani, K Liu, A May, A Bellet, L Fan, M Collins, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
83*2016
iGibson 1.0: A simulation environment for interactive tasks in large realistic scenes
B Shen, F Xia, C Li, R Martín-Martín, L Fan, G Wang, C Pérez-D’Arpino, ...
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2021
702021
World of bits: An open-domain platform for web-based agents
TT Shi, A Karpathy, L Fan, J Hernandez, P Liang
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
682017
Rubiksnet: Learnable 3d-shift for efficient video action recognition
L Fan, S Buch, G Wang, R Cao, Y Zhu, JC Niebles, L Fei-Fei
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
592020
Minedojo: Building open-ended embodied agents with internet-scale knowledge
L Fan, G Wang, Y Jiang, A Mandlekar, Y Yang, H Zhu, A Tang, DA Huang, ...
arXiv preprint arXiv:2206.08853, 2022
542022
Pre-trained language models for interactive decision-making
S Li, X Puig, Y Du, C Wang, L Fan, T Chen, DA Huang, E Akyurek, ...
arXiv preprint arXiv:2202.01771, 2022
352022
Secant: Self-expert cloning for zero-shot generalization of visual policies
L Fan, G Wang, DA Huang, Z Yu, L Fei-Fei, Y Zhu, A Anandkumar
arXiv preprint arXiv:2106.09678, 2021
332021
Vima: General robot manipulation with multimodal prompts
Y Jiang, A Gupta, Z Zhang, G Wang, Y Dou, Y Chen, L Fei-Fei, ...
arXiv preprint arXiv:2210.03094, 2022
312022
Kernel approximation methods for speech recognition
A May, AB Garakani, Z Lu, D Guo, K Liu, A Bellet, L Fan, M Collins, D Hsu, ...
The Journal of Machine Learning Research 20 (1), 2121-2156, 2019
282019
Hybrid ontology-learning materials engineering system for pharmaceutical products: Multi-label entity recognition and concept detection
MFM Remolona, MF Conway, S Balasubramanian, L Fan, Z Feng, T Gu, ...
Computers & Chemical Engineering 107, 49-60, 2017
192017
Metamorph: Learning universal controllers with transformers
A Gupta, L Fan, S Ganguli, L Fei-Fei
arXiv preprint arXiv:2203.11931, 2022
142022
Mimicplay: Long-horizon imitation learning by watching human play
C Wang, L Fan, J Sun, R Zhang, L Fei-Fei, D Xu, Y Zhu, A Anandkumar
arXiv preprint arXiv:2302.12422, 2023
22023
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
L Fan, Y Zhu, J Zhu, Z Liu, O Zeng, A Gupta, J Creus-Costa, S Savarese, ...
arXiv preprint arXiv:1909.12989, 2019
22019
Prismer: A Vision-Language Model with An Ensemble of Experts
S Liu, L Fan, E Johns, Z Yu, C Xiao, A Anandkumar
arXiv preprint arXiv:2303.02506, 2023
12023
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Z Yang, W Ping, Z Liu, V Korthikanti, W Nie, DA Huang, L Fan, Z Yu, S Lan, ...
arXiv preprint arXiv:2302.04858, 2023
12023
Voyager: An Open-Ended Embodied Agent with Large Language Models
G Wang, Y Xie, Y Jiang, A Mandlekar, C Xiao, Y Zhu, L Fan, ...
arXiv preprint arXiv:2305.16291, 2023
2023
Training and Deploying Visual Agents at Scale
L Fan
Stanford University, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–20