Yi Wu

Cited by

	All	Since 2019
Citations	10205	9724
h-index	26	25
i10-index	35	34

2800

1400

700

2100

20172018201920202021202220232024113 319 662 1030 1487 2040 2759 1732

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Aviv TamarTechnionVerified email at technion.ac.il
Stuart RussellProfessor of Computer Science, University of California, BerkeleyVerified email at cs.berkeley.edu
Jiaxuan GaoInstitute for Interdisciplinary Information Sciences, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Yu Wang (汪玉)Department of Electronic Engineering, Tsinghua University, ChinaVerified email at mail.tsinghua.edu.cn
Yuandong TianResearch Scientist, Meta AI (FAIR)Verified email at fb.com
Fei FangCarnegie Mellon UniversityVerified email at cmu.edu
Igor MordatchGoogle DeepMindVerified email at google.com
Pieter AbbeelUC Berkeley | CovariantVerified email at cs.berkeley.edu
Huazhe XuTsinghua UniversityVerified email at berkeley.edu
Xiaolong WangAssistant Professor, UC San DiegoVerified email at ucsd.edu
Chao Yu（于超）Tsinghua UniversityVerified email at mail.tsinghua.edu.cn
Ryan LoweOpenAIVerified email at openai.com
Jean HarbOpenAIVerified email at openai.com
Akash VeluStudent, Stanford UniversityVerified email at stanford.edu
Eugene VinitskyAssistant Professor, NYUVerified email at nyu.edu
Shusheng XuIIIS, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Georgia GkioxariCaltechVerified email at caltech.edu
Yunfei LiTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Alexandre BayenProfessor Electrical Engineering and Computer Science, UC BerkeleyVerified email at berkeley.edu
Yuxin WuVerified email at google.com

Yi Wu

Institute for Interdisciplinary Information Sciences, Tsinghua University

Verified email at mail.tsinghua.edu.cn - Homepage

Reinforcement Learning Human-AI Interaction Multi-Agent Learning Robot Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Multi-agent actor-critic for mixed cooperative-competitive environments R Lowe, YI Wu, A Tamar, J Harb, OAI Pieter Abbeel, I Mordatch Advances in neural information processing systems 30, 2017	4986	2017
The surprising effectiveness of ppo in cooperative multi-agent games C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu Advances in Neural Information Processing Systems 35, 24611-24624, 2022	1090	2022
Emergent tool use from multi-agent autocurricula B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ... arXiv preprint arXiv:1909.07528, 2019	843	2019
Value iteration networks A Tamar, Y Wu, G Thomas, S Levine, P Abbeel Advances in neural information processing systems 29, 2016	744	2016
Building generalizable agents with a realistic and rich 3d environment Y Wu, Y Wu, G Gkioxari, Y Tian arXiv preprint arXiv:1801.02209, 2018	379	2018
Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient S Li, Y Wu, X Cui, H Dong, F Fang, S Russell Proceedings of the AAAI conference on artificial intelligence 33 (01), 4213-4220, 2019	336	2019
Adversarial training for relation extraction Y Wu, D Bamman, S Russell Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017	249	2017
Multi-task reinforcement learning with soft modularization R Yang, H Xu, Y Wu, X Wang Advances in Neural Information Processing Systems 33, 4767-4777, 2020	180	2020
Influence-based multi-agent exploration T Wang, J Wang, Y Wu, C Zhang arXiv preprint arXiv:1910.05512, 2019	146	2019
Bayesian relational memory for semantic visual navigation Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian Proceedings of the IEEE/CVF international conference on computer vision …, 2019	127*	2019
Evolutionary population curriculum for scaling multi-agent reinforcement learning Q Long, Z Zhou, A Gupta, F Fang, Y Wu, X Wang arXiv preprint arXiv:2003.10423, 2020	111	2020
Noveld: A simple yet effective exploration criterion T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian Advances in Neural Information Processing Systems 34, 25217-25230, 2021	100*	2021
Deep reinforcement learning for green security games with real-time information Y Wang, ZR Shi, L Yu, Y Wu, R Singh, L Joppa, F Fang Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 1401-1408, 2019	92	2019
Sequence level contrastive learning for text summarization S Xu, X Zhang, Y Wu, F Wei Proceedings of the AAAI conference on artificial intelligence 36 (10), 11556 …, 2022	86	2022
Unsupervised extractive summarization by pre-training hierarchical transformers S Xu, X Zhang, Y Wu, F Wei, M Zhou arXiv preprint arXiv:2010.08242, 2020	57	2020
Discovering diverse multi-agent strategic behavior via reward randomization Z Tang, C Yu, B Chen, H Xu, X Wang, F Fang, S Du, Y Wang, Y Wu arXiv preprint arXiv:2103.04564, 2021	51	2021
Maximum entropy population-based training for zero-shot human-ai coordination R Zhao, J Song, Y Yuan, H Hu, Y Gao, Y Wu, Z Sun, W Yang Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 6145-6153, 2023	45	2023
Swift: Compiled inference for probabilistic programming languages Y Wu, L Li, S Russell, R Bodik arXiv preprint arXiv:1606.09242, 2016	42*	2016
Meta-learning MCMC proposals T Wang, Y Wu, D Moore, SJ Russell Advances in neural information processing systems 31, 2018	38	2018
Revisiting some common practices in cooperative multi-agent reinforcement learning W Fu, C Yu, Z Xu, J Yang, Y Wu arXiv preprint arXiv:2206.07505, 2022	36	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors