QIANG FU

Cited by

	All	Since 2019
Citations	744	743
h-index	10	10
i10-index	11	11

300

150

225

2020202120222023202424 96 179 299 144

Public access

View all

8 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Deheng YeDirector of AI Applications, TencentVerified email at e.ntu.edu.sg
Haobo FuTencent AI Lab, University of BirminghamVerified email at tencent.com
Wei Liu, IEEE/IAPR/IMA FellowDistinguished Scientist, TencentVerified email at ee.columbia.edu

QIANG FU

Tencent AI Lab

Verified email at tencent.com

reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Mastering complex control in moba games with deep reinforcement learning D Ye, Z Liu, M Sun, B Shi, P Zhao, H Wu, H Yu, S Yang, X Wu, Q Guo, ... Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 6672-6679, 2020	288	2020
Towards playing full moba games with deep reinforcement learning D Ye, G Chen, W Zhang, S Chen, B Yuan, B Liu, J Chen, Z Liu, F Qiu, ... Advances in Neural Information Processing Systems 33, 621-632, 2020	159	2020
Supervised learning achieves human-level performance in moba games: A case study of honor of kings D Ye, G Chen, P Zhao, F Qiu, B Yuan, W Zhang, S Chen, M Sun, X Li, S Li, ... IEEE Transactions on Neural Networks and Learning Systems 33 (3), 908-918, 2020	49	2020
Juewu-mc: Playing minecraft with sample-efficient hierarchical reinforcement learning Z Lin, J Li, J Shi, D Ye, Q Fu, W Yang arXiv preprint arXiv:2112.04907, 2021	31	2021
Which heroes to pick? learning to draft in moba games with neural networks and tree search S Chen, M Zhu, D Ye, W Zhang, Q Fu, W Yang IEEE Transactions on Games 13 (4), 410-421, 2021	23	2021
Actor-critic policy optimization in a large-scale imperfect-information game H Fu, W Liu, S Wu, Y Wang, T Yang, K Li, J Xing, B Li, B Ma, Q Fu, Y Wei International Conference on Learning Representations, 2021	22	2021
Minerl diamond 2021 competition: Overview, results, and lessons learned A Kanervisto, S Milani, K Ramanauskas, N Topin, Z Lin, J Li, J Shi, D Ye, ... NeurIPS 2021 Competitions and Demonstrations Track, 13-28, 2022	21	2022
Mapgo: Model-assisted policy optimization for goal-oriented tasks M Zhu, M Liu, J Shen, Z Zhang, S Chen, W Zhang, D Ye, Y Yu, Q Fu, ... arXiv preprint arXiv:2105.06350, 2021	20	2021
Honor of kings arena: an environment for generalization in competitive reinforcement learning H Wei, J Chen, X Ji, H Qin, M Deng, S Li, L Wang, W Zhang, Y Yu, L Linc, ... Advances in Neural Information Processing Systems 35, 11881-11892, 2022	15	2022
Quality-similar diversity via population based reinforcement learning S Wu, J Yao, H Fu, Y Tian, C Qian, Y Yang, Q Fu, Y Wei The Eleventh International Conference on Learning Representations, 2022	10	2022
Revisiting discrete soft actor-critic H Zhou, Z Lin, J Li, Q Fu, W Yang, D Ye arXiv preprint arXiv:2209.10081, 2022	10	2022
Future-conditioned unsupervised pretraining for decision transformer Z Xie, Z Lin, D Ye, Q Fu, Y Wei, S Li International Conference on Machine Learning, 38187-38203, 2023	9	2023
Greedy when sure and conservative when uncertain about the opponents H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ... International Conference on Machine Learning, 6829-6848, 2022	9	2022
Learning diverse policies in moba games via macro-goals Y Gao, B Shi, X Du, L Wang, G Chen, Z Lian, F Qiu, G Han, W Wang, D Ye, ... Advances in Neural Information Processing Systems 34, 16171-16182, 2021	9	2021
Boosting offline reinforcement learning with residual generative modeling H Wei, D Ye, Z Liu, H Wu, B Yuan, Q Fu, W Yang, Z Li arXiv preprint arXiv:2106.10411, 2021	9	2021
AutoCFR: learning to design counterfactual regret minimization algorithms H Xu, K Li, H Fu, Q Fu, J Xing Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5244-5251, 2022	7	2022
Curriculum-based co-design of morphology and control of voxel-based soft robots Y Wang, S Wu, H Fu, Q Fu, T Zhang, Y Chang, X Wang The Eleventh International Conference on Learning Representations, 2022	6	2022
Enhance Reasoning for Large Language Models in the Game Werewolf S Wu, L Zhu, T Yang, S Xu, Q Fu, Y Wei, H Fu arXiv preprint arXiv:2402.02330, 2024	5	2024
RLogist: fast observation strategy on whole-slide images with deep reinforcement learning B Zhao, J Zhang, D Ye, J Cao, X Han, Q Fu, W Yang Proceedings of the AAAI Conference on Artificial Intelligence 37 (3), 3570-3578, 2023	5	2023
Towards effective and interpretable human-agent collaboration in moba games: A communication perspective Y Gao, F Liu, L Wang, Z Lian, W Wang, S Li, X Wang, X Zeng, R Wang, ... arXiv preprint arXiv:2304.11632, 2023	4	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors