QIANG FU
Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Year
Mastering complex control in MOBA games with deep reinforcement learning
D Ye, Z Liu, M Sun, B Shi, P Zhao, H Wu, H Yu, S Yang, X Wu, Q Guo, ...
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 6672-6679, 2020
Cited by: 225 | Year: 2020
Towards playing full MOBA games with deep reinforcement learning
D Ye, G Chen, W Zhang, S Chen, B Yuan, B Liu, J Chen, Z Liu, F Qiu, ...
Advances in Neural Information Processing Systems 33, 621-632, 2020
Cited by: 130 | Year: 2020
Supervised learning achieves human-level performance in MOBA games: A case study of Honor of Kings
D Ye, G Chen, P Zhao, F Qiu, B Yuan, W Zhang, S Chen, M Sun, X Li, S Li, ...
IEEE Transactions on Neural Networks and Learning Systems 33 (3), 908-918, 2020
Cited by: 39 | Year: 2020
Which heroes to pick? Learning to draft in MOBA games with neural networks and tree search
S Chen, M Zhu, D Ye, W Zhang, Q Fu, W Yang
IEEE Transactions on Games 13 (4), 410-421, 2021
Cited by: 20 | Year: 2021
Actor-critic policy optimization in a large-scale imperfect-information game
H Fu, W Liu, S Wu, Y Wang, T Yang, K Li, J Xing, B Li, B Ma, Q Fu, Y Wei
International Conference on Learning Representations, 2021
Cited by: 16 | Year: 2021
MineRL Diamond 2021 competition: Overview, results, and lessons learned
A Kanervisto, S Milani, K Ramanauskas, N Topin, Z Lin, J Li, J Shi, D Ye, ...
NeurIPS 2021 Competitions and Demonstrations Track, 13-28, 2022
Cited by: 15 | Year: 2022
JueWu-MC: Playing Minecraft with sample-efficient hierarchical reinforcement learning
Z Lin, J Li, J Shi, D Ye, Q Fu, W Yang
arXiv preprint arXiv:2112.04907, 2021
Cited by: 15 | Year: 2021
MapGo: Model-assisted policy optimization for goal-oriented tasks
M Zhu, M Liu, J Shen, Z Zhang, S Chen, W Zhang, D Ye, Y Yu, Q Fu, ...
arXiv preprint arXiv:2105.06350, 2021
Cited by: 12 | Year: 2021
Honor of Kings Arena: an environment for generalization in competitive reinforcement learning
H Wei, J Chen, X Ji, H Qin, M Deng, S Li, L Wang, W Zhang, Y Yu, L Linc, ...
Advances in Neural Information Processing Systems 35, 11881-11892, 2022
Cited by: 6 | Year: 2022
Learning diverse policies in MOBA games via macro-goals
Y Gao, B Shi, X Du, L Wang, G Chen, Z Lian, F Qiu, G Han, W Wang, D Ye, ...
Advances in Neural Information Processing Systems 34, 16171-16182, 2021
Cited by: 6 | Year: 2021
Boosting offline reinforcement learning with residual generative modeling
H Wei, D Ye, Z Liu, H Wu, B Yuan, Q Fu, W Yang, Z Li
arXiv preprint arXiv:2106.10411, 2021
Cited by: 6 | Year: 2021
Revisiting discrete soft actor-critic
H Zhou, Z Lin, J Li, Q Fu, W Yang, D Ye
arXiv preprint arXiv:2209.10081, 2022
Cited by: 5 | Year: 2022
Greedy when Sure and Conservative when Uncertain about the Opponents
H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ...
International Conference on Machine Learning, 6829-6848, 2022
Cited by: 3 | Year: 2022
Combining Tree Search and Action Prediction for State-of-the-Art Performance in DouDiZhu
Y Zhang, D Yan, B Shi, H Fu, Q Fu, H Su, J Zhu, N Chen
IJCAI, 3413-3419, 2021
Cited by: 3 | Year: 2021
Quality-Similar Diversity via Population Based Reinforcement Learning
S Wu, J Yao, H Fu, Y Tian, C Qian, Y Yang, Q Fu, Y Wei
The Eleventh International Conference on Learning Representations, 2022
Cited by: 2 | Year: 2022
RLogist: fast observation strategy on whole-slide images with deep reinforcement learning
B Zhao, J Zhang, D Ye, J Cao, X Han, Q Fu, W Yang
Proceedings of the AAAI Conference on Artificial Intelligence 37 (3), 3570-3578, 2023
Cited by: 1 | Year: 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective
Y Gao, F Liu, L Wang, Z Lian, W Wang, S Li, X Wang, X Zeng, R Wang, ...
arXiv preprint arXiv:2304.11632, 2023
Cited by: 1 | Year: 2023
AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms
H Xu, K Li, H Fu, Q Fu, J Xing
Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5244-5251, 2022
Cited by: 1 | Year: 2022
PreCo: Enhancing Generalization in Co-Design of Modular Soft Robots via Brain-Body Pre-Training
Y Wang, S Wu, T Zhang, Y Chang, Q Fu, H Fu, X Wang
7th Annual Conference on Robot Learning, 2023
Year: 2023
RLTF: Reinforcement Learning from Unit Test Feedback
J Liu, Y Zhu, K Xiao, Q Fu, X Han, W Yang, D Ye
arXiv preprint arXiv:2307.04349, 2023
Year: 2023