Follow
Weixun Wang
Weixun Wang
Alibaba
Verified email at tju.edu.cn - Homepage
Title
Cited by
Cited by
Year
Multi-Agent Game Abstraction via Graph Attention Neural Network
Y Liu*, W Wang*, Y Hu, J Hao, X Chen, Y Gao
AAAI 2020, 2020
2902020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Y Hu, W Wang, H Jia, Y Wang, Y Chen, J Hao, F Wu, C Fan
Advances in Neural Information Processing Systems 33, 2020
2242020
The 37 implementation details of proximal policy optimization
S Huang, RFJ Dossa, A Raffin, A Kanervisto, W Wang
ICLR Blog Track, 2022
1382022
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
W Wang, T Yang, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
AAAI 2020, 2020
1312020
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-agent Reinforcement Learning
J Hu, W Siying, S Jiang, W Wang
ICLR Blogposts, 2023
1092023
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas
W Wang, J Hao, Y Wang, M Taylor
Proceedings of the First International Conference on Distributed Artificial …, 2019
59*2019
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
W Wang, T Yang, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
ICLR 2020, 2020
462020
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
T Yang*, W Wang*, H Tang*, HAO Jianye, Z Meng, H Mao, D Li, W Liu, ...
Thirty-Fifth Conference on Neural Information Processing Systems, 2021
44*2021
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
S Hu, Y Zhong, M Gao, W Wang, H Dong, X Liang, Z Li, X Chang, Y Yang
Journal of Machine Learning Research 24 (315), 1-23, 2023
42*2023
Individual Reward Assisted Multi-Agent Reinforcement Learning
L Wang, Y Zhang, Y Hu, W Wang, C Zhang, Y Gao, J Hao, T Lv, C Fan
International Conference on Machine Learning, 23417-23432, 2022
412022
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
P Zhang, J Hao, W Wang, H Tang, Y Ma, Y Duan, Y Zheng
IJCAI2020, 2020
402020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, W Liu, ...
IJCAI 2020, 2020
402020
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks
HAO Jianye, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang
The Eleventh International Conference on Learning Representations, 2023
39*2023
A2C is a special case of PPO
S Huang, A Kanervisto, A Raffin, W Wang, S Ontañón, RFJ Dossa
arXiv preprint arXiv:2205.09123, 2022
312022
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems
X Hao*, W Wang*, J Hao, Y Yang
Proceedings of the 18th International Conference on Autonomous Agents and …, 2019
292019
Cooperative Multi-Agent Transfer Learning with Coalition Pattern Decomposition
T Zhou, F Zhang, K Shao, Z Dai, K Li, W Huang, W Wang, B Wang, D Li, ...
IEEE Transactions on Games, 2023
28*2023
The N+ Implementation Details of RLHF with PPO: A Case Study on TL; DR Summarization
S Huang, M Noukhovitch, A Hosseini, K Rasul, W Wang, L Tunstall
arXiv preprint arXiv:2403.17031, 2024
232024
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
J Hu, X Wu, W Wang, D Zhang, Y Cao
arXiv preprint arXiv:2405.11143, 2024
222024
Background-free upconversion-encoded microspheres for mycotoxin detection based on a rapid visualization method
M Yang, M Cui, W Wang, Y Yang, J Chang, J Hao, H Wang
Analytical and bioanalytical chemistry 412, 81-91, 2020
222020
Learning Adaptive Display Exposure for Real-Time Advertising
W Wang, J Jin, J Hao, C Chen, C Yu, W Zhang, J Wang, X Hao, Y Wang, ...
CIKM 2019, 2019
22*2019
The system can't perform the operation now. Try again later.
Articles 1–20