Human-level control through deep reinforcement learning V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ... nature 518 (7540), 529-533, 2015 | 13139 | 2015 |
Massively parallel methods for deep reinforcement learning A Nair, P Srinivasan, S Blackwell, C Alcicek, R Fearon, A De Maria, ... arXiv preprint arXiv:1507.04296, 2015 | 326 | 2015 |
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019 | 319 | 2019 |
Vector-based navigation using grid-like representations in artificial agents A Banino, C Barry, B Uria, C Blundell, T Lillicrap, P Mirowski, A Pritzel, ... Nature 557 (7705), 429-433, 2018 | 299 | 2018 |
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016 | 281 | 2016 |
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel arXiv preprint arXiv:1707.06600, 2017 | 89 | 2017 |
Psychlab: a psychology laboratory for deep reinforcement learning agents JZ Leibo, CM d'Autume, D Zoran, D Amos, C Beattie, K Anderson, ... arXiv preprint arXiv:1801.08116, 2018 | 33 | 2018 |
G Bellemare V Mnih, K Kavukcuoglu, D Silver, A Rusu, J Veness M., Graves, A., Riedmiller, M., K Fidjeland, A., Ostrovski, G., Petersen, S …, 2015 | 9 | 2015 |
Uncovering surprising behaviors in reinforcement learning via worst-case analysis A Ruderman, R Everett, B Sikder, H Soyer, J Uesato, A Kumar, C Beattie, ... | 5 | 2018 |
DeepMind Lab2D C Beattie, T Köppe, EA Duéñez-Guzmán, JZ Leibo arXiv preprint arXiv:2011.07027, 2020 | | 2020 |
Vector-based Navigation using Grid-like Representations in Artificial Agents. A Pritzel, A Banino, B Uria, BC Zhang, C Barry, C Blundell, C Beattie, ... | | 2018 |
代写 RC algorithm Scheme game math scala parallel AI statistic software network Bayesian GPU Go react theory Humanlevel performance in firstperson multiplayer games with … M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... | | |