Distributed distributional deterministic policy gradients G Barth-Maron, MW Hoffman, D Budden, W Dabney, D Horgan, D TB, ... arXiv preprint arXiv:1804.08617, 2018 | 643 | 2018 |
DeepMind control suite Y Tassa, Y Doron, A Muldal, T Erez, Y Li, D de Las Casas, D Budden, A Abdolmaleki, J Merel, A Lefrancq, et al. arXiv preprint arXiv:1801.00690 2 (6), 7, 2018 | 528* | 2018 |
dm_control: Software and tasks for continuous control S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez, J Merel, T Erez, ... Software Impacts 6, 100022, 2020 | 355 | 2020 |
miR-124 acts through CoREST to control onset of Sema3A sensitivity in navigating retinal growth cones ML Baudet, KH Zivraj, C Abreu-Goodger, A Muldal, J Armisen, ... Nature neuroscience 15 (1), 29-38, 2012 | 147 | 2012 |
A data-driven approach for learning to control computers PC Humphreys, D Raposo, T Pohlen, G Thornton, R Chhaparia, A Muldal, ... International Conference on Machine Learning, 9466-9482, 2022 | 77 | 2022 |
Imitating interactive intelligence J Abramson, A Ahuja, I Barr, A Brussee, F Carnevale, M Cassin, ... arXiv preprint arXiv:2012.05672, 2020 | 70 | 2020 |
Learning awareness models B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ... arXiv preprint arXiv:1804.06318, 2018 | 56 | 2018 |
Creating multimodal interactive agents with imitation and self-supervised learning DMIA Team, J Abramson, A Ahuja, A Brussee, F Carnevale, M Cassin, ... arXiv preprint arXiv:2112.03763, 2021 | 41 | 2021 |
Using neurogenetics and the warmth-gated ion channel TRPA1 to study the neural basis of behavior in Drosophila J Berni, AM Muldal, SR Pulver Journal of Undergraduate Neuroscience Education 9 (1), A5, 2010 | 30 | 2010 |
Improving multimodal interactive agents with reinforcement learning from human feedback J Abramson, A Ahuja, F Carnevale, P Georgiev, A Goldin, A Hung, ... arXiv preprint arXiv:2211.11602, 2022 | 27 | 2022 |
Clonal relationships impact neuronal tuning within a phylogenetically ancient vertebrate brain structure AM Muldal, TP Lillicrap, BA Richards, CJ Akerman Current Biology 24 (16), 1929-1933, 2014 | 14 | 2014 |
Physically embedded planning problems: New challenges for reinforcement learning M Mirza, A Jaegle, JJ Hunt, A Guez, S Tunyasuvunakool, A Muldal, ... arXiv preprint arXiv:2009.05524, 2020 | 12 | 2020 |
Intra-agent speech permits zero-shot task acquisition C Yan, F Carnevale, PI Georgiev, A Santoro, A Guy, A Muldal, CC Hung, ... Advances in Neural Information Processing Systems 35, 2423-2438, 2022 | 10 | 2022 |
Evaluating multimodal interactive agents J Abramson, A Ahuja, F Carnevale, P Georgiev, A Goldin, A Hung, ... arXiv preprint arXiv:2205.13274, 2022 | 3 | 2022 |
Controlling interactive agents using multi-modal inputs JS Abramson, A Ahuja, FJ Carnevale, PI Georgiev, CC Hung, TP Lillicrap, ... US Patent App. 18/077,194, 2023 | | 2023 |