Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor T Haarnoja, A Zhou, P Abbeel, S Levine International conference on machine learning, 1861-1870, 2018 | 9786 | 2018 |
Soft actor-critic algorithms and applications T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 2018 | 3060 | 2018 |
Reinforcement learning with deep energy-based policies T Haarnoja, H Tang, P Abbeel, S Levine International conference on machine learning, 1352-1361, 2017 | 1538 | 2017 |
Learning to walk via deep reinforcement learning T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine arXiv preprint arXiv:1812.11103, 2018 | 568 | 2018 |
Composable deep reinforcement learning for robotic manipulation T Haarnoja, V Pong, A Zhou, M Dalal, P Abbeel, S Levine 2018 IEEE international conference on robotics and automation (ICRA), 6244-6251, 2018 | 296 | 2018 |
Backprop KF: Learning discriminative deterministic state estimators T Haarnoja, A Ajay, S Levine, P Abbeel Advances in neural information processing systems 29, 2016 | 244 | 2016 |
Latent space policies for hierarchical reinforcement learning T Haarnoja, K Hartikainen, P Abbeel, S Levine International Conference on Machine Learning, 1851-1860, 2018 | 228 | 2018 |
From motor control to team play in simulated humanoid football S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022 | 119 | 2022 |
Dynamical distance learning for semi-supervised and unsupervised skill discovery K Hartikainen, X Geng, T Haarnoja, S Levine arXiv preprint arXiv:1907.08225, 2019 | 115* | 2019 |
Learning agile soccer skills for a bipedal robot with deep reinforcement learning T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, J Humplik, ... Science Robotics 9 (89), eadi8022, 2024 | 101 | 2024 |
Imitate and repurpose: Learning reusable robot movement skills from human and animal behaviors S Bohez, S Tunyasuvunakool, P Brakel, F Sadeghi, L Hasenclever, ... arXiv preprint arXiv:2203.17138, 2022 | 46 | 2022 |
NeRF2Real: Sim2real transfer of vision-guided bipedal motion skills using neural radiance fields A Byravan, J Humplik, L Hasenclever, A Brussee, F Nori, T Haarnoja, ... 2023 IEEE International Conference on Robotics and Automation (ICRA), 9362-9369, 2023 | 42 | 2023 |
Acquiring diverse robot skills via maximum entropy deep reinforcement learning T Haarnoja University of California, Berkeley, 2018 | 33 | 2018 |
Towards real robot learning in the wild: A case study in bipedal locomotion M Bloesch, J Humplik, V Patraucean, R Hafner, T Haarnoja, A Byravan, ... Conference on Robot Learning, 1502-1511, 2022 | 24 | 2022 |
Passive THz imaging system for stand-off identification of concealed objects: results from a turn-key 16 pixel imager A Luukanen, L Grönberg, T Haarnoja, P Helistö, K Kataja, M Leivo, ... Passive Millimeter-Wave Imaging Technology XI 6948, 164-172, 2008 | 21 | 2008 |
Magnetic bearing as switched reluctance motor-feasibility study for bearingless switched reluctance motor T Halmeaho, T Haarnoja, A Manninen, J Pippuri, J Keränen, K Tammi 2013 International Electric Machines & Drives Conference, 401-408, 2013 | 13 | 2013 |
Exact LTP Representation of the Generalized Periodic-Reference FxLMS Algorithm T Haarnoja, K Tammi, K Zenger IEEE transactions on signal processing 62 (1-4), 121-130, 2014 | 8 | 2014 |
Passive broadband terahertz camera for stand-off concealed threat identification using superconducting antenna-coupled microbolometers A Luukanen, L Grönberg, T Haarnoja, P Helistö, M Leivo, A Rautiainen, ... 2008 38th European Microwave Conference, 943-946, 2008 | 8 | 2008 |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement T Haarnoja, A Zhou, P Abbeel, S Levine Proceedings of the 35th International Conference on Machine Learning. July …, 1861 | 7 | 2018 |
Forgetting and imbalance in robot lifelong learning with off-policy data W Zhou, S Bohez, J Humplik, N Heess, A Abdolmaleki, D Rao, ... Conference on Lifelong Learning Agents, 294-309, 2022 | 6 | 2022 |