Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 875 | 2024 |
Real-time user-guided image colorization with learned deep priors R Zhang, JY Zhu, P Isola, X Geng, AS Lin, T Yu, AA Efros arXiv preprint arXiv:1705.02999, 2017 | 819 | 2017 |
Automatic Goal Generation for Reinforcement Learning Agents C Florensa, D Held, X Geng, P Abbeel International Conference on Machine Learning, 1514-1523, 2018 | 587* | 2018 |
Open X-Embodiment: Robotic learning datasets and RT-X models A O'Neill, A Rehman, A Gupta, A Maddukuri, A Gupta, A Padalkar, A Lee, ... arXiv preprint arXiv:2310.08864, 2023 | 412* | 2023 |
Koala: A dialogue model for academic research X Geng, A Gudibande, H Liu, E Wallace, P Abbeel, S Levine, D Song Blog post, April 1, 6, 2023 | 206 | 2023 |
The false promise of imitating proprietary LLMs A Gudibande, E Wallace, C Snell, X Geng, H Liu, P Abbeel, S Levine, ... arXiv preprint arXiv:2305.15717, 2023 | 160 | 2023 |
Sequential modeling enables scalable learning for large vision models Y Bai, X Geng, K Mangalam, A Bar, AL Yuille, T Darrell, J Malik, AA Efros Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 133 | 2024 |
Deep reinforcement learning for tensegrity robot locomotion M Zhang, X Geng, J Bruce, K Caluwaerts, M Vespignani, V SunSpiral, ... 2017 IEEE International Conference on Robotics and Automation (ICRA), 634-641, 2017 | 130 | 2017 |
Rewriting history with inverse RL: Hindsight inference for policy improvement B Eysenbach, X Geng, S Levine, RR Salakhutdinov Advances in Neural Information Processing Systems 33, 14783-14795, 2020 | 107 | 2020 |
Multimodal masked autoencoders learn transferable representations X Geng, H Liu, L Lee, D Schuurmans, S Levine, P Abbeel arXiv preprint arXiv:2205.14204, 2022 | 104 | 2022 |
OpenLLaMA: An open reproduction of LLaMA X Geng, H Liu https://github.com/openlm-research/open_llama, 2023 | 101 | 2023 |
Conservative objective models for effective offline model-based optimization B Trabucco, A Kumar, X Geng, S Levine International Conference on Machine Learning, 10358-10368, 2021 | 100 | 2021 |
Dynamical distance learning for semi-supervised and unsupervised skill discovery K Hartikainen, X Geng, T Haarnoja, S Levine arXiv preprint arXiv:1907.08225, 2019 | 96 | 2019 |
Design-Bench: Benchmarks for data-driven offline model-based optimization B Trabucco, X Geng, A Kumar, S Levine International Conference on Machine Learning, 21658-21676, 2022 | 94 | 2022 |
Offline Q-learning on diverse multi-task data both scales and generalizes A Kumar, R Agarwal, X Geng, G Tucker, S Levine arXiv preprint arXiv:2211.15144, 2022 | 49 | 2022 |
Meta-reinforcement learning robust to distributional shift via model identification and experience relabeling R Mendonca, X Geng, C Finn, S Levine arXiv preprint arXiv:2006.07178, 2020 | 48 | 2020 |
Multi-stage cable routing through hierarchical imitation learning J Luo, C Xu, X Geng, G Feng, K Fang, L Tan, S Schaal, S Levine IEEE Transactions on Robotics, 2024 | 36 | 2024 |
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold A Setlur, S Garg, X Geng, N Garg, V Smith, A Kumar arXiv preprint arXiv:2406.14532, 2024 | 22 | 2024 |
Dynamical distance learning for unsupervised and semi-supervised skill discovery K Hartikainen, X Geng, T Haarnoja, S Levine arXiv preprint arXiv:1907.08225, 2019 | 20 | 2019 |
Action-quantized offline reinforcement learning for robotic skill learning J Luo, P Dong, J Wu, A Kumar, X Geng, S Levine Conference on Robot Learning, 1348-1361, 2023 | 18 | 2023 |