Tadashi Kozuno

Cited by

	All	Since 2019
Citations	357	350
h-index	10	10
i10-index	10	10

Citations per year: 2019: 4; 2020: 12; 2021: 53; 2022: 93; 2023: 104; 2024: 84

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Rémi Munos	Google DeepMind	Verified email at inria.fr
Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	Verified email at meta.com
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Nino Vieillard	Google DeepMind	Verified email at google.com
Pierre Ménard	OvGU Magdeburg	Verified email at inria.fr
Yunhao Tang	Research Scientist, DeepMind	Verified email at columbia.edu
Kenji Doya	Okinawa Institute of Science and Technology	Verified email at oist.jp
Hiroki Furuta	The University of Tokyo	Verified email at weblab.t.u-tokyo.ac.jp
Shixiang Shane Gu	Google DeepMind	Verified email at google.com
Tatsuya Matsushima	The University of Tokyo	Verified email at weblab.t.u-tokyo.ac.jp
Yutaka Matsuo	Professor, University of Tokyo	Verified email at weblab.t.u-tokyo.ac.jp
Mark Rowland	Research Scientist, Google DeepMind	Verified email at google.com
Wenhao Yang	Stanford University	Verified email at stanford.edu
Eiji Uchibe	Dept. of Brain Robot Interface, ATR Computational Neuroscience Labs.	Verified email at atr.jp
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Martha White	University of Alberta	Verified email at ualberta.ca
Toshinori Kitamura	The University of Tokyo	Verified email at weblab.t.u-tokyo.ac.jp
Ryo Yonetani	Research Scientist at CyberAgent	Verified email at cyberagent.co.jp
Hugo Silva	University of Alberta	Verified email at ualberta.ca

Tadashi Kozuno

OMRON SINIC X

Verified email at alumni.oist.jp

reinforcement learning, machine learning, neuroscience


Title	Authors	Venue	Cited by	Year
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning	N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist	The 34th Conference on Neural Information Processing Systems, 2020	111*	2020
Theoretical analysis of efficiency and robustness of softmax and gap-increasing operators in reinforcement learning	T Kozuno, E Uchibe, K Doya	The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	43	2019
Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall	T Kozuno, P Ménard, R Munos, M Valko	Advances in Neural Information Processing Systems 35, 2021	36*	2021
Greedification operators for policy optimization: Investigating forward and reverse KL divergences	A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White	Journal of Machine Learning Research 23 (253), 1-79, 2022	25	2022
Revisiting Peng's Q(λ) for Modern Reinforcement Learning	T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ...	The 38th International Conference on Machine Learning, 2021	22	2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning	H Furuta, T Matsushima, T Kozuno, Y Matsuo, S Levine, O Nachum, ...	The 38th International Conference on Machine Learning, 2021	19	2021
Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms	H Furuta, T Kozuno, T Matsushima, Y Matsuo, SS Gu	Advances in Neural Information Processing Systems 35, 2021	16*	2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation	Y Tang, T Kozuno, M Rowland, R Munos, M Valko	Advances in Neural Information Processing Systems 35, 2021	11	2021
Avoiding model estimation in robust Markov decision processes with a generative model	W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang	arXiv preprint arXiv:2302.01248 23, 2023	10	2023
Confident Approximate Policy Iteration for Efficient Local Planning in q^π-realizable MDPs	G Weisz, A György, T Kozuno, C Szepesvári	Advances in Neural Information Processing Systems 35, 25547-25559, 2022	10	2022
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints	K Kasaura, S Miura, T Kozuno, R Yonetani, K Hoshino, Y Hosoe	IEEE Robotics and Automation Letters, 2023	8	2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal	T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ...	arXiv preprint arXiv:2205.14211, 2022	7	2022
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL	H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ...	Transactions on Machine Learning Research, 2022	7	2022
Adapting to game trees in zero-sum imperfect information games	C Fiegel, P Ménard, T Kozuno, R Munos, V Perchet, M Valko	International Conference on Machine Learning, 10093-10135, 2023	6	2023
Variational oracle guiding for reinforcement learning	D Han, T Kozuno, X Luo, ZY Chen, K Doya, Y Yang, D Li	International Conference on Learning Representations, 2021	6	2021
Study of White-LED Using Amorphous Carbon Nitride Grown by RF-sputtering and ECR-plasma CVD	T Kozuno, S Kishimoto, K Tachibana, K Itoh, Y Iwano, S Kunitsugu, ...	Journal of Light & Visual Environment 35 (1), 86-89, 2011	6	2011
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning	T Kozuno, D Han, K Doya	arXiv preprint arXiv:1906.07586, 2019	3	2019
Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming	T Kozuno, E Uchibe, K Doya	arXiv preprint arXiv:1710.10866, 2017	3	2017
Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist	H Nguyen, T Kozuno, CC Beltran-Hernandez, M Hamaya	arXiv preprint arXiv:2402.18002, 2024	2	2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice	T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ...	International Conference on Machine Learning, 17135-17175, 2023	2	2023
