Tadashi Kozuno

引用先

	すべて	2019 年以来
引用	357	350
h 指標	10	10
i10 指標	10	10

120

2019202020212022202320244 12 53 93 104 84

オープンアクセス

すべて表示

4 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Rémi MunosGoogle DeepMind確認したメールアドレス: inria.fr
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind確認したメールアドレス: meta.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)確認したメールアドレス: univ-lorraine.fr
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)確認したメールアドレス: univ-lille.fr
Nino VieillardGoogle DeepMind確認したメールアドレス: google.com
Pierre MénardOvGU Magdeburg確認したメールアドレス: inria.fr
Yunhao TangResearch Scientist, DeepMind確認したメールアドレス: columbia.edu
Kenji DoyaOkinawa Institute of Science and Technology確認したメールアドレス: oist.jp
Hiroki FurutaThe University of Tokyo確認したメールアドレス: weblab.t.u-tokyo.ac.jp
Shixiang Shane GuGoogle DeepMind確認したメールアドレス: google.com
Tatsuya MatsushimaThe University of Tokyo確認したメールアドレス: weblab.t.u-tokyo.ac.jp
Yutaka MatsuoProfessor, University of Tokyo確認したメールアドレス: weblab.t.u-tokyo.ac.jp
Mark RowlandResearch Scientist, Google DeepMind確認したメールアドレス: google.com
Wenhao YangStanford University確認したメールアドレス: stanford.edu
Eiji UchibeDept. of Brain Robot Interface, ATR Computational Neuroscience Labs.確認したメールアドレス: atr.jp
Csaba SzepesvariDeepMind & University of Alberta確認したメールアドレス: cs.ualberta.ca
Martha WhiteUniversity of Alberta確認したメールアドレス: ualberta.ca
Toshinori KitamuraThe University of Tokyo確認したメールアドレス: weblab.t.u-tokyo.ac.jp
Ryo YonetaniResearch Scientist at CyberAgent確認したメールアドレス: cyberagent.co.jp
Hugo SilvaUniversity of Alberta確認したメールアドレス: ualberta.ca

フォロー

Tadashi Kozuno

OMRON SINIC X

確認したメールアドレス: alumni.oist.jp - ホームページ

reinforcement learning machine learning neuroscience


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist The 34th Conference on Neural Information Processing Systems, 2020	111*	2020
Theoretical analysis of efficiency and robustness of softmax and gap-increasing operators in reinforcement learning T Kozuno, E Uchibe, K Doya The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	43	2019
Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall T Kozuno, P Ménard, R Munos, M Valko Advances in Neural Information Processing Systems 35, 2021	36*	2021
Greedification operators for policy optimization: Investigating forward and reverse kl divergences A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White Journal of Machine Learning Research 23 (253), 1-79, 2022	25	2022
Revisiting Peng's Q () for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ... The 38th International Conference on Machine Learning, 2021	22	2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning H Furuta, T Matsushima, T Kozuno, Y Matsuo, S Levine, O Nachum, ... The 38th International Conference on Machine Learning, 2021	19	2021
Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms H Furuta, T Kozuno, T Matsushima, Y Matsuo, SS Gu Advances in Neural Information Processing Systems 35, 2021	16*	2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation Y Tang, T Kozuno, M Rowland, R Munos, M Valko Advances in Neural Information Processing Systems 35, 2021	11	2021
Avoiding model estimation in robust markov decision processes with a generative model W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang arXiv preprint arXiv:2302.01248 23, 2023	10	2023
Confident Approximate Policy Iteration for Efficient Local Planning in -realizable MDPs G Weisz, A György, T Kozuno, C Szepesvári Advances in Neural Information Processing Systems 35, 25547-25559, 2022	10	2022
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints K Kasaura, S Miura, T Kozuno, R Yonetani, K Hoshino, Y Hosoe IEEE Robotics and Automation Letters, 2023	8	2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022	7	2022
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ... Transactions on Machine Learning Research, 2022	7	2022
Adapting to game trees in zero-sum imperfect information games C Fiegel, P Ménard, T Kozuno, R Munos, V Perchet, M Valko International Conference on Machine Learning, 10093-10135, 2023	6	2023
Variational oracle guiding for reinforcement learning D Han, T Kozuno, X Luo, ZY Chen, K Doya, Y Yang, D Li International Conference on Learning Representations, 2021	6	2021
Study of White-LED Using Amorphous Carbon Nitride Grown by RF-sputtering and ECR-plasma CVD T Kozuno, S Kishimoto, K Tachibana, K Itoh, Y Iwano, S Kunitsugu, ... Journal of Light & Visual Environment 35 (1), 86-89, 2011	6	2011
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning T Kozuno, D Han, K Doya arXiv preprint arXiv:1906.07586, 2019	3	2019
Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming T Kozuno, E Uchibe, K Doya arXiv preprint arXiv:1710.10866, 2017	3	2017
Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist H Nguyen, T Kozuno, CC Beltran-Hernandez, M Hamaya arXiv preprint arXiv:2402.18002, 2024	2	2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ... International Conference on Machine Learning, 17135-17175, 2023	2	2023

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者