Sigmoid-weighted linear units for neural network function approximation in reinforcement learning S Elfwing, E Uchibe, K Doya Neural Networks 107, 3-11, 2018 | 1736 | 2018 |
Deep learning, reinforcement learning, and world models Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver, M Sugiyama, E Uchibe, ... Neural Networks 152, 267-275, 2022 | 335 | 2022 |
Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development M Asada, E Uchibe, K Hosoda Artificial Intelligence 110 (2), 275-292, 1999 | 209 | 1999 |
Deep reinforcement learning with smooth policy update: Application to robotic cloth manipulation Y Tsurumine, Y Cui, E Uchibe, T Matsubara Robotics and Autonomous Systems 112, 72-83, 2019 | 192 | 2019 |
Behavior coordination for a mobile robot using modular reinforcement learning E Uchibe, M Asada, K Hosoda Proceedings of IEEE/RSJ International Conference on Intelligent Robots and …, 1996 | 128 | 1996 |
Coordination of multiple behaviors acquired by a vision-based reinforcement learning M Asada, E Uchibe, S Noda, S Tawaratsumida, K Hosoda Proceedings of IEEE/RSJ International Conference on Intelligent Robots and …, 1994 | 96 | 1994 |
The cyber rodent project: Exploration of adaptive mechanisms for self-preservation and self-reproduction K Doya, E Uchibe Adaptive Behavior 13 (2), 149-160, 2005 | 88 | 2005 |
Competitive-cooperative-concurrent reinforcement learning with importance sampling E Uchibe, K Doya Proc. of International Conference on Simulation of Adaptive Behavior: From …, 2004 | 73 | 2004 |
Constrained reinforcement learning from intrinsic and extrinsic rewards E Uchibe, K Doya 2007 IEEE 6th International Conference on Development and Learning, 163-168, 2007 | 72 | 2007 |
Constrained Deep Q-learning gradually approaching ordinary Q-learning S Ohnishi, E Uchibe, K Nakanishi, S Ishii Frontiers in Neurorobotics 13, 103, 2019 | 71 | 2019 |
Biologically inspired embodied evolution of survival S Elfwing, E Uchibe, K Doya, HI Christensen 2005 IEEE Congress on Evolutionary Computation 3, 2210-2216, 2005 | 54 | 2005 |
Modular deep reinforcement learning from reward and punishment for robot navigation J Wang, S Elfwing, E Uchibe Neural Networks 135, 115-126, 2021 | 53 | 2021 |
Model-free deep inverse reinforcement learning by logistic regression E Uchibe Neural Processing Letters 47 (3), 891-905, 2018 | 50 | 2018 |
Evolutionary development of hierarchical learning structures S Elfwing, E Uchibe, K Doya, HI Christensen IEEE Transactions on Evolutionary Computation 11 (2), 249-264, 2007 | 49 | 2007 |
Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning T Kozuno, E Uchibe, K Doya The 22nd International Conference on Artificial Intelligence and Statistics …, 2019 | 44 | 2019 |
Expected energy-based restricted Boltzmann machine for classification S Elfwing, E Uchibe, K Doya Neural Networks 64, 29-38, 2015 | 44 | 2015 |
Cooperative behavior acquisition in multi-mobile robots environment by reinforcement learning based on state vector estimation E Uchibe, M Asada, K Hosoda Proceedings. 1998 IEEE International Conference on Robotics and Automation …, 1998 | 44 | 1998 |
Utilizing the natural gradient in temporal difference reinforcement learning with eligibility traces T Morimura, E Uchibe, K Doya International Symposium on Information Geometry and Its Applications, 256-263, 2005 | 43 | 2005 |
Co-evolution for cooperative behavior acquisition in a multiple mobile robot environment E Uchibe, M Nakamura, M Asada Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots …, 1998 | 43 | 1998 |