Yan Duan
Embodied Intelligence, UC Berkeley
Verified email at berkeley.edu
Title · Cited by · Year
InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets
X Chen, Y Duan, R Houthooft, J Schulman, I Sutskever, P Abbeel
Advances in Neural Information Processing Systems, 2172-2180, 2016
Cited by: 1625 · Year: 2016
Benchmarking deep reinforcement learning for continuous control
Y Duan, X Chen, R Houthooft, J Schulman, P Abbeel
International Conference on Machine Learning, 1329-1338, 2016
Cited by: 718 · Year: 2016
VIME: Variational information maximizing exploration
R Houthooft, X Chen, Y Duan, J Schulman, F De Turck, P Abbeel
Advances in Neural Information Processing Systems, 1109-1117, 2016
Cited by: 315* · Year: 2016
Motion planning with sequential convex optimization and convex collision checking
J Schulman, Y Duan, J Ho, A Lee, I Awwal, H Bradlow, J Pan, S Patil, ...
The International Journal of Robotics Research 33 (9), 1251-1270, 2014
Cited by: 301 · Year: 2014
Variational lossy autoencoder
X Chen, DP Kingma, T Salimans, Y Duan, P Dhariwal, J Schulman, ...
arXiv preprint arXiv:1611.02731, 2016
Cited by: 276 · Year: 2016
Deep Spatial Autoencoders for Visuomotor Learning
C Finn, XY Tan, Y Duan, T Darrell, S Levine, P Abbeel
International Conference on Robotics and Automation (ICRA), 2016
Cited by: 274* · Year: 2016
One-shot imitation learning
Y Duan, M Andrychowicz, B Stadie, OAIJ Ho, J Schneider, I Sutskever, ...
Advances in Neural Information Processing Systems, 1087-1098, 2017
Cited by: 266 · Year: 2017
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning
Y Duan, J Schulman, X Chen, PL Bartlett, I Sutskever, P Abbeel
arXiv preprint arXiv:1611.02779, 2016
Cited by: 252 · Year: 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
H Tang, R Houthooft, D Foote, A Stooke, X Chen, Y Duan, J Schulman, ...
arXiv preprint arXiv:1611.04717, 2016
Cited by: 226 · Year: 2016
Adversarial attacks on neural network policies
S Huang, N Papernot, I Goodfellow, Y Duan, P Abbeel
arXiv preprint arXiv:1702.02284, 2017
Cited by: 208 · Year: 2017
Stochastic neural networks for hierarchical reinforcement learning
C Florensa, Y Duan, P Abbeel
arXiv preprint arXiv:1704.03012, 2017
Cited by: 131 · Year: 2017
Model-ensemble trust-region policy optimization
T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel
arXiv preprint arXiv:1802.10592, 2018
Cited by: 79 · Year: 2018
Variance reduction for policy gradient with action-dependent factorized baselines
C Wu, A Rajeswaran, Y Duan, V Kumar, AM Bayen, S Kakade, I Mordatch, ...
arXiv preprint arXiv:1803.07246, 2018
Cited by: 47 · Year: 2018
The Importance of Sampling in Meta-Reinforcement Learning
B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ...
Advances in Neural Information Processing Systems, 9299-9309, 2018
Cited by: 37* · Year: 2018
Flow++: Improving flow-based generative models with variational dequantization and architecture design
J Ho, X Chen, A Srinivas, Y Duan, P Abbeel
arXiv preprint arXiv:1902.00275, 2019
Cited by: 28 · Year: 2019
Sigma hulls for Gaussian belief space planning for imprecise articulated robots amid obstacles
A Lee, Y Duan, S Patil, J Schulman, Z McCarthy, J Van Den Berg, ...
2013 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2013
Cited by: 26 · Year: 2013
Gaussian belief space planning with discontinuities in sensing domains
S Patil, Y Duan, J Schulman, K Goldberg, P Abbeel
2014 IEEE International Conference on Robotics and Automation (ICRA), 6483-6490, 2014
Cited by: 24 · Year: 2014
Planning locally optimal, curvature-constrained trajectories in 3D using sequential convex optimization
Y Duan, S Patil, J Schulman, K Goldberg, P Abbeel
2014 IEEE International Conference on Robotics and Automation (ICRA), 5889-5895, 2014
Cited by: 19 · Year: 2014
Selectivity estimation with deep likelihood models
Z Yang, E Liang, A Kamsetty, C Wu, Y Duan, X Chen, P Abbeel, ...
arXiv preprint arXiv:1905.04278, 2019
Cited by: 4 · Year: 2019
Evaluating protein transfer learning with TAPE
R Rao, N Bhattacharya, N Thomas, Y Duan, P Chen, J Canny, P Abbeel, ...
Advances in Neural Information Processing Systems, 9686-9698, 2019
Cited by: 4 · Year: 2019