Yan Duan

引用先

	すべて	2019 年以来
引用	16801	14241
h 指標	21	20
i10 指標	23	22

3000

1500

750

2250

201520162017201820192020202120222023202452 174 684 1569 2180 2614 2997 2874 2775 798

オープンアクセス

すべて表示

8 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Pieter AbbeelUC Berkeley | Covariant確認したメールアドレス: cs.berkeley.edu
(Peter) Xi Chencovariant.ai | UC Berkeley確認したメールアドレス: berkeley.edu
John SchulmanResearch Scientist, OpenAI確認したメールアドレス: openai.com
Rein HouthooftNetflix Research確認したメールアドレス: netflix.com
Ilya SutskeverCo-Founder and Chief Scientist of OpenAI確認したメールアドレス: openai.com
Jonathan Ho確認したメールアドレス: berkeley.edu
Haoran TangPhD student in Applied Mathematics; University of California, Berkeley確認したメールアドレス: math.berkeley.edu
Ken GoldbergProfessor, UC Berkeley and UCSF確認したメールアドレス: berkeley.edu
Sachin PatilNvidia確認したメールアドレス: nvidia.com
Ian GoodfellowDeepMind確認したメールアドレス: deepmind.com
Nicolas PapernotUniversity of Toronto and Vector Institute確認したメールアドレス: utoronto.ca
Alex X. LeeResearch Scientist, Google DeepMind確認したメールアドレス: google.com
Carlos FlorensaPhD from University of California at Berkeley確認したメールアドレス: berkeley.edu
Sergey LevineUC Berkeley, Physical Intelligence確認したメールアドレス: eecs.berkeley.edu
Trevor DarrellProfessor of Computer Science, U.C. Berkeley確認したメールアドレス: eecs.berkeley.edu
Peter BartlettProfessor, EECS and Statistics, UC Berkeley確認したメールアドレス: cs.berkeley.edu
Jia PanComputer Science, The University of Hong Kong確認したメールアドレス: cs.hku.hk
Ibrahim AwwalPhD Student in Electrical and Computer Engineering, UC San Diego確認したメールアドレス: eng.ucsd.edu
Diederik P. KingmaResearch Scientist, Google Brain確認したメールアドレス: google.com
Prafulla DhariwalResearcher, OpenAI確認したメールアドレス: openai.com

フォロー

Yan Duan

Covariant.AI

確認したメールアドレス: covariant.ai - ホームページ

Robotics Machine Learning Reinforcement Learning Meta Learning


タイトル引用回数順公開年順タイトル順	引用先引用先	年
InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets X Chen, Y Duan, R Houthooft, J Schulman, I Sutskever, P Abbeel Advances in Neural Information Processing Systems, 2172-2180, 2016	5183	2016
Benchmarking deep reinforcement learning for continuous control Y Duan, X Chen, R Houthooft, J Schulman, P Abbeel International conference on machine learning, 1329-1338, 2016	1970	2016
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning Y Duan, J Schulman, X Chen, PL Bartlett, I Sutskever, P Abbeel arXiv preprint arXiv:1611.02779, 2016	1072	2016
Adversarial attacks on neural network policies S Huang, N Papernot, I Goodfellow, Y Duan, P Abbeel arXiv preprint arXiv:1702.02284, 2017	930	2017
Vime: Variational information maximizing exploration R Houthooft, X Chen, Y Duan, J Schulman, F De Turck, P Abbeel Advances in neural information processing systems 29, 2016	913	2016
Motion planning with sequential convex optimization and convex collision checking J Schulman, Y Duan, J Ho, A Lee, I Awwal, H Bradlow, J Pan, S Patil, ... The International Journal of Robotics Research 33 (9), 1251-1270, 2014	823	2014
Variational lossy autoencoder X Chen, DP Kingma, T Salimans, Y Duan, P Dhariwal, J Schulman, ... arXiv preprint arXiv:1611.02731, 2016	764	2016
Evaluating protein transfer learning with TAPE R Rao, N Bhattacharya, N Thomas, Y Duan, P Chen, J Canny, P Abbeel, ... Advances in neural information processing systems 32, 2019	753	2019
One-shot imitation learning Y Duan, M Andrychowicz, B Stadie, OAI Jonathan Ho, J Schneider, ... Advances in neural information processing systems 30, 2017	749	2017
Deep Spatial Autoencoders for Visuomotor Learning C Finn, XY Tan, Y Duan, T Darrell, S Levine, P Abbeel International Conference on Robotics and Automation (ICRA), 2016	699*	2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning H Tang, R Houthooft, D Foote, A Stooke, X Chen, Y Duan, J Schulman, ... arXiv preprint arXiv:1611.04717, 2016	664	2016
Model-ensemble trust-region policy optimization T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel arXiv preprint arXiv:1802.10592, 2018	501	2018
Flow++: Improving flow-based generative models with variational dequantization and architecture design J Ho, X Chen, A Srinivas, Y Duan, P Abbeel International conference on machine learning, 2722-2730, 2019	454	2019
Stochastic neural networks for hierarchical reinforcement learning C Florensa, Y Duan, P Abbeel arXiv preprint arXiv:1704.03012, 2017	408	2017
Deep unsupervised cardinality estimation Z Yang, E Liang, A Kamsetty, C Wu, Y Duan, X Chen, P Abbeel, ... arXiv preprint arXiv:1905.04278, 2019	211	2019
Variance reduction for policy gradient with action-dependent factorized baselines C Wu, A Rajeswaran, Y Duan, V Kumar, AM Bayen, S Kakade, I Mordatch, ... arXiv preprint arXiv:1803.07246, 2018	169	2018
The Importance of Sampling in Meta-Reinforcement Learning B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ... Advances in Neural Information Processing Systems, 9299-9309, 2018	160*	2018
NeuroCard: one cardinality estimator for all tables Z Yang, A Kamsetty, S Luan, E Liang, Y Duan, X Chen, I Stoica arXiv preprint arXiv:2006.08109, 2020	145	2020
Attacking machine learning with adversarial examples I Goodfellow, N Papernot, S Huang, Y Duan, P Abbeel, J Clark OpenAI Blog 24, 1, 2017	76	2017
Sigma hulls for gaussian belief space planning for imprecise articulated robots amid obstacles A Lee, Y Duan, S Patil, J Schulman, Z McCarthy, J Van Den Berg, ... 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2013	45	2013

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者