Yinlam Chow

Cited by

	All	Since 2019
Citations	4512	4099
h-index	26	25
i10-index	43	42

Citations per year
Year	Citations
2015	35
2016	47
2017	80
2018	110
2019	273
2020	511
2021	710
2022	877
2023	960
2024	763

Public access

7 articles available, 0 articles not available (based on funding mandates)

Co-authors

Ofir Nachum	OpenAI	Verified email at openai.com
Marco Pavone	Stanford University and NVIDIA	Verified email at stanford.edu
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia	Verified email at technion.ac.il
Aviv Tamar	Technion	Verified email at technion.ac.il
Jiyan Yang	Stanford University	Verified email at stanford.edu
Junjie Qin	Assistant Professor, Purdue University	Verified email at purdue.edu
Ram Rajagopal	Associate Professor, Stanford University	Verified email at stanford.edu
Lucas Janson	Associate Professor, Harvard University Department of Statistics	Verified email at fas.harvard.edu
Marek Petrik	University of New Hampshire	Verified email at cs.unh.edu
Mehrdad Farajtabar	Research Scientist at Apple	Verified email at apple.com
Stefano Carpin	Professor, University of California, Merced	Verified email at ucmerced.edu
Sumeet Katariya	Amazon	Verified email at wisc.edu
Alan Malek	MIT	Verified email at mit.edu
Sumeet Singh	Research Scientist, Google Brain Robotics	Verified email at google.com
Anirudha Majumdar	Associate Professor, Princeton University & Visiting Research Scientist, Google DeepMind	Verified email at princeton.edu
Christopher Ré	Computer Science, Stanford University	Verified email at cs.stanford.edu
Bo Liu	PhD, AAAI SM, IEEE SM	Verified email at cs.umass.edu
Brian M Sadler	The University of Texas at Austin	Verified email at ieee.org
Martin Corless	Aeronautics & Astronautics, Purdue University	Verified email at purdue.edu

Yinlam Chow

Research Scientist, Google Research

Verified email at google.com

Reinforcement learning, Optimal Control, Sequential Decision Making, Robust Control, Nonlinear Systems


Title	Cited by	Year
A Lyapunov-based approach to safe reinforcement learning. Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh. Advances in Neural Information Processing Systems 31, 2018	595	2018
Risk-constrained reinforcement learning with percentile risk criteria. Y Chow, M Ghavamzadeh, L Janson, M Pavone. Journal of Machine Learning Research 18 (167), 1-51, 2018	578	2018
Algorithms for CVaR optimization in MDPs. Y Chow, M Ghavamzadeh. Advances in Neural Information Processing Systems 27, 2014	380	2014
Risk-sensitive and robust decision-making: a CVaR optimization approach. Y Chow, A Tamar, S Mannor, M Pavone. Advances in Neural Information Processing Systems 28, 2015	375	2015
DualDICE: Behavior-agnostic estimation of discounted stationary distribution corrections. O Nachum, Y Chow, B Dai, L Li. Advances in Neural Information Processing Systems 32, 2019	343	2019
Lyapunov-based safe policy optimization for continuous control. Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh. arXiv preprint arXiv:1901.10031, 2019	276	2019
More robust doubly robust off-policy evaluation. M Farajtabar, Y Chow, M Ghavamzadeh. International Conference on Machine Learning, 1447-1456, 2018	276	2018
AlgaeDICE: Policy gradient from arbitrary experience. O Nachum, B Dai, I Kostrikov, Y Chow, L Li, D Schuurmans. arXiv preprint arXiv:1912.02074, 2019	245	2019
Safe policy improvement by minimizing robust baseline regret. M Ghavamzadeh, M Petrik, Y Chow. Advances in Neural Information Processing Systems 29, 2016	153	2016
Policy gradient for coherent risk measures. A Tamar, Y Chow, M Ghavamzadeh, S Mannor. Advances in Neural Information Processing Systems 28, 2015	143	2015
CoinDICE: Off-policy confidence interval estimation. B Dai, O Nachum, Y Chow, L Li, C Szepesvári, D Schuurmans. Advances in Neural Information Processing Systems 33, 9398-9411, 2020	85	2020
Sequential decision making with coherent risk. A Tamar, Y Chow, M Ghavamzadeh, S Mannor. IEEE Transactions on Automatic Control 62 (7), 3323-3338, 2016	81	2016
A framework for time-consistent, risk-sensitive model predictive control: Theory and algorithms. S Singh, Y Chow, A Majumdar, M Pavone. IEEE Transactions on Automatic Control 64 (7), 2905-2912, 2018	70	2018
Online modified greedy algorithm for storage control under uncertainty. J Qin, Y Chow, J Yang, R Rajagopal. IEEE Transactions on Power Systems 31 (3), 1729-1743, 2015	63	2015
CAQL: Continuous action Q-learning. M Ryu, Y Chow, R Anderson, C Tjandraatmadja, C Boutilier. arXiv preprint arXiv:1909.12397, 2019	55	2019
Latent bandits revisited. J Hong, B Kveton, M Zaheer, Y Chow, A Ahmed, C Boutilier. Advances in Neural Information Processing Systems 33, 13423-13433, 2020	53	2020
Weighted SGD for Regression with Randomized Preconditioning. J Yang, YL Chow, C Ré, MW Mahoney. Journal of Machine Learning Research 18 (211), 1-43, 2018	53	2018
Distributed online modified greedy algorithm for networked storage operation under uncertainty. J Qin, Y Chow, J Yang, R Rajagopal. IEEE Transactions on Smart Grid 7 (2), 1106-1118, 2015	44	2015
A framework for time-consistent, risk-averse model predictive control: Theory and algorithms. YL Chow, M Pavone. 2014 American Control Conference, 4204-4211, 2014	44	2014
Path consistency learning in Tsallis entropy regularized MDPs. Y Chow, O Nachum, M Ghavamzadeh. International Conference on Machine Learning, 979-988, 2018	40	2018
