Mengdi Wang

Cited by

	All	Since 2019
Citations	5287	4917
h-index	41	40
i10-index	82	78

1500

750

375

1125

201520162017201820192020202120222023202423 44 112 155 265 494 922 1137 1494 602

Public access

View all

40 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Lin F. Yang (杨林)Assistant Professor, Department of Electrical and Computer Engineering @ UCLAVerified email at ee.ucla.edu
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Alec KoppelAI Research Lead, JP Morgan AI ResearchVerified email at jpmchase.com
Yinyu YeK.T. Li Professor of Engineering, Stanford UniversityVerified email at stanford.edu
Tuo ZhaoAssistant Professor, Georgia TechVerified email at gatech.edu
Dimitri BertsekasArizona State University - Massachusetts Institute of TechnologyVerified email at mit.edu
Aaron SidfordStanford UniversityVerified email at stanford.edu
Ethan X. FangAssociate Professor at Duke UniversityVerified email at duke.edu
Botao HaoDeepmindVerified email at google.com
Anru ZhangDuke UniversityVerified email at duke.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyVerified email at cs.berkeley.edu
Zhaoran WangAssistant Professor at Northwestern UniversityVerified email at northwestern.edu
Yu-Xiang WangAssociate Professor of Computer Science, UC Santa BarbaraVerified email at cs.ucsb.edu
Tong ZhangHKUSTVerified email at tongzhang-ml.org
Saeed GhadimiUniversity of WaterlooVerified email at uwaterloo.ca
Tor LattimoreDeepMindVerified email at google.com
Prateek MittalProfessor, Princeton UniversityVerified email at princeton.edu
Andrzej RuszczyńskiBoard of Governors Professor of Rutgers UniversityVerified email at business.rutgers.edu
Zheng Tracy KeHarvard UniversityVerified email at fas.harvard.edu
Lihong Li (李力鸿)AmazonVerified email at amazon.com

Mengdi Wang

Center for Statistics & Machine Learning, ECE, Princeton University

Verified email at princeton.edu - Homepage

reinforcement learning optimization machine learning data science control


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Sample-optimal parametric q-learning using linearly additive features L Yang, M Wang International conference on machine learning, 6995-7004, 2019	335	2019
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound LF Yang, M Wang International Conference on Machine Learning, 2020, 2019	298	2019
Model-based reinforcement learning with value-targeted regression A Ayoub, Z Jia, C Szepesvari, M Wang, L Yang International Conference on Machine Learning, 463-474, 2020	297	2020
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions M Wang, EX Fang, H Liu Mathematical Programming 161, 419-449, 2017	255	2017
Near-optimal time and sample complexities for solving Markov decision processes with a generative model A Sidford, M Wang, X Wu, L Yang, Y Ye Advances in Neural Information Processing Systems 31, 2018	246*	2018
Approximation methods for bilevel programming S Ghadimi, M Wang arXiv preprint arXiv:1802.02246, 2018	202	2018
Minimax-optimal off-policy evaluation with linear function approximation Y Duan, Z Jia, M Wang International Conference on Machine Learning, 2701-2709, 2020	151	2020
Accelerating stochastic composition optimization M Wang, J Liu, EX Fang Journal of Machine Learning Research, 2017, 2016	148	2016
Variance reduced value iteration and faster algorithms for solving markov decision processes A Sidford, M Wang, X Wu, Y Ye. Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete …, 2017	134*	2017
Variational policy gradient method for reinforcement learning with general utilities J Zhang, A Koppel, AS Bedi, C Szepesvari, M Wang Advances in Neural Information Processing Systems 2020, 2020	125	2020
Stochastic first-order methods with random constraint projection M Wang, DP Bertsekas SIAM Journal on Optimization 26 (1), 681-717, 2016	118*	2016
A single timescale stochastic approximation method for nested stochastic optimization S Ghadimi, A Ruszczynski, M Wang SIAM Journal on Optimization 30 (1), 960-979, 2020	115	2020
Finite-sum composition optimization via variance reduced gradient descent X Lian, M Wang, J Liu Artificial Intelligence and Statistics. 2017., 2016	93	2016
On function approximation in reinforcement learning: Optimism in the face of large state spaces Z Yang, C Jin, Z Wang, M Wang, MI Jordan arXiv preprint arXiv:2011.04622, 2020	92*	2020
Towards compact cnns via collaborative compression Y Li, S Lin, J Liu, Q Ye, M Wang, F Chao, F Yang, J Ma, Q Tian, R Ji Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	84	2021
A distributed tracking algorithm for reconstruction of graph signals X Wang, M Wang, Y Gu IEEE Journal of Selected Topics in Signal Processing 9 (4), 728-740, 2015	79	2015
Randomized linear programming solves the Markov decision problem in nearly linear (sometimes sublinear) time M Wang Mathematics of Operations Research 45 (2), 517-546, 2020	78*	2020
Solving discounted stochastic two-player games with near-optimal time and sample complexity A Sidford, M Wang, L Yang, Y Ye International Conference on Artificial Intelligence and Statistics, 2992-3002, 2020	77	2020
Primal-Dual Learning: Sample Complexity and Sublinear Run Time for Ergodic Markov Decision Problems M Wang arXiv preprint arXiv:1710.06100, 2017	74	2017
Stochastic primal-dual methods and sample complexity of reinforcement learning Y Chen, M Wang arXiv preprint arXiv:1612.02516, 2016	71	2016

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors