Follow
Yu Bai
Yu Bai
Research Scientist, Salesforce Research
Verified email at salesforce.com - Homepage
Title
Cited by
Cited by
Year
The landscape of empirical risk for nonconvex losses
S Mei, Y Bai, A Montanari
The Annals of Statistics 46 (6A), 2747-2774, 2018
2542018
Provable self-play algorithms for competitive reinforcement learning
Y Bai, C Jin
International conference on machine learning, 551-560, 2020
812020
Proxquant: Quantized neural networks via proximal operators
Y Bai, YX Wang, E Liberty
International Conference on Learning Representations (ICLR) 2019, 2018
712018
Beyond linearization: On quadratic and higher-order approximation of wide neural networks
Y Bai, JD Lee
International Conference on Learning Representations (ICLR) 2020, 2019
692019
Near-Optimal Reinforcement Learning with Self-Play
Y Bai, C Jin, T Yu
Advances in Neural Information Processing Systems, 2020, 2020
602020
Approximability of discriminators implies diversity in GANs
Y Bai, T Ma, A Risteski
International Conference on Learning Representations (ICLR) 2019, 2018
602018
A sharp analysis of model-based reinforcement learning with self-play
Q Liu, T Yu, Y Bai, C Jin
International Conference on Machine Learning, 7001-7010, 2021
522021
Provably Efficient Q-Learning with Low Switching Cost
Y Bai, T Xie, N Jiang, YX Wang
Advances in Neural Information Processing Systems, 2019, 2019
482019
Near-optimal provable uniform convergence in offline policy evaluation for reinforcement learning
M Yin, Y Bai, YX Wang
International Conference on Artificial Intelligence and Statistics, 1567-1575, 2021
42*2021
Subgradient descent learns orthogonal dictionaries
Y Bai, Q Jiang, J Sun
International Conference on Learning Representations (ICLR) 2019, 2018
412018
Near-optimal offline reinforcement learning via double variance reduction
M Yin, Y Bai, YX Wang
Advances in neural information processing systems 34, 7677-7688, 2021
302021
How Important is the Train-Validation Split in Meta-Learning?
Y Bai, M Chen, P Zhou, T Zhao, J Lee, S Kakade, H Wang, C Xiong
International Conference on Machine Learning, 543-553, 2021
282021
Policy finetuning: Bridging sample-efficient offline and online reinforcement learning
T Xie, N Jiang, H Wang, C Xiong, Y Bai
Advances in neural information processing systems 34, 27395-27407, 2021
262021
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
Z Song, S Mei, Y Bai
arXiv preprint arXiv:2110.04184, 2021
182021
Towards understanding hierarchical learning: Benefits of neural representations
M Chen, Y Bai, JD Lee, T Zhao, H Wang, C Xiong, R Socher
Advances in Neural Information Processing Systems, 2020, 2020
182020
Tapas: Two-pass approximate adaptive sampling for softmax
Y Bai, S Goldman, L Zhang
arXiv preprint arXiv:1707.03073, 2017
132017
Sample-efficient learning of stackelberg equilibria in general-sum games
Y Bai, C Jin, H Wang, C Xiong
Advances in Neural Information Processing Systems 34, 25799-25811, 2021
102021
Exact gap between generalization error and uniform convergence in random feature models
Z Yang, Y Bai, S Mei
International Conference on Machine Learning, 11704-11715, 2021
102021
Taylorized training: Towards better approximation of neural network training at finite width
Y Bai, B Krause, H Wang, C Xiong, R Socher
arXiv preprint arXiv:2002.04010, 2020
92020
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification
Y Bai, S Mei, H Wang, C Xiong
International Conference on Machine Learning, 566-576, 2021
82021
The system can't perform the operation now. Try again later.
Articles 1–20