Yuan Cao

Cited by

	All	Since 2020
Citations	3139	2892
h-index	18	18
i10-index	27	26

720

360

180

540

2018201920202021202220232024202525 202 354 461 610 591 716 157

Public access

View all

22 articles

0 articles

available

not available

Based on funding mandates

Yuan Cao

The University of Hong Kong

Verified email at hku.hk - Homepage

Machine Learning Optimization


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gradient descent optimizes over-parameterized deep ReLU networks D Zou, Y Cao, D Zhou, Q Gu Machine learning 109, 467-492, 2020	798	2020
Generalization bounds of stochastic gradient descent for wide and deep neural networks Y Cao, Q Gu Advances in neural information processing systems 32, 2019	455	2019
Towards understanding the spectral bias of deep learning Y Cao, Z Fang, Y Wu, DX Zhou, Q Gu arXiv preprint arXiv:1912.01198, 2019	261	2019
Closing the generalization gap of adaptive gradient methods in training deep neural networks J Chen, D Zhou, Y Tang, Z Yang, Y Cao, Q Gu arXiv preprint arXiv:1806.06763, 2018	235	2018
On the convergence of adaptive gradient methods for nonconvex optimization D Zhou, J Chen, Y Cao, Y Tang, Z Yang, Q Gu arXiv preprint arXiv:1808.05671, 2018	218	2018
Generalization error bounds of gradient descent for learning over-parameterized deep relu networks Y Cao, Q Gu Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3349-3356, 2020	217*	2020
How much over-parameterization is sufficient to learn deep ReLU networks? Z Chen, Y Cao, D Zou, Q Gu arXiv preprint arXiv:1911.12360, 2019	151	2019
Benign overfitting in two-layer convolutional neural networks Y Cao, Z Chen, M Belkin, Q Gu Advances in neural information processing systems 35, 25237-25250, 2022	136	2022
A generalized neural tangent kernel analysis for two-layer neural networks Z Chen, Y Cao, Q Gu, T Zhang Advances in Neural Information Processing Systems 33, 13363-13373, 2020	106*	2020
Agnostic learning of a single neuron with gradient descent S Frei, Y Cao, Q Gu Advances in neural information processing systems 33, 5417-5428, 2020	75	2020
Risk bounds for over-parameterized maximum margin classification on sub-gaussian mixtures Y Cao, Q Gu, M Belkin Advances in Neural Information Processing Systems 34, 8407-8418, 2021	69	2021
Understanding the generalization of adam in learning neural networks with proper regularization D Zou, Y Cao, Y Li, Q Gu arXiv preprint arXiv:2108.11371, 2021	62	2021
The benefits of mixup for feature learning D Zou, Y Cao, Y Li, Q Gu International Conference on Machine Learning, 43423-43479, 2023	43	2023
Algorithm-dependent generalization bounds for overparameterized deep residual networks S Frei, Y Cao, Q Gu Advances in neural information processing systems 32, 2019	40	2019
Local and global inference for high dimensional nonparanormal graphical models Q Gu, Y Cao, Y Ning, H Liu arXiv preprint arXiv:1502.02347, 2015	38*	2015
Online machine learning modeling and predictive control of nonlinear systems with scheduled mode transitions C Hu, Y Cao, Z Wu AIChE Journal 69 (2), e17882, 2023	26	2023
Provable generalization of sgd-trained neural networks of any width in the presence of adversarial label noise S Frei, Y Cao, Q Gu International Conference on Machine Learning, 3427-3438, 2021	25	2021
Tight sample complexity of learning one-hidden-layer convolutional neural networks Y Cao, Q Gu Advances in Neural Information Processing Systems 32, 2019	24	2019
Agnostic learning of halfspaces with gradient descent via soft margins S Frei, Y Cao, Q Gu International Conference on Machine Learning, 3417-3426, 2021	18	2021
Benign overfitting in two-layer relu convolutional neural networks for xor data X Meng, D Zou, Y Cao arXiv preprint arXiv:2310.01975, 2023	16	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by