Pengcheng He

Cited by

	All	Since 2019
Citations	8978	8959
h-index	29	29
i10-index	46	46

3500

1750

875

2625

201920202021202220232024212 727 1309 1866 3418 1397

Co-authors

Weizhu ChenMicrosoftVerified email at microsoft.com
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Xiaodong LiuMicrosoft Research, RedmondVerified email at microsoft.com
Tuo ZhaoAssistant Professor, Georgia TechVerified email at gatech.edu
Baolin PengTencent AI LabVerified email at global.tencent.com
Jiawei HanAbel Bliss Professor of Computer Science, University of IllinoisVerified email at cs.uiuc.edu
Hao ChengMicrosoft ResearchVerified email at microsoft.com
Liyuan LiuMicrosoft ResearchVerified email at illinois.edu
Hoifung PoonGeneral Manager, Microsoft Health FuturesVerified email at microsoft.com
Adam TrischlerMicrosoft Research, McGill UniversityVerified email at microsoft.com
Tao ShenOracleVerified email at oracle.com
Guodong LongAssociate Professor, Faculty of Engineering and IT, University of Technology SydneyVerified email at uts.edu.au
William DarlingCohereVerified email at cohere.com
Yu WangMicrosoft ResearchVerified email at microsoft.com

Pengcheng He

Microsoft

Verified email at microsoft.com

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
On the variance of the adaptive learning rate and beyond L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han ICLR 2019, 2019	1992	2019
Deberta: Decoding-enhanced bert with disentangled attention P He, X Liu, J Gao, W Chen ICLR 2021, 2020	1937	2020
Multi-task deep neural networks for natural language understanding X Liu, P He, W Chen, J Gao ACL 2019, 2019	1324	2019
Debertav3: Improving deberta using electra-style pre-training with gradient-disentangled embedding sharing P He, J Gao, W Chen ICLR 2023, 2021	566	2021
Instruction tuning with gpt-4 B Peng, C Li, P He, M Galley, J Gao arXiv preprint arXiv:2304.03277, 2023	435	2023
Smart: Robust and efficient fine-tuning for pre-trained natural language models through principled regularized optimization H Jiang, P He, W Chen, X Liu, J Gao, T Zhao ACL 2020, 2019	413	2019
Check your facts and try again: Improving large language models with external knowledge and automated feedback B Peng, M Galley, P He, H Cheng, Y Xie, Y Hu, Q Huang, L Liden, Z Yu, ... arXiv preprint arXiv:2302.12813, 2023	237	2023
Improving multi-task deep neural networks via knowledge distillation for natural language understanding X Liu, P He, W Chen, J Gao arXiv preprint arXiv:1904.09482, 2019	198	2019
Generation-augmented retrieval for open-domain question answering Y Mao, P He, X Liu, Y Shen, J Gao, J Han, W Chen arXiv preprint arXiv:2009.08553, 2020	161	2020
Adversarial training for large neural language models X Liu, H Cheng, P He, W Chen, Y Wang, H Poon, J Gao arXiv preprint arXiv:2004.08994, 2020	161	2020
Diffusion-GAN: Training GANs with Diffusion Z Wang, H Zheng, P He, W Chen, M Zhou ICLR 2023, 2022	140	2022
Adaptive budget allocation for parameter-efficient fine-tuning Q Zhang, M Chen, A Bukharin, P He, Y Cheng, W Chen, T Zhao The Eleventh International Conference on Learning Representations, 2023	135	2023
X-SQL: reinforce schema representation with context P He, Y Mao, K Chakrabarti, W Chen arXiv preprint arXiv:1908.08113, 2019	99	2019
On the variance of the adaptive learning rate and beyond. arXiv 2019 L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han arXiv preprint arXiv:1908.03265, 1908	84	1908
NeurIPS 2020 EfficientQA competition: Systems, analyses and lessons learned S Min, J Boyd-Graber, C Alberti, D Chen, E Choi, M Collins, K Guu, ... NeurIPS 2020, 2021	65	2021
Dola: Decoding by contrasting layers improves factuality in large language models YS Chuang, Y Xie, H Luo, Y Kim, J Glass, P He arXiv preprint arXiv:2309.03883, 2023	63	2023
Exploiting structured knowledge in text via graph-guided representation learning T Shen, Y Mao, P He, G Long, A Trischler, W Chen arXiv preprint arXiv:2004.14224, 2020	61	2020
Godel: Large-scale pre-training for goal-directed dialog B Peng, M Galley, P He, C Brockett, L Liden, E Nouri, Z Yu, B Dolan, ... arXiv preprint arXiv:2206.11309, 2022	51	2022
The microsoft toolkit of multi-task deep neural networks for natural language understanding X Liu, Y Wang, J Ji, H Cheng, X Zhu, E Awa, P He, W Chen, H Poon, ... arXiv preprint arXiv:2002.07972, 2020	51	2020
Platon: Pruning large transformer models with upper confidence bound of weight importance Q Zhang, S Zuo, C Liang, A Bukharin, P He, W Chen, T Zhao International Conference on Machine Learning, 26809-26823, 2022	49	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors