Xu Tan

Cited by

	All	Since 2019
Citations	11609	11499
h-index	48	48
i10-index	104	103

Citations per year
Year	Citations
2018	85
2019	390
2020	892
2021	1562
2022	2285
2023	3708
2024	2577

Public access
Available	25 articles
Not available	1 article
Based on funding mandates

Co-authors

Tao Qin	Senior Principal Research Manager, Microsoft Research	Verified email at microsoft.com
Tie-Yan Liu	Distinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA Fellow	Verified email at microsoft.com
Sheng Zhao	Microsoft	Verified email at microsoft.com
Kaitao Song	Senior Researcher, Microsoft Research	Verified email at microsoft.com
Yi Ren (任意)	Research Scientist, Tiktok	Verified email at bytedance.com
Zhou Zhao	Zhejiang University	Verified email at zju.edu.cn
Yichong Leng	University of Science and Technology of China	Verified email at mail.ustc.edu.cn
Rui Wang	Microsoft Research Asia	Verified email at microsoft.com
Renqian Luo	Microsoft Research	Verified email at microsoft.com
Junliang Guo	Microsoft Research	Verified email at microsoft.com
Lei He	Principal Scientist Manager, Microsoft	Verified email at microsoft.com
Tianyu He	Microsoft Research	Verified email at microsoft.com
Arul Menezes	Microsoft Research	Verified email at microsoft.com
Hany Hassan Awadalla	Microsoft Research	Verified email at microsoft.com
Ming Zhou (周明)	Chief Scientist at Sinovation, ACL president (2019), VP of CCF (2020-2024)	Verified email at chuangxin.com
Xuedong D. Huang	Microsoft	Verified email at microsoft.com
Yuanchao Shu	Microsoft Research	Verified email at microsoft.com
JIMING CHEN	Professor at Zhejiang University	Verified email at ieee.org
Yoshua Bengio	Professor of computer science, University of Montreal, Mila, IVADO, CIFAR	Verified email at umontreal.ca

Xu Tan

Principal Researcher and Research Manager, Microsoft

Verified email at microsoft.com

Large Language Models, Speech/Music Generation, Avatar/Video Generation, Multimodality


Title	Authors	Venue	Cited by	Year
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech	Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu	ICLR 2021, 2020	1287	2020
MASS: Masked Sequence to Sequence Pre-training for Language Generation	K Song, X Tan, T Qin, J Lu, TY Liu	ICML 2019, 2019	1115	2019
FastSpeech: Fast, Robust and Controllable Text to Speech	Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu	NIPS 2019, 2019	1096	2019
MPNet: Masked and permuted pre-training for language understanding	K Song, X Tan, T Qin, J Lu, TY Liu	Advances in neural information processing systems 33, 16857-16867, 2020	893	2020
HuggingGPT: Solving AI tasks with ChatGPT and its friends in Hugging Face	Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang	Advances in Neural Information Processing Systems 36, 2024	700	2024
Achieving human parity on automatic Chinese to English news translation	H Hassan, A Aue, C Chen, V Chowdhary, J Clark, C Federmann, X Huang, ...	arXiv preprint arXiv:1803.05567, 2018	695	2018
A survey on neural speech synthesis	X Tan, T Qin, F Soong, TY Liu	arXiv preprint arXiv:2106.15561, 2021	368	2021
Multilingual Neural Machine Translation with Knowledge Distillation	X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu	ICLR 2019, 2019	257	2019
Representation Degeneration Problem in Training Natural Language Generation Models	J Gao, D He, X Tan, T Qin, L Wang, T Liu	ICLR 2019, 2018	236	2018
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit	T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...	ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020	216	2020
FRAGE: frequency-agnostic word representation	C Gong, D He, X Tan, T Qin, L Wang, TY Liu	NIPS 2018, 2018	177	2018
AdaSpeech: Adaptive text to speech for custom voice	M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao, TY Liu	ICLR 2021, 2021	161	2021
NaturalSpeech: End-to-end text-to-speech synthesis with human-level quality	X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ...	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	135	2024
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input	J Guo, X Tan, D He, T Qin, L Xu, TY Liu	AAAI 2019, 2018	131	2018
Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation	T He, X Tan, Y Xia, D He, T Qin, Z Chen, TY Liu	NIPS 2018, 2018	128	2018
Almost Unsupervised Text to Speech and Automatic Speech Recognition	Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu	ICML 2019, 2019	119	2019
Multilingual neural machine translation with language clustering	X Tan, J Chen, D He, Y Xia, T Qin, TY Liu	EMNLP 2019, 2019	114	2019
PopMAG: Pop music accompaniment generation	Y Ren, J He, X Tan, T Qin, Z Zhao, TY Liu	Proceedings of the 28th ACM international conference on multimedia, 1198-1206, 2020	110	2020
MusicBERT: Symbolic music understanding with large-scale pre-training	M Zeng, X Tan, R Wang, Z Ju, T Qin, TY Liu	ACL 2021, 2021	109	2021
NaturalSpeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers	K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian	arXiv preprint arXiv:2304.09116, 2023	106	2023

Articles 1–20
