Follow
Xu Tan
Xu Tan
Principal Researcher and Research Manager, Microsoft
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu
ICLR 2021, 2020
15092020
FastSpeech: Fast, Robust and Controllable Text to Speech
Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu
NIPS 2019, 2019
12142019
MASS: Masked Sequence to Sequence Pre-training for Language Generation
K Song, X Tan, T Qin, J Lu, TY Liu
ICML 2019, 2019
11762019
Mpnet: Masked and permuted pre-training for language understanding
K Song, X Tan, T Qin, J Lu, TY Liu
Advances in neural information processing systems 33, 16857-16867, 2020
11232020
Hugginggpt: Solving ai tasks with chatgpt and its friends in hugging face
Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang
Advances in Neural Information Processing Systems 36, 2024
9422024
Achieving human parity on automatic chinese to english news translation
H Hassan, A Aue, C Chen, V Chowdhary, J Clark, C Federmann, X Huang, ...
arXiv preprint arXiv:1803.05567, 2018
7342018
A survey on neural speech synthesis
X Tan, T Qin, F Soong, TY Liu
arXiv preprint arXiv:2106.15561, 2021
4332021
Multilingual Neural Machine Translation with Knowledge Distillation
X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu
ICLR 2019, 2019
2682019
Representation Degeneration Problem in Training Natural Language Generation Models
J Gao, D He, X Tan, T Qin, L Wang, T Liu
ICLR 2019, 2018
2672018
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit
T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...
ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020
2412020
Naturalspeech: End-to-end text-to-speech synthesis with human-level quality
X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
1892024
Adaspeech: Adaptive text to speech for custom voice
M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao, TY Liu
ICLR 2021, 2021
1852021
FRAGE: frequency-agnostic word representation
C Gong, D He, X Tan, T Qin, L Wang, Liu, Tie-Yan
NIPS 2018, 2018
1842018
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers
K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian
arXiv preprint arXiv:2304.09116, 2023
1792023
Connecting large language models with evolutionary algorithms yields powerful prompt optimizers
Q Guo, R Wang, J Guo, B Li, K Song, X Tan, G Liu, J Bian, Y Yang
arXiv preprint arXiv:2309.08532, 2023
1382023
Musicbert: Symbolic music understanding with large-scale pre-training
M Zeng, X Tan, R Wang, Z Ju, T Qin, TY Liu
ACL 2021, 2021
1352021
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input
J Guo, X Tan, D He, T Qin, L Xu, TY Liu
AAAI 2019, 2018
1342018
Popmag: Pop music accompaniment generation
Y Ren, J He, X Tan, T Qin, Z Zhao, TY Liu
Proceedings of the 28th ACM international conference on multimedia, 1198-1206, 2020
1322020
Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation
T He, X Tan, Y Xia, D He, T Qin, Z Chen, Liu, Tie-Yan
NIPS 2018, 2018
1302018
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu
ICML 2019, 2019
1282019
The system can't perform the operation now. Try again later.
Articles 1–20