Sheng Zhao

Cited by

	All	Since 2019
Citations	6090	5999
h-index	31	31
i10-index	50	47

2100

1050

525

1575

20192020202120222023202490 329 855 1246 2008 1424

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Sheng Zhao

Microsoft

Verified email at microsoft.com

Speech


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Fastspeech 2: Fast and high-quality end-to-end text to speech Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu arXiv preprint arXiv:2006.04558, 2020	1287	2020
Fastspeech: Fast, robust and controllable text to speech Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu Advances in neural information processing systems 32, 2019	1096	2019
Neural speech synthesis with transformer network N Li, S Liu, Y Liu, S Zhao, M Liu Proceedings of the AAAI conference on artificial intelligence 33 (01), 6706-6713, 2019	797	2019
Neural codec language models are zero-shot text to speech synthesizers C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2301.02111, 2023	379	2023
Adaspeech: Adaptive text to speech for custom voice M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao, TY Liu arXiv preprint arXiv:2103.00993, 2021	161	2021
Hyper-structure recurrent neural networks for text-to-speech P Zhao, M Leung, K Yao, B Yan, S Zhao, FA Alleva US Patent 10,127,901, 2018	140	2018
Naturalspeech: End-to-end text-to-speech synthesis with human-level quality X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	135	2024
Almost unsupervised text to speech and automatic speech recognition Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu International conference on machine learning, 5410-5419, 2019	119	2019
Close to human quality TTS with transformer N Li, S Liu, Y Liu, S Zhao, M Liu, M Zhou arXiv preprint arXiv:1809.08895 2, 2018	117	2018
Developing RNN-T models surpassing high-performance hybrid models with customization capability J Li, R Zhao, Z Meng, Y Liu, W Wei, S Parthasarathy, V Mazalov, Z Wang, ... arXiv preprint arXiv:2007.15188, 2020	112	2020
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian arXiv preprint arXiv:2304.09116, 2023	106	2023
Multispeech: Multi-speaker text to speech with transformer M Chen, X Tan, Y Ren, J Xu, H Sun, S Zhao, T Qin, TY Liu arXiv preprint arXiv:2006.04664, 2020	100	2020
Lrspeech: Extremely low-resource speech synthesis and recognition J Xu, X Tan, Y Ren, T Qin, J Li, S Zhao, TY Liu Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020	92	2020
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2303.03926, 2023	90	2023
MBNet: MOS prediction for synthesized speech with mean-bias network Y Leng, X Tan, S Zhao, F Soong, XY Li, T Qin ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	83	2021
Dilated residual network with multi-head self-attention for speech emotion recognition R Li, Z Wu, J Jia, S Zhao, H Meng ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019	81	2019
Token-level ensemble distillation for grapheme-to-phoneme conversion H Sun, X Tan, JW Gan, H Liu, S Zhao, T Qin, TY Liu arXiv preprint arXiv:1904.03446, 2019	71	2019
Lightspeech: Lightweight and fast text to speech with neural architecture search R Luo, X Tan, R Wang, T Qin, J Li, S Zhao, E Chen, TY Liu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	68	2021
A study of non-autoregressive model for sequence generation Y Ren, J Liu, X Tan, Z Zhao, S Zhao, TY Liu arXiv preprint arXiv:2004.10454, 2020	67	2020
Prompttts: Controllable text-to-speech with text descriptions Z Guo, Y Leng, Y Wu, S Zhao, X Tan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	60	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by