Ziyang Ma

Cited by

	All	Since 2019
Citations	81	81
h-index	4	4
i10-index	3	3

2022202320248 27 46

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Xie ChenShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Zhisheng ZhengThe University of Texas at AustinVerified email at utexas.edu
ShiLiang ZhangSpeechLab，AlibabaVerified email at mail.ustc.edu.cn
Qian Chen (陈谦)Alibaba GroupVerified email at alibaba-inc.com
Changli TangTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Kai Yu（俞凯）Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
gao zhifuSpeech Lab, Alibaba GroupVerified email at alibaba-inc.com
Siqi ZhengDAMO Academy, Alibaba GroupVerified email at mail.harvard.edu
Yiwei GuoShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yifan YangMachine Learning Engineer, Xiaomi Corp.Verified email at xiaomi.com
Xuemeng SongShandong UniversityVerified email at sdu.edu.cn
Liqiang Nie (聂礼强), IAPR FellowHarbin Institute of Technology (Shenzhen)Verified email at hit.edu.cn
Wen WuUniversity of CambridgeVerified email at cam.ac.uk
Xuenan XuShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Qi ChenShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Jiaxin YePh.D. Student, Fudan UniversityVerified email at m.fudan.edu.cn

Ziyang Ma

Shanghai Jiao Tong University

Verified email at sjtu.edu.cn - Homepage

Speech and Language Processing Textless NLP Self-supervised Learning Multimedia


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
MT4SSL: Boosting self-supervised speech representation learning by integrating multiple targets Z Ma, Z Zheng, C Tang, Y Wang, X Chen Proc. Interspeech 2023, 2022	18	2022
Lauragpt: Listen, attend, understand, and regenerate audio with gpt Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, W Wang, S Zheng, ... arXiv preprint arXiv:2310.04673, 2023	11	2023
Hierarchical deep residual reasoning for temporal moment localization Z Ma, X Han, X Song, Y Cui, L Nie Proceedings of the 3rd ACM International Conference on Multimedia in Asia, 1-7, 2021	10	2021
Leveraging speech ptm, text llm, and emotional tts for speech emotion recognition Z Ma, W Wu, Z Zheng, Y Guo, Q Chen, S Zhang, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	4	2024
ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering Y Song, Z Chen, X Wang, Z Ma, X Chen arXiv preprint arXiv:2401.07333, 2024	4	2024
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation Z Ma, Z Zheng, G Yang, Y Wang, C Zhang, X Chen Proc. Interspeech 2023, 2023	4	2023
Tessp: text-enhanced self-supervised speech pre-training Z Yao, S Ren, S Chen, Z Ma, P Guo, L Xie arXiv preprint arXiv:2211.13443, 2022	4	2022
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching Y Guo, C Du, Z Ma, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3*	2024
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer W Chen, Y Liang, Z Ma, Z Zheng, X Chen arXiv preprint arXiv:2401.03497, 2024	3	2024
Fast-Hubert: an Efficient Training Framework for Self-Supervised Speech Representation Learning G Yang, Z Ma, Z Zheng, Y Song, Z Niu, X Chen 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023	3	2023
Front-end adapter: Adapting front-end input of speech based self-supervised learning for speech recognition X Chen, Z Ma, C Tang, Y Wang, Z Zheng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	3	2023
Improving few-shot learning for talking face system with tts data augmentation Q Chen, Z Ma, T Liu, X Tan, Q Lu, K Yu, X Chen ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	3	2023
Towards universal speech discrete tokens: A case study for asr and tts Y Yang, F Shen, C Du, Z Ma, K Yu, D Povey, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	2	2024
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Z Zheng, Z Ma, Y Wang, X Chen Proc. Interspeech 2023, 2023	2	2023
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation Z Liang, Z Song, Z Ma, C Du, K Yu, X Chen Proc. Interspeech 2023, 2023	2	2023
Hourglass-AVSR: Down-Up Sampling-Based Computational Efficiency Model for Audio-Visual Speech Recognition F Yu, H Wang, Z Ma, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ... arXiv preprint arXiv:2402.16153, 2024	1	2024
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2312.15185, 2023	1	2023
Exploring effective distillation of self-supervised speech models for automatic speech recognition Y Wang, C Tang, Z Ma, Z Zheng, X Chen, WQ Zhang 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-6, 2023	1	2023
SOUND EVENT DETECTION BY AGGREGATING PRE-TRAINED EMBEDDINGS FROM DIFFERENT LAYERS X Xu, Z Ma, F Yang, G Yang, M Wu, X Chen Tech. Rep., Technical report, DCASE2023 Challenge, 2023	1	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors