Jiatong Shi (史嘉彤)
Verified email at andrew.cmu.edu
Title, Cited by, Year
SUPERB: Speech processing Universal PERformance Benchmark
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
Proceedings of the Interspeech, 1194-1198, 2021
Cited by 755, 2021
Recent developments on ESPnet toolkit boosted by Conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 277, 2021
AudioGPT: Understanding and generating speech, music, sound, and talking head
R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024
Cited by 111, 2024
Findings of the IWSLT 2022 Evaluation Campaign
A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ...
Proceedings of the 19th International Conference on Spoken Language …, 2022
Cited by 97, 2022
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities
HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
Cited by 83, 2022
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning
W Hou, Y Dong, B Zhuang, L Yang, J Shi, T Shinozaki
Proceedings of the Interspeech, 1037-1041, 2020
Cited by 73, 2020
Context-aware Goodness of Pronunciation for Computer-Assisted Pronunciation Training
J Shi, N Huo, Q Jin
Proceedings of the Interspeech, 3057-3061, 2020
Cited by 59, 2020
ESPnet2-TTS: Extending the edge of TTS research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
Cited by 51, 2021
UniAudio: Towards Universal Audio Generation with Large Language Models
D Yang, J Tian, X Tan, R Huang, S Liu, H Guo, X Chang, J Shi, J Bian, ...
Forty-first International Conference on Machine Learning, 2024
Cited by 48*, 2024
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
J Shi, D Berrebbi, W Chen, HL Chung, EP Hu, WP Huang, X Chang, ...
Proceedings of the Interspeech, 884-888, 2023
Cited by 37, 2023
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec
J Shi, JD Amith, RC García, EG Sierra, K Duh, S Watanabe
Proceedings of the 16th Conference of the European Chapter of the …, 2021
Cited by 35, 2021
Findings of the IWSLT 2023 evaluation campaign
M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli, O Bojar, C Borg, ...
Association for Computational Linguistics, 2023
Cited by 34, 2023
The singing voice conversion challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, T Toda
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by 31, 2023
SUPERB @ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning
T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 1096-1103, 2023
Cited by 31, 2023
Improving massively multilingual ASR with auxiliary CTC objectives
W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 29, 2023
Sequence-to-sequence singing voice synthesis with perceptual entropy loss
J Shi, S Guo, N Huo, Y Zhang, Q Jin
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 26, 2021
Reproducing Whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by 22, 2023
ESPnet-ST IWSLT 2021 Offline Speech Translation System
H Inaguma, B Yan, S Dalmia, P Gu, J Shi, K Duh, S Watanabe
Proceedings of the 18th International Conference on Spoken Language …, 2021
Cited by 20, 2021
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
D Berrebbi, J Shi, B Yan, O Lopez-Francisco, JD Amith, S Watanabe
Proceedings of the Interspeech, 3533-3537, 2022
Cited by 19, 2022
Leveraging deep learning with audio analytics to predict the success of crowdfunding projects
J Shi, K Yang, W Xu, M Wang
The Journal of Supercomputing 77, 7833-7853, 2021
Cited by 19, 2021