フォロー
Xiaoyu Yang
Xiaoyu Yang
Machine Learning Engineer, Xiaomi Corp.
確認したメール アドレス: xiaomi.com
タイトル
引用先
引用先
Knowledge distillation for neural transducers from large self-supervised pre-trained models
X Yang, Q Li, PC Woodland
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
172022
Zipformer: A faster and better encoder for automatic speech recognition
Z Yao, L Guo, X Yang, W Kang, F Kuang, Y Yang, Z Jin, L Lin, D Povey
arXiv preprint arXiv:2310.11230, 2023
152023
Fast and parallel decoding for transducer
W Kang, L Guo, F Kuang, L Lin, M Luo, Z Yao, X Yang, P Żelasko, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
92023
Knowledge distillation from multiple foundation models for end-to-end speech recognition
X Yang, Q Li, C Zhang, PC Woodland
arXiv preprint arXiv:2303.10917, 2023
42023
Libriheavy: a 50,000 hours asr corpus with punctuation casing and context
W Kang, X Yang, Z Yao, F Kuang, Y Yang, L Guo, L Lin, D Povey
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
Predicting multi-codebook vector quantization indexes for knowledge distillation
L Guo, X Yang, Q Wang, Y Kong, Z Yao, F Cui, F Kuang, W Kang, L Lin, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
32023
Blank-regularized ctc for frame skipping in neural transducer
Y Yang, X Yang, L Guo, Z Yao, W Kang, F Kuang, L Lin, X Chen, D Povey
arXiv preprint arXiv:2305.11558, 2023
32023
Delay-penalized transducer for low-latency streaming asr
W Kang, Z Yao, F Kuang, L Guo, X Yang, L Lin, P Żelasko, D Povey
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
PromptASR for contextualized ASR with controllable style
X Yang, W Kang, Z Yao, Y Yang, L Guo, F Kuang, L Lin, D Povey
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
Delay-penalized CTC implemented based on Finite State Transducer
Z Yao, W Kang, F Kuang, L Guo, X Yang, Y Yang, L Lin, D Povey
arXiv preprint arXiv:2305.11539, 2023
2023
Knowledge Distillation for End-to-End Automatic Speech Recognition
X Yang
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–11