Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018 | 156 | 2018 |
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ... Proc. CHiME-5, 6-10, 2018 | 54 | 2018 |
Multi-stream end-to-end speech recognition R Li, X Wang, SH Mallidi, S Watanabe, T Hori, H Hermansky IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 646-655, 2019 | 30 | 2019 |
Stream attention-based multi-array end-to-end speech recognition X Wang, R Li, SH Mallidi, T Hori, S Watanabe, H Hermansky ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 27 | 2019 |
BAT System Description for NIST LRE 2015. O Plchot, P Matejka, O Glembek, R Fer, O Novotný, J Pesan, L Burget, ... Odyssey, 166-173, 2016 | 26 | 2016 |
The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System. PA Torres-Carrasquillo, F Richardson, SC Nercessian, DE Sturim, ... Interspeech, 1333-1337, 2017 | 19 | 2017 |
M-vectors: sub-band based energy modulation features for multi-stream automatic speech recognition S Sadhu, R Li, H Hermansky ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 18 | 2019 |
Stream Attention for Distributed Multi-Microphone Speech Recognition. X Wang, R Li, H Hermansky Interspeech, 3033-3037, 2018 | 15 | 2018 |
Multi-encoder multi-resolution framework for end-to-end speech recognition R Li, X Wang, SH Mallidi, T Hori, S Watanabe, H Hermansky arXiv preprint arXiv:1811.04897, 2018 | 13 | 2018 |
Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. R Li, SHR Mallidi, L Burget, O Plchot, N Dehak INTERSPEECH, 3265-3269, 2016 | 13 | 2016 |
A practical two-stage training strategy for multi-stream end-to-end speech recognition R Li, G Sell, X Wang, S Watanabe, H Hermansky ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 10 | 2020 |
Deriving spectro-temporal properties of hearing from speech data L Ondel, R Li, G Sell, H Hermansky ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 5 | 2019 |
Performance monitoring for end-to-end speech recognition R Li, G Sell, H Hermansky arXiv preprint arXiv:1904.04896, 2019 | 2 | 2019 |
Exploring methods for the automatic detection of errors in manual transcription X Wang, J Yang, R Li, S Sadhu, H Hermansky arXiv preprint arXiv:1904.04294, 2019 | 2 | 2019 |
Two-stage augmentation and adaptive CTC fusion for improved robustness of multi-stream end-to-end ASR R Li, G Sell, H Hermansky 2021 IEEE Spoken Language Technology Workshop (SLT), 229-235, 2021 | 1 | 2021 |
An Efficient and Robust Multi-stream Framework for End-to-end Speech Recognition R Li The Johns Hopkins University, 2020 | | 2020 |