State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and speakers in the wild evaluations J Villalba, N Chen, D Snyder, D Garcia-Romero, A McCree, G Sell, ... Computer Speech & Language 60, 101026, 2020 | 150 | 2020 |
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. J Villalba, N Chen, D Snyder, D Garcia-Romero, A McCree, G Sell, ... Interspeech, 1488-1492, 2019 | 126 | 2019 |
Overlap-aware diarization: Resegmentation using neural end-to-end overlapped speech detection L Bullock, H Bredin, LP Garcia-Perera ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 122 | 2020 |
Dover-lap: A method for combining overlap-aware diarization outputs D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021 | 83 | 2021 |
Speaker diarization with region proposal network Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 79 | 2020 |
On the results of the first mobile biometry (MOBIO) face and speaker verification evaluation S Marcel, C McCool, P Matějka, T Ahonen, J Černocký, S Chakraborty, ... Recognizing Patterns in Signals, Speech, Images and Videos: ICPR 2010 …, 2010 | 72 | 2010 |
Investigating self-supervised learning for speech enhancement and separation Z Huang, S Watanabe, S Yang, P García, S Khudanpur ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 71 | 2022 |
Encoder-decoder based attractors for end-to-end neural diarization S Horiguchi, Y Fujita, S Watanabe, Y Xue, P Garcia IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1493-1507, 2022 | 65 | 2022 |
Online end-to-end neural diarization with speaker-tracing buffer Y Xue, S Horiguchi, Y Fujita, S Watanabe, P García, K Nagamatsu 2021 IEEE Spoken Language Technology Workshop (SLT), 841-848, 2021 | 56 | 2021 |
End-to-end Domain-Adversarial Voice Activity Detection M Lavechin, MP Gill, R Bousbib, H Bredin, LP Garcia-Perera arXiv preprint arXiv:1910.10655, 2019 | 56 | 2019 |
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ... Proc. CHiME-5, 6-10, 2018 | 54 | 2018 |
The chime-7 dasr challenge: Distant meeting transcription with multiple devices in diverse scenarios S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ... arXiv preprint arXiv:2306.13734, 2023 | 53 | 2023 |
Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network. F Wu, LP García-Perera, D Povey, S Khudanpur Interspeech, 1-5, 2019 | 50 | 2019 |
End-to-end speaker diarization as post-processing S Horiguchi, P Garcia, Y Fujita, S Watanabe, K Nagamatsu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 49 | 2021 |
Towards neural diarization for unlimited numbers of speakers using global and local attractors S Horiguchi, S Watanabe, P García, Y Xue, Y Takashima, Y Kawaguchi 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 98-105, 2021 | 43 | 2021 |
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ... arXiv preprint arXiv:2102.01363, 2021 | 42 | 2021 |
Feature enhancement with deep feature losses for speaker verification S Kataria, PS Nidadavolu, J Villalba, N Chen, P Garcia-Perera, N Dehak ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 39 | 2020 |
Speaker detection in the wild: Lessons learned from JSALT 2019 P García, J Villalba, H Bredin, J Du, D Castan, A Cristia, L Bullock, L Guo, ... arXiv preprint arXiv:1912.00938, 2019 | 38 | 2019 |
Online streaming end-to-end neural diarization handling overlapping speech and flexible numbers of speakers Y Xue, S Horiguchi, Y Fujita, Y Takashima, S Watanabe, P Garcia, ... arXiv preprint arXiv:2101.08473, 2021 | 30 | 2021 |
End-to-end speaker diarization conditioned on speech activity and overlap detection Y Takashima, Y Fujita, S Watanabe, S Horiguchi, P García, K Nagamatsu 2021 IEEE Spoken Language Technology Workshop (SLT), 849-856, 2021 | 28 | 2021 |