Recurrent neural network based language model T Mikolov, M Karafiįt, L Burget, J Černockż, S Khudanpur Eleventh annual conference of the international speech communication association, 2010 | 5270 | 2010 |
Librispeech: an asr corpus based on public domain audio books V Panayotov, G Chen, D Povey, S Khudanpur 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 1640 | 2015 |
Extensions of recurrent neural network language model T Mikolov, S Kombrink, L Burget, J Černockż, S Khudanpur 2011 IEEE international conference on acoustics, speech and signal …, 2011 | 1142 | 2011 |
X-vectors: Robust dnn embeddings for speaker recognition D Snyder, D Garcia-Romero, G Sell, D Povey, S Khudanpur 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 997 | 2018 |
A time delay neural network architecture for efficient modeling of long temporal contexts V Peddinti, D Povey, S Khudanpur Sixteenth Annual Conference of the International Speech Communication …, 2015 | 706 | 2015 |
Purely sequence-trained neural networks for ASR based on lattice-free MMI. D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ... Interspeech, 2751-2755, 2016 | 577 | 2016 |
Audio augmentation for speech recognition T Ko, V Peddinti, D Povey, S Khudanpur Sixteenth Annual Conference of the International Speech Communication …, 2015 | 549 | 2015 |
Deep Neural Network Embeddings for Text-Independent Speaker Verification. D Snyder, D Garcia-Romero, D Povey, S Khudanpur Interspeech, 999-1003, 2017 | 501 | 2017 |
Improving deep neural network acoustic models using generalized maxout networks X Zhang, J Trmal, D Povey, S Khudanpur 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 332 | 2014 |
A smorgasbord of features for statistical machine translation FJ Och, D Gildea, S Khudanpur, A Sarkar, K Yamada, A Fraser, S Kumar, ... Proceedings of the Human Language Technology Conference of the North …, 2004 | 328 | 2004 |
Developments and directions in speech recognition and understanding, Part 1 [DSP Education] JM Baker, L Deng, J Glass, S Khudanpur, CH Lee, N Morgan, ... IEEE Signal processing magazine 26 (3), 75-80, 2009 | 312 | 2009 |
A study on data augmentation of reverberant speech for robust speech recognition T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 300 | 2017 |
Modeling pronunciation variation for ASR: A survey of the literature H Strik, C Cucchiarini Speech Communication 29 (2-4), 225-246, 1999 | 300 | 1999 |
A pitch extraction algorithm tuned for automatic speech recognition P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ... 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 283 | 2014 |
Deep neural network-based speaker embeddings for end-to-end speaker verification D Snyder, P Ghahremani, D Povey, D Garcia-Romero, Y Carmiel, ... 2016 IEEE Spoken Language Technology Workshop (SLT), 165-170, 2016 | 282 | 2016 |
Highway long short-term memory rnns for distant speech recognition Y Zhang, G Chen, D Yu, K Yaco, S Khudanpur, J Glass 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 267 | 2016 |
Joshua: An open source toolkit for parsing-based machine translation Z Li, C Callison-Burch, C Dyer, S Khudanpur, L Schwartz, W Thornton, ... Proceedings of the Fourth Workshop on Statistical Machine Translation, 135-139, 2009 | 218 | 2009 |
Transliteration of proper names in cross-lingual information retrieval P Virga, S Khudanpur Proceedings of the ACL 2003 workshop on Multilingual and mixed-language …, 2003 | 215 | 2003 |
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks. D Povey, G Cheng, Y Wang, K Li, H Xu, M Yarmohammadi, S Khudanpur Interspeech, 3743-3747, 2018 | 205 | 2018 |
Stochastic pronunciation modelling from hand-labelled phonetic corpora M Riley, W Byrne, M Finke, S Khudanpur, A Ljolje, J McDonough, H Nock, ... Speech Communication 29 (2-4), 209-224, 1999 | 202 | 1999 |