XLS-R: Self-supervised cross-lingual speech representation learning at scale A Babu, C Wang, A Tjandra, K Lakhotia, Q Xu, N Goyal, K Singh, ... arXiv preprint arXiv:2111.09296, 2021 | 592 | 2021 |
Contextual RNN-T for open domain ASR M Jain, G Keren, J Mahadeokar, G Zweig, F Metze, Y Saraf arXiv preprint arXiv:2006.03411, 2020 | 96 | 2020 |
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ... arXiv preprint arXiv:2104.02194, 2021 | 81 | 2021 |
Providing entity-specific content in response to a search query AJ Berntson, N Agrawal, S Zhou, Y Saraf, T Joshi, KR Mcdonald, ... US Patent App. 12/876,638, 2012 | 79 | 2012 |
Improving RNN transducer based ASR with auxiliary tasks C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig 2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021 | 50 | 2021 |
A multi-view approach to audio-visual speaker verification L Sarı, K Singh, J Zhou, L Torresani, N Singhal, Y Saraf ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 49 | 2021 |
Improved language identification through cross-lingual self-supervised learning A Tjandra, DG Choudhury, F Zhang, K Singh, A Conneau, A Baevski, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 46 | 2022 |
Dual application of speech enhancement for automatic speech recognition A Pandey, C Liu, Y Wang, Y Saraf 2021 IEEE Spoken Language Technology Workshop (SLT), 223-228, 2021 | 37 | 2021 |
Towards measuring fairness in speech recognition: Casual conversations dataset transcriptions C Liu, M Picheny, L Sarı, P Chitkara, A Xiao, X Zhang, M Chou, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 35 | 2022 |
Faster, simpler and more accurate hybrid asr systems using wordpieces F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig arXiv preprint arXiv:2005.09150, 2020 | 30 | 2020 |
Multilingual graphemic hybrid ASR with massive data augmentation C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig arXiv preprint arXiv:1909.06522, 2019 | 30 | 2019 |
Contextualizing ASR lattice rescoring with hybrid pointer network language model DR Liu, C Liu, F Zhang, G Synnaeve, Y Saraf, G Zweig arXiv preprint arXiv:2005.07394, 2020 | 25 | 2020 |
Conformer-based self-supervised learning for non-speech audio tasks S Srivastava, Y Wang, A Tjandra, A Kumar, C Liu, K Singh, Y Saraf ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
Benchmarking lf-mmi, ctc and rnn-t criteria for streaming asr X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ... 2021 IEEE spoken language technology workshop (SLT), 46-51, 2021 | 23 | 2021 |
Kaizen: Continuously improving teacher using exponential moving average for semi-supervised speech recognition V Manohar, T Likhomanenko, Q Xu, WN Hsu, R Collobert, Y Saraf, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 22 | 2021 |
Scaling asr improves zero and few shot learning A Xiao, W Zheng, G Keren, D Le, F Zhang, C Fuegen, O Kalinli, Y Saraf, ... arXiv preprint arXiv:2111.05948, 2021 | 22 | 2021 |
Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ... arXiv preprint arXiv:2110.03520, 2021 | 17 | 2021 |
Search result driven query intent identification F Radlinski, N Craswell, B Billerbeck, M Shokouhi, S Ahari, N Agrawal, ... US Patent App. 12/813,376, 2011 | 16 | 2011 |
Algorithms for image segmentation Y Saraf Birla Institute of Technology and Science, 2006 | 15 | 2006 |
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models X Zhang, V Manohar, D Zhang, F Zhang, Y Shi, N Singhal, J Chan, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 13 | 2021 |