Transformer transducer: A streamable speech recognition model with transformer encoders and rnn-t loss Q Zhang, H Lu, H Sak, A Tripathi, E McDermott, S Koo, S Kumar ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 555 | 2020 |
Language/dialect recognition based on unsupervised deep learning Q Zhang, JHL Hansen IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (5), 873-882, 2018 | 51 | 2018 |
Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition A Tripathi, J Kim, Q Zhang, H Lu, H Sak arXiv, 2020 | 47 | 2020 |
Joint information from nonlinear and linear features for spoofing detection: An i-vector/DNN based approach C Zhang, S Ranjan, MK Nandwana, Q Zhang, A Misra, G Liu, F Kelly, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 34 | 2016 |
Multilingual Speech Recognition with Self-Attention Structured Parameterization Y Zhu, P Haghani, A Tripathi, B Ramabhadran, B Farris, H Xu, H Lu, ... Interspeech, 2020 | 32 | 2020 |
Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification Q Zhang, H Bořil, JHL Hansen 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 24 | 2013 |
Reducing streaming ASR model delay with self alignment J Kim, H Lu, A Tripathi, Q Zhang, H Sak arXiv preprint arXiv:2105.05005, 2021 | 22 | 2021 |
DIALECT RECOGNITION BASED ON UNSUPERVISED BOTTLENECK FEATURES Q Zhang, JHL Hansen Interspeech, 2017 | 20 | 2017 |
UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge C Yu, C Zhang, S Ranjan, Q Zhang, A Misra, F Kelly, JHL Hansen 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 17 | 2016 |
Robust language recognition based on diverse features Q Zhang, G Liu, JHL Hansen ODYSSEY: The speaker and language and language recognition workshop, 152-157, 2014 | 14 | 2014 |
Semi-supervised learning with generative adversarial networks for arabic dialect identification C Zhang, Q Zhang, JHL Hansen ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 13 | 2019 |
Contrastive siamese network for semi-supervised speech recognition S Khorram, J Kim, A Tripathi, H Lu, Q Zhang, H Sak ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 12 | 2022 |
Training Candidate Selection for Effective Rejection in Open-Set Language Identification Q Zhang, JHL Hansen | 12 | 2014 |
UTD-CRSS submission for MGB-3 Arabic dialect identification: Front-end and back-end advancements on broadcast speech AE Bulut, Q Zhang, C Zhang, F Bahmaninezhad, JHL Hansen 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 11 | 2017 |
Automatic assessment of language background in toddlers through phonotactic and pitch pattern modeling of short vocalizations. H Boril, Q Zhang, A Ziaei, JHL Hansen, D Xu, J Gilkerson, JA Richards, ... WOCCI, 39-43, 2014 | 8 | 2014 |
The delay-constrained information coverage problem in mobile sensor networks: single hop case GY Keung, Q Zhang, B Li Wireless Networks 16, 1961-1973, 2010 | 8 | 2010 |
Transformer transducer: one model unifying streaming and non-streaming speech recognition A Tripathi, H Sak, H Lu, Q Zhang, JY Kim US Patent 11,741,947, 2023 | 6 | 2023 |
A preliminary study of child vocalization on a parallel corpus of US and shanghainese toddlers. H Boril, Q Zhang, P Angkititrakul, JHL Hansen, D Xu, J Gilkerson, ... INTERSPEECH, 2405-2409, 2013 | 6 | 2013 |
Between-Class Covariance Correction For Linear Discriminant Analysis in Language Recognition A Misra, Q Zhang, F Kelly, J Hansen Odyssey, 2016 | 5 | 2016 |
Unsupervised k-means clustering based out-of-set candidate selection for robust open-set language recognition Q Zhang, JHL Hansen Spoken Language Technology Workshop (SLT), 2016 | 4 | 2016 |