An audio-visual corpus for speech perception and automatic speech recognition M Cooke, J Barker, S Cunningham, X Shao The Journal of the Acoustical Society of America 120 (5), 2421-2424, 2006 | 1334 | 2006 |
Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model B Milner, X Shao Seventh International Conference on Spoken Language Processing, 2002 | 83 | 2002 |
Prediction of fundamental frequency and voicing from mel-frequency cepstral coefficients for unconstrained speech reconstruction B Milner, X Shao IEEE Transactions on Audio, Speech, and Language Processing 15 (1), 24-33, 2006 | 69 | 2006 |
Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end B Milner, X Shao Speech Communication 48 (6), 697-715, 2006 | 55 | 2006 |
Stream weight estimation for multistream audio–visual speech recognition in a multispeaker environment X Shao, J Barker Speech Communication 50 (4), 337-353, 2008 | 51 | 2008 |
Pitch prediction from MFCC vectors for speech reconstruction X Shao, B Milner 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 45 | 2004 |
Energetic and informational masking effects in an audiovisual speech recognition system J Barker, X Shao IEEE Transactions on Audio, Speech, and Language Processing 17 (3), 446-458, 2009 | 23 | 2009 |
Predicting formant frequencies from MFCC vectors [speech recognition applications] J Darch, B Milner, X Shao, S Vaseghi, Q Yang Proceedings (ICASSP '05), IEEE International Conference on Acoustics, Speech …, 2005 | 21 | 2005 |
Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model X Shao, B Milner 2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003 | 16 | 2003 |
Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction X Shao, B Milner The Journal of the Acoustical Society of America 118 (2), 1134-1143, 2005 | 12 | 2005 |
Pruning redundant synthesis units based on static and delta unit appearance frequency. H Lu, W Zhang, X Shao, Q Zhou, W Lei, H Zhou, AP Breen INTERSPEECH, 269-273, 2015 | 9 | 2015 |
Robust algorithms for speech reconstruction on mobile devices X Shao University of East Anglia, 2005 | 9 | 2005 |
Methods, apparatus and data structure for cross-language speech adaptation X Shao, A Breen US Patent 9,798,653, 2017 | 8 | 2017 |
Low bit-rate feature vector compression using transform coding and non-uniform bit allocation B Milner, X Shao 2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003 | 8 | 2003 |
Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications. X Shao, BP Milner, SJ Cox INTERSPEECH, 1725-1728, 2003 | 7 | 2003 |
Audio-visual speech fragment decoding. J Barker, X Shao AVSP, 2007 | 6 | 2007 |
Audio-visual speech recognition in the presence of a competing speaker X Shao, J Barker Ninth International Conference on Spoken Language Processing, 2006 | 6 | 2006 |
MAP prediction of pitch from MFCC vectors for speech reconstruction. X Shao, BP Milner INTERSPEECH, 2425-2428, 2004 | 6 | 2004 |
Model-based parametric prosody synthesis with deep neural network H Liu, H Lu, X Shao, Y Xu Proceedings of the Annual Conference of the International Speech …, 2016 | 5 | 2016 |
Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech. B Milner, X Shao, J Darch INTERSPEECH, 321-324, 2005 | 4 | 2005 |