John Hershey

引用先

	すべて	2019 年以来
引用	17861	13198
h 指標	58	50
i10 指標	144	107

2600

1300

650

1950

200620072008200920102011201220132014201520162017201820192020202120222023202456 100 86 144 215 186 271 269 289 365 413 806 1245 1774 2253 2494 2433 2509 1730

オープンアクセス

すべて表示

7 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Jonathan Le RouxMERL確認したメールアドレス: merl.com
Shinji WatanabeCarnegie Mellon University確認したメールアドレス: cmu.edu
Hakan ErdoganGoogle確認したメールアドレス: google.com
Scott WisdomGoogle Research確認したメールアドレス: google.com
Takaaki HoriApple確認したメールアドレス: apple.com
Peder A OlsenMicrosoft Research (formerly IBM Research)確認したメールアドレス: microsoft.com
Zhuo ChenBytedance (formerly Microsoft, Columbia University)確認したメールアドレス: columbia.edu
Steven J. RenniePryon Inc. (Formerly Fusemachines Inc, IBM Research, University of Toronto)確認したメールアドレス: pryoninc.com
Felix WeningerMicrosoft確認したメールアドレス: microsoft.com
Kevin WilsonGoogle確認したメールアドレス: google.com
Trausti T KristjanssonAmazon Lab126, Adjoint Professor at University of Reykjavik (formerly Google, IBM, MSR)確認したメールアドレス: amazon.com
Javier MovellanResearch Professor, University of California San Diego確認したメールアドレス: mplab.ucsd.edu
Chiori HoriMERL確認したメールアドレス: merl.com
Tim K. MarksPrincipal Research Scientist, Mitsubishi Electric Research Labs (MERL)確認したメールアドレス: merl.com
Efthymios TzinisResearch Scientist at Google | Ex. UIUC, MERL, Meta確認したメールアドレス: google.com
Zhong-Qiu WangAssociate Professor, Southern University of Science and Technology確認したメールアドレス: sustech.edu.cn
Ron J WeissGoogle確認したメールアドレス: google.com
Yuuki TachiokaDenso IT Laboratory確認したメールアドレス: d-itlab.co.jp
Björn SchullerProfessor, Technische Universität München (TUM) / Imperial College London & CSO, audEERING確認したメールアドレス: tum.de
Joshua M SusskindApple AI Research確認したメールアドレス: apple.com

フォロー

John Hershey

Google (formerly MERL, IBM, MSR, UCSD)

確認したメールアドレス: google.com

machine learning sound separation speech recognition audio-visual perception


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Deep clustering: Discriminative embeddings for segmentation and separation JR Hershey, Z Chen, J Le Roux, S Watanabe 2016 IEEE international conference on acoustics, speech and signal …, 2016	1555	2016
Approximating the Kullback Leibler divergence between Gaussian mixture models JR Hershey, PA Olsen 2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007	1432	2007
SDR–half-baked or well done? J Le Roux, S Wisdom, H Erdogan, JR Hershey ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	1223	2019
Hybrid CTC/attention architecture for end-to-end speech recognition S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	908	2017
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks H Erdogan, JR Hershey, S Watanabe, J Le Roux 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	753	2015
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ... Latent Variable Analysis and Signal Separation: 12th International …, 2015	690	2015
Deep unfolding: Model-based inspiration of novel deep architectures JR Hershey, JL Roux, F Weninger arXiv preprint arXiv:1409.2574, 2014	507	2014
Single-channel multi-speaker separation using deep clustering Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey arXiv preprint arXiv:1607.02173, 2016	491	2016
Attention-based multimodal fusion for video description C Hori, T Hori, TY Lee, Z Zhang, B Harsham, JR Hershey, TK Marks, ... Proceedings of the IEEE international conference on computer vision, 4193-4202, 2017	424	2017
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018	422	2018
Audio vision: Using audio-visual synchrony to locate sounds J Hershey, J Movellan Advances in neural information processing systems 12, 1999	383	1999
Improved MVDR beamforming using single-channel mask prediction networks. H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux Interspeech, 1981-1985, 2016	367	2016
Discriminatively trained recurrent neural networks for single-channel speech separation F Weninger, JR Hershey, J Le Roux, B Schuller 2014 IEEE global conference on signal and information processing (GlobalSIP …, 2014	360	2014
Full-capacity unitary recurrent neural networks S Wisdom, T Powers, J Hershey, J Le Roux, L Atlas Advances in Neural Information Processing Systems, 4880-4888, 2016	359	2016
Multi-channel deep clustering: Discriminative spectral and spatial embeddings for speaker-independent speech separation ZQ Wang, J Le Roux, JR Hershey 2018 IEEE International conference on acoustics, speech and signal …, 2018	267	2018
Monaural speech separation and recognition challenge M Cooke, JR Hershey, SJ Rennie Computer Speech & Language 24 (1), 1-15, 2010	247	2010
Deep beamforming networks for multi-channel speech recognition X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	222	2016
Alternative objective functions for deep clustering ZQ Wang, J Le Roux, JR Hershey 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	218	2018
Universal sound separation I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ... 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019	214	2019
Super-human multi-talker speech recognition: A graphical modeling approach JR Hershey, SJ Rennie, PA Olsen, TT Kristjansson Computer Speech & Language 24 (1), 45-66, 2010	212	2010

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者