Shigeki Karita

Cited by

	All	Since 2019
Citations	3428	3371
h-index	19	19
i10-index	25	25

Citations per year
Year	Citations
2017	10
2018	37
2019	188
2020	442
2021	776
2022	743
2023	843
2024	375

Co-authors

Shinji Watanabe - Carnegie Mellon University - Verified email at cmu.edu
Tomohiro Nakatani - NTT Communication Science Laboratories - Verified email at ieee.org
Marc Delcroix - NTT Communication Science Laboratories - Verified email at ieee.org
Tomoki Hayashi - Human Dataware Lab. Co., Ltd., Nagoya University - Verified email at g.sp.m.is.nagoya-u.ac.jp
Atsunori Ogawa - NTT Communication Science Laboratories - Verified email at ieee.org
Takaaki Hori - Apple - Verified email at apple.com
Hirofumi Inaguma - Fundamental AI Research (FAIR) at Meta - Verified email at meta.com
Nanxin Chen - Member of Technical Staff - Verified email at openai.com
Michiel Bacchiani - Google Inc. - Verified email at google.com
Jiro Nishitoba - Retrieva, Inc. - Verified email at retrieva.jp
Wangyou Zhang - Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Jahn Heymann - Applied Scientist @ Amazon - Verified email at amazon.com
Yuma Koizumi - Google - Verified email at google.com
Ryuichi Yamamoto - LY Corporation - Verified email at lycorp.co.jp
Xiaofei Wang - Microsoft - Verified email at jhu.edu
Ziyan Jiang - Amazon AGI - Verified email at amazon.com
Keisuke Kinoshita - Research Scientist at Google - Verified email at ieee.org
Tomoharu Iwata - NTT - Verified email at hco.ntt.co.jp
Yotaro Kubo - Google Speech - Verified email at ieee.org
Nobutaka Ito - University of Tokyo, Japan (formerly NTT) - Verified email at k.u-tokyo.ac.jp

Shigeki Karita

Google

Verified email at google.com - Homepage

Machine Learning, Speech Recognition


Title	Cited by	Year
ESPnet: End-to-end speech processing toolkit. S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	1548	2018
A comparative study on transformer vs rnn in speech applications. S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	802	2019
Improving Transformer-based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration. S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani. Proc. Interspeech 2019, 1408-1412, 2019	254	2019
ESPnet-ST: All-in-one speech translation toolkit. H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ... arXiv preprint arXiv:2004.10234, 2020	162	2020
Semi-Supervised End-to-End Speech Recognition. S Karita, S Watanabe, T Iwata, A Ogawa, M Delcroix. INTERSPEECH, 2-6, 2018	79	2018
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming. T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani. IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	63	2018
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans. S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... 2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021	53	2021
Semi-Supervised End-to-End Speech Recognition Using Text-to-Speech and Autoencoders. S Karita, S Watanabe, T Iwata, M Delcroix, A Ogawa, T Nakatani. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019	50	2019
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement. Y Koizumi, S Karita, S Wisdom, H Erdogan, JR Hershey, L Jones, ... 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021	46	2021
Auxiliary feature based adaptation of end-to-end ASR systems. M Delcroix, S Watanabe, A Ogawa, S Karita, T Nakatani. INTERSPEECH, 2018	46	2018
Far-field speech recognition using CNN-DNN-HMM with convolution in time. T Yoshioka, S Karita, T Nakatani. 2015 IEEE international conference on acoustics, speech and signal …, 2015	39	2015
Rescoring n-best speech recognition list based on one-on-one hypothesis comparison using encoder-classifier model. A Ogawa, M Delcroix, S Karita, T Nakatani. IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	27	2018
Sequence training of encoder-decoder model using policy gradient for end-to-end speech recognition. S Karita, A Ogawa, M Delcroix, T Nakatani. IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	27	2018
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020	24	2020
Knowledge transfer from large-scale pretrained language models to end-to-end speech recognizers. Y Kubo, S Karita, M Bacchiani. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	23	2022
End-to-End SpeakerBeam for Single Channel Target Speech Recognition. M Delcroix, S Watanabe, T Ochiai, K Kinoshita, S Karita, A Ogawa, ... Interspeech, 451-455, 2019	23	2019
Espnet: End-to-end speech processing toolkit. arXiv 2018. S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	20	2018
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming. S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ... 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017	20	2017
Libritts-r: A restored multi-speaker text-to-speech corpus. Y Koizumi, H Zen, S Karita, Y Ding, K Yatabe, N Morioka, M Bacchiani, ... arXiv preprint arXiv:2305.18802, 2023	19	2023
Learning device, learning method, and learning program. A Ogawa, M Delcroix, S Karita, T Nakatani. US Patent App. 16/966,056, 2020	19	2020

Articles 1–20