Shigeki Karita
Shigeki Karita
確認したメール アドレス: google.com - ホームページ
タイトル
引用先
引用先
Espnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
3892018
A comparative study on transformer vs rnn in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
2092019
Improving Transformer-based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration
S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani
Proc. Interspeech 2019, 1408-1412, 2019
592019
Semi-Supervised End-to-End Speech Recognition
S Karita, S Watanabe, T Iwata, A Ogawa, M Delcroix
INTERSPEECH, 2-6, 2018
372018
Far-field speech recognition using CNN-DNN-HMM with convolution in time
T Yoshioka, S Karita, T Nakatani
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
292015
ESPnet-ST: All-in-one speech translation toolkit
H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ...
arXiv preprint arXiv:2004.10234, 2020
272020
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming
T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani
IEEE International Conference on Acoustics, Speech and Signal Processing, 2018
272018
Auxiliary feature based adaptation of end-to-end ASR systems
M Delcroix, S Watanabe, A Ogawa, S Karita, T Nakatani
INTERSPEECH, 2018
242018
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming
S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ...
2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017
142017
Semi-Supervised End-to-End Speech Recognition Using Text-to-Speech and Autoencoders
S Karita, S Watanabe, T Iwata, M Delcroix, A Ogawa, T Nakatani
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019
132019
Rescoring n-best speech recognition list based on one-on-one hypothesis comparison using encoder-classifier model
A Ogawa, M Delcroix, S Karita, T Nakatani
IEEE International Conference on Acoustics, Speech and Signal Processing, 2018
132018
Sequence training of encoder-decoder model using policy gradient for end-to-end speech recognition
S Karita, A Ogawa, M Delcroix, T Nakatani
IEEE International Conference on Acoustics, Speech and Signal Processing, 2018
122018
ESPnet: End-to-End Speech Processing Toolkit. arXiv 2018
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 0
8
End-to-End SpeakerBeam for Single Channel Target Speech Recognition.
M Delcroix, S Watanabe, T Ochiai, K Kinoshita, S Karita, A Ogawa, ...
Interspeech, 451-455, 2019
52019
Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders.
A Ogawa, M Delcroix, S Karita, T Nakatani
INTERSPEECH, 3900-3904, 2019
42019
Owner authentication for mobile devices using motion gestures based on multi-owner template update
S Karita, K Nakamura, K Kono, Y Ito, N Babaguchi
2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 1-6, 2015
42015
Self-Distillation for Improving CTC-Transformer-based ASR Systems
T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ...
Proc. Interspeech 2020, 546-550, 2020
22020
Forward-Backward Convolutional LSTM for Acoustic Modeling
S Karita, A Ogawa, M Delcroix, T Nakatani
INTERSPEECH, 1601-1605, 2017
22017
Video forgery detection using a time series model in dynamic scenes
S Karita, K Kono, N Babaguchi
IEICE Technical Report; IEICE Tech. Rep. 115 (479), 25-30, 2016
22016
Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling
DT Tran, M Delcroix, S Karita, M Hentschel, A Ogawa, T Nakatani
INTERSPEECH, 1596-1600, 2017
12017
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20