Hirofumi Inaguma
Hirofumi Inaguma
Ph.D. student at Kyoto University
Verified email at sap.ist.i.kyoto-u.ac.jp - Homepage
Title
Cited by
Cited by
Year
A comparative study on transformer vs RNN in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
1422019
Acoustic-to-word attention-based model complemented with character-level CTC-based model
S Ueno, H Inaguma, M Mimura, T Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
322018
ESPnet-ST: All-in-One Speech Translation Toolkit
H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ...
arXiv preprint arXiv:2004.10234, 2020
202020
Multilingual end-to-end speech translation
H Inaguma, K Duh, T Kawahara, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
162019
Minimum latency training strategies for streaming sequence-to-sequence ASR
H Inaguma, Y Gaur, L Lu, J Li, Y Gong
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
142020
Leveraging sequence-to-sequence speech synthesis for enhancing acoustic-to-word speech recognition
M Mimura, S Ueno, H Inaguma, S Sakai, T Kawahara
2018 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2018
142018
Transfer learning of language-independent end-to-end asr with language model fusion
H Inaguma, J Cho, MK Baskar, T Kawahara, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
132019
Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC
H Inaguma, K Inoue, M Mimura, T Kawahara
INTERSPEECH, 1691-1695, 2017
82017
Prediction of ice-breaking between participants using prosodic features in the first meeting dialogue
H Inaguma, K Inoue, S Nakamura, K Takanashi, T Kawahara
Proceedings of the 2nd Workshop on Advancements in Social Signal Processing …, 2016
72016
Language model integration based on memory control for sequence to sequence speech recognition
J Cho, S Watanabe, T Hori, MK Baskar, H Inaguma, J Villalba, N Dehak
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
52019
The JHU/KyotoU speech translation system for IWSLT 2018
H Inaguma, X Zhang, Z Wang, A Renduchintala, S Watanabe, K Duh
International Workshop on Spoken Language Translation, 153-159, 2018
42018
Improving OOV detection and resolution with external language models in acoustic-to-word ASR
H Inaguma, M Mimura, S Sakai, T Kawahara
2018 IEEE Spoken Language Technology Workshop (SLT), 212-218, 2018
22018
Recent Developments on ESPnet Toolkit Boosted by Conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
arXiv preprint arXiv:2010.13956, 2020
12020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2008.03822, 2020
12020
Enhancing Monotonic Multihead Attention for Streaming ASR
H Inaguma, M Mimura, T Kawahara
arXiv preprint arXiv:2005.09394, 2020
12020
CTC-synchronous Training for Monotonic Attention Model
H Inaguma, M Mimura, T Kawahara
arXiv preprint arXiv:2005.04712, 2020
12020
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Y Higuchi, H Inaguma, S Watanabe, T Ogawa, T Kobayashi
arXiv preprint arXiv:2010.13270, 2020
2020
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
H Inaguma, Y Higuchi, K Duh, T Kawahara, S Watanabe
arXiv preprint arXiv:2010.13047, 2020
2020
End-to-end speech-to-dialog-act recognition
VT Dang, T Zhao, S Ueno, H Inaguma, T Kawahara
arXiv preprint arXiv:2004.11419, 2020
2020
An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition
H Inaguma, M Mimura, K Inoue, K Yoshii, T Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2018
The system can't perform the operation now. Try again later.
Articles 1–20