Espnet: End-to-end speech processing toolkit S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018 | 329 | 2018 |
Hybrid CTC/Attention Architecture for End-to-End Speech Recognition S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017 | 223 | 2017 |
Speaker-dependent wavenet vocoder. A Tamamori, T Hayashi, K Kobayashi, K Takeda, T Toda Interspeech 2017, 1118-1122, 2017 | 210 | 2017 |
A comparative study on Transformer vs RNN in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... arXiv preprint arXiv:1909.06317, 2019 | 158 | 2019 |
Statistical Voice Conversion with WaveNet-Based Waveform Generation. K Kobayashi, T Hayashi, A Tamamori, T Toda Interspeech, 1138-1142, 2017 | 88 | 2017 |
Exploring multi-channel features for denoising-autoencoder-based speech enhancement S Araki, T Hayashi, M Delcroix, M Fujimoto, K Takeda, T Nakatani 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 83 | 2015 |
An investigation of multi-speaker training for WaveNet vocoder T Hayashi, A Tamamori, K Kobayashi, K Takeda, T Toda 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 82 | 2017 |
Multi-channel speech recognition: Lstms all the way through H Erdogan, T Hayashi, JR Hershey, T Hori, C Hori, WN Hsu, S Kim, ... CHiME-4 workshop, 1-4, 2016 | 57 | 2016 |
Duration-controlled LSTM for polyphonic sound event detection T Hayashi, S Watanabe, T Toda, T Hori, J Le Roux, K Takeda IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 25 …, 2017 | 52 | 2017 |
Espnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 41 | 2020 |
Back-translation-style data augmentation for end-to-end ASR T Hayashi, S Watanabe, Y Zhang, T Toda, T Hori, R Astudillo, K Takeda 2018 IEEE Spoken Language Technology Workshop (SLT), 426-433, 2018 | 41 | 2018 |
Bidirectional LSTM-HMM hybrid system for polyphonic sound event detection T Hayashi, S Watanabe, T Toda, T Hori, J Le Roux, K Takeda Proceedings of the Detection and Classification of Acoustic Scenes and …, 2016 | 38 | 2016 |
Cycle-consistency training for end-to-end speech recognition T Hori, R Astudillo, T Hayashi, Y Zhang, S Watanabe, J Le Roux ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 32 | 2019 |
Daily activity recognition based on DNN using environmental sound and acceleration signals T Hayashi, M Nishida, N Kitaoka, K Takeda 2015 23rd European Signal Processing Conference (EUSIPCO), 2306-2310, 2015 | 28 | 2015 |
The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018. YC Wu, PL Tobing, T Hayashi, K Kobayashi, T Toda Odyssey, 211-218, 2018 | 26 | 2018 |
ESPnet-ST: All-in-One Speech Translation Toolkit H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ... arXiv preprint arXiv:2004.10234, 2020 | 22 | 2020 |
Non-parallel voice conversion with cyclic variational autoencoder PL Tobing, YC Wu, T Hayashi, K Kobayashi, T Toda arXiv preprint arXiv:1907.10185, 2019 | 21 | 2019 |
Collapsed speech segment detection and suppression for WaveNet vocoder YC Wu, K Kobayashi, T Hayashi, PL Tobing, T Toda arXiv preprint arXiv:1804.11055, 2018 | 21 | 2018 |
Voice transformer network: Sequence-to-sequence voice conversion using transformer with text-to-speech pretraining WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda arXiv preprint arXiv:1912.06813, 2019 | 17 | 2019 |
Voice conversion with cyclic recurrent neural network and fine-tuned WaveNet vocoder PL Tobing, YC Wu, T Hayashi, K Kobayashi, T Toda ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 17 | 2019 |