フォロー
Ryuichi Yamamoto
Ryuichi Yamamoto
LY Corporation
確認したメール アドレス: lycorp.co.jp - ホームページ
タイトル
引用先
引用先
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
R Yamamoto, E Song, JM Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
8522020
A comparative study on transformer vs rnn in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
8012019
librosa/librosa: 0.6. 3
B McFee, M McVicar, S Balke, V Lostanlen, C Thom, C Raffel, D Lee, ...
URL: https://doi. org/10.5281/zenodo 2564164, 2019
358*2019
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit
T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...
ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020
2162020
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation
R Yamamoto, E Song, JM Kim
arXiv preprint arXiv:1904.04472, 2019
572019
Espnet2-tts: Extending the edge of tts research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
512021
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis
MJ Hwang, R Yamamoto, E Song, JM Kim
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
372021
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
R Yamamoto, E Song, MJ Hwang, JM Kim
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
212021
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss
E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim
2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021
212021
Semi-supervised speaker adaptation for end-to-end speech synthesis with pretrained models
K Inoue, S Hara, M Abe, T Hayashi, R Yamamoto, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
192020
Ryry: A real-time score-following automatic accompaniment playback system capable of real performances with errors, repeats and jumps
S Sako, R Yamamoto, T Kitamura
Active Media Technology: 10th International Conference, AMT 2014, Warsaw …, 2014
172014
Cross-speaker emotion transfer for low-resource text-to-speech using non-parallel voice conversion with pitch-shift data augmentation
R Terashima, R Yamamoto, E Song, Y Shirahata, HW Yoon, JM Kim, ...
arXiv preprint arXiv:2204.10020, 2022
162022
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model.
MJ Hwang, R Yamamoto, E Song, JM Kim
Interspeech, 2227-2231, 2021
152021
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis
K Futamata, B Park, R Yamamoto, K Tachibana
arXiv preprint arXiv:2104.12395, 2021
152021
Improving lpcnet-based text-to-speech with linear prediction-structured mixture density network
MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
152020
Score following handling performances with arbitrary repeats and skips and automatic accompaniment
E Nakamura, H Takeda, R Yamamoto, Y Saito, S Sako, S Sagayama
IPSJ Journal 54 (4), 1338-1349, 2013
152013
Period vits: Variational inference with explicit pitch modeling for end-to-end emotional speech synthesis
Y Shirahata, R Yamamoto, E Song, R Terashima, JM Kim, K Tachibana
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Language model-based emotion prediction methods for emotional speech synthesis systems
HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang
arXiv preprint arXiv:2206.15067, 2022
122022
Neural text-to-speech with a modeling-by-generation excitation vocoder
E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim
arXiv preprint arXiv:2008.00132, 2020
112020
Lightweight and high-fidelity end-to-end text-to-speech with multi-band generation and inverse short-time fourier transform
M Kawamura, Y Shirahata, R Yamamoto, K Tachibana
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20