Follow
Takuma Okamoto
Title
Cited by
Cited by
Year
Sound-space recording and binaural presentation system based on a 252-channel microphone array
S Sakamoto, S Hongo, T Okamoto, Y Iwaya, Y Suzuki
Acoustical Science and technology 36 (6), 516-526, 2015
412015
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders.
T Okamoto, T Toda, Y Shiga, H Kawai
INTERSPEECH, 1308-1312, 2019
362019
High order Ambisonic decoding method for irregular loudspeaker arrays
J Trevino, T Okamoto, Y Iwaya, Y Suzuki
Proceedings of 20th International Congress on Acoustics, 23-27, 2010
352010
Tacotron-based acoustic model using phoneme alignment for practical neural text-to-speech systems
T Okamoto, T Toda, Y Shiga, H Kawai
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
322019
An investigation of subband WaveNet vocoder covering entire audible frequency range with limited acoustic features
T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
322018
Experimental validation of spatial Fourier transform-based multiple sound zone generation with a linear loudspeaker array
T Okamoto, A Sakaguchi
The Journal of the Acoustical Society of America 141 (3), 1769-1780, 2017
322017
Quasi-periodic parallel WaveGAN: A non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network
YC Wu, T Hayashi, T Okamoto, H Kawai, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 792-806, 2021
262021
Estimation of sound source positions using a surrounding microphone array
T Okamoto, R Nishimura, Y Iwaya
Acoustical science and technology 28 (3), 181-189, 2007
262007
Text-to-speech synthesis
Y Shiga, J Ni, K Tachibana, T Okamoto
Speech-to-Speech Translation, 39-52, 2020
242020
Generation of multiple sound zones by spatial filtering in wavenumber domain using a linear array of loudspeakers
T Okamoto
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
242014
3D spatial sound systems compatible with human's active listening to realize rich high-level kansei information
Y Suzuki, T Okamoto, J Trevino, ZL Cui, Y Iwaya, S Sakamoto, M Otani
Interdisciplinary information sciences 18 (2), 71-82, 2012
242012
Transformer-based text-to-speech with weighted forced attention
T Okamoto, T Toda, Y Shiga, H Kawai
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
232020
Subband WaveNet with overlapped single-sideband filterbanks
T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
232017
Improving FFTNet vocoder with noise shaping and subband approaches
T Okamoto, T Toda, Y Shiga, H Kawai
2018 IEEE Spoken Language Technology Workshop (SLT), 304-311, 2018
222018
Analytical methods of generating multiple sound zones for open and baffled circular loudspeaker arrays
T Okamoto
2015 IEEE Workshop on Applications of Signal Processing to Audio and …, 2015
222015
2.5 D higher order ambisonics for a sound field described by angular spectrum coefficients
T Okamoto
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
192016
Multi-stream HiFi-GAN with data-driven waveform decomposition
T Okamoto, T Toda, H Kawai
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
162021
Analytical approach to 2.5 D sound field control using a circular double-layer array of fixed-directivity loudspeakers
T Okamoto
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
162017
Noise level limited sub-modeling for diffusion probabilistic vocoders
T Okamoto, T Toda, Y Shiga, H Kawai
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
152021
High-intelligibility speech synthesis for dysarthric speakers with LPCNet-based TTS and CycleVAE-based VC
K Matsubara, T Okamoto, R Takashima, T Takiguchi, T Toda, Y Shiga, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
152021
The system can't perform the operation now. Try again later.
Articles 1–20