Naohiro Tawara
Naohiro Tawara
NTT Corporation
Verified email at ieee.org
Title
Cited by
Cited by
Year
Speaker invariant feature extraction for zero-resource languages with adversarial learning
T Tsuchiya, N Tawara, T Ogawa, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
232018
Improving speaker discrimination of target speech extraction with time-domain speakerbeam
M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
182020
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder.
N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 86-90, 2019
182019
Frame-level phoneme-invariant speaker embedding for text-independent speaker recognition on extremely short utterances
N Tawara, A Ogawa, T Iwata, M Delcroix, T Ogawa
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
102020
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
N Tawara, T Ogawa, S Watanabe, T Kobayashi
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
82012
Language model domain adaptation via recurrent neural networks with domain-shared and domain-specific representations
T Moriokal, N Tawara, T Ogawa, A Ogawa, T Iwata, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
72018
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds
K Kinoshita, M Delcroix, N Tawara
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
62021
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions
N Tawara, T Ogawa, T Kobayashi
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
62015
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages.
Y Higuchi, N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 266-270, 2019
52019
Speaker clustering based on utterance-oriented Dirichlet process mixture model
N Tawara, S Watanabe, T Ogawa, T Kobayashi
Twelfth Annual Conference of the International Speech Communication Association, 2011
52011
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model
N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi
Thirteenth Annual Conference of the International Speech Communication …, 2012
42012
Sequential fish catch forecasting using Bayesian state space models
Y Kokaki, N Tawara, T Kobayashi, K Hashimoto, T Ogawa
2018 24th International Conference on Pattern Recognition (ICPR), 776-781, 2018
32018
Adversarial autoencoder for reducing nonlinear distortion
N Tawara, T Kobayashi, M Fujieda, K Katagiri, T Yazu, T Ogawa
2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018
22018
A sampling-based speaker clustering using utterance-oriented Dirichlet process mixture model and its evaluation on large-scale data
N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi
APSIPA Transactions on Signal and Information Processing 4, 2015
22015
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech
K Kinoshita, M Delcroix, N Tawara
arXiv preprint arXiv:2105.09040, 2021
12021
Speaker age estimation using age-dependent insensitive loss
Y Kitagishi, H Kamiyama, A Ando, N Tawara, T Mori, S Kobashikawa
2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020
12020
Postfiltering Using an Adversarial Denoising Autoencoder with Noise-aware Training
N Tawara, H Tanabe, T Kobayashi, M Fujieda, K Katagiri, T Yazu, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
12019
Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations
H Ashikawa, N Tawara, A Ogawa, T Iwata, T Kobayashi, T Ogawa
2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017
12017
Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data
N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi
2013 IEEE International Workshop on Machine Learning for Signal Processing …, 2013
12013
Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation
N Tawara, A Ogawa, Y Kitagishi, H Kamiyama
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–20