Yuki Takashima
Verified email at hitachi.com
Title
Cited by
Year
Lip reading using a dynamic feature of lip images and convolutional neural networks
Y Li, Y Takashima, T Takiguchi, Y Ariki
2016 IEEE/ACIS 15th International Conference on Computer and Information …, 2016
Cited by: 49 | Year: 2016
End-to-end dysarthric speech recognition using multiple databases
Y Takashima, T Takiguchi, Y Ariki
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 38 | Year: 2019
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap
S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ...
arXiv preprint arXiv:2102.01363, 2021
Cited by: 37 | Year: 2021
Towards neural diarization for unlimited numbers of speakers using global and local attractors
S Horiguchi, S Watanabe, P García, Y Xue, Y Takashima, Y Kawaguchi
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 98-105, 2021
Cited by: 36 | Year: 2021
Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss.
Y Takashima, R Aihara, T Takiguchi, Y Ariki, N Mitani, K Omori, ...
Interspeech, 277-281, 2016
Cited by: 33 | Year: 2016
Knowledge transferability between the speech data of persons with dysarthria speaking different languages for dysarthric speech recognition
Y Takashima, R Takashima, T Takiguchi, Y Ariki
IEEE Access 7, 164320-164326, 2019
Cited by: 29 | Year: 2019
Feature extraction using pre-trained convolutive bottleneck nets for dysarthric speech recognition
Y Takashima, T Nakashika, T Takiguchi, Y Ariki
2015 23rd European Signal Processing Conference (EUSIPCO), 1411-1415, 2015
Cited by: 27 | Year: 2015
Online streaming end-to-end neural diarization handling overlapping speech and flexible numbers of speakers
Y Xue, S Horiguchi, Y Fujita, Y Takashima, S Watanabe, P Garcia, ...
arXiv preprint arXiv:2101.08473, 2021
Cited by: 22 | Year: 2021
End-to-end speaker diarization conditioned on speech activity and overlap detection
Y Takashima, Y Fujita, S Watanabe, S Horiguchi, P García, K Nagamatsu
2021 IEEE Spoken Language Technology Workshop (SLT), 849-856, 2021
Cited by: 21 | Year: 2021
Semi-supervised training with pseudo-labeling for end-to-end neural diarization
Y Takashima, Y Fujita, S Horiguchi, S Watanabe, P García, K Nagamatsu
arXiv preprint arXiv:2106.04764, 2021
Cited by: 15 | Year: 2021
Multi-channel end-to-end neural diarization with distributed microphones
S Horiguchi, Y Takashima, P Garcia, S Watanabe, Y Kawaguchi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 14 | Year: 2022
Audio-visual speech recognition using convolutive bottleneck networks for a person with severe hearing loss
Y Takashima, Y Kakihara, R Aihara, T Takiguchi, Y Ariki, N Mitani, ...
IPSJ Transactions on Computer Vision and Applications 7, 64-68, 2015
Cited by: 14 | Year: 2015
Online neural diarization of unlimited numbers of speakers using global and local attractors
S Horiguchi, S Watanabe, P García, Y Takashima, Y Kawaguchi
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 706-720, 2022
Cited by: 13 | Year: 2022
Dysarthric Speech Recognition Based on Deep Metric Learning.
Y Takashima, R Takashima, T Takiguchi, Y Ariki
INTERSPEECH, 4796-4800, 2020
Cited by: 7 | Year: 2020
Audio-visual speech recognition for a person with severe hearing loss using deep canonical correlation analysis
Y Takashima, T Takiguchi, Y Ariki, K Omori
Proc. 1st Int. Workshop Challenges Hearing Assistive Technol., 77-81, 2017
Cited by: 7 | Year: 2017
Experimental studies on the ultra-fine structure of sea urchin eggplasm by ultracentrifugation. 1. Ultra-fine structure of the mature unfertilized egg
R Takashima, S Katsura, Y Takashima
The Tokushima journal of experimental medicine 8, 252-262, 1961
Cited by: 5 | Year: 1961
Updating only encoders prevents catastrophic forgetting of end-to-end ASR models
Y Takashima, S Horiguchi, S Watanabe, P García, Y Kawaguchi
arXiv preprint arXiv:2207.00216, 2022
Cited by: 4 | Year: 2022
Exemplar-based lip-to-speech synthesis using convolutional neural networks
Y Takashima, T Takiguchi, Y Ariki
Proc. IW-FCV, 2019
Cited by: 4 | Year: 2019
Online neural diarization of unlimited numbers of speakers
S Horiguchi, S Watanabe, P Garcia, Y Takashima, Y Kawaguchi
ArXiv abs/2206.02432, 2022
Cited by: 3 | Year: 2022
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation
Y Takashima, R Takashima, R Tsunoda, R Aihara, T Takiguchi, Y Ariki, ...
EURASIP Journal on Audio, Speech, and Music Processing 2021, 1-9, 2021
Cited by: 3 | Year: 2021