フォロー
Yusuke Kida
Yusuke Kida
Dell Technologies
確認したメール アドレス: dell.com
タイトル
引用先
引用先
Sound source direction estimation apparatus, sound source direction estimation method and computer program product
N Ding, Y Kida
US Patent 9,473,849, 2016
1212016
Voice activity detection: Merging source and filter-based information
T Drugman, Y Stylianou, Y Kida, M Akamine
IEEE Signal Processing Letters 23 (2), 252-256, 2015
1032015
Voice activity detection based on optimally weighted combination of multiple features.
Y Kida, T Kawahara
INTERSPEECH, 2621-2624, 2005
512005
Television apparatus and a remote operation apparatus
K Ouchi, A Kawamura, M Sakai, K Suzuki, Y Kida
US Patent 9,154,848, 2015
312015
Neural diarization with non-autoregressive intermediate attractors
Y Fujita, T Komatsu, R Scheibler, Y Kida, T Ogawa
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
112023
Apparatus, method and computer program product for feature extraction
Y Kida, T Masuko
US Patent 8,073,686, 2011
102011
Apparatus and method for discriminating speech, and computer readable medium
K Suzuki, M Sakai, Y Kida
US Patent 9,330,682, 2016
92016
Evaluation of voice activity detection by combining multiple features with weight adaptation.
Y Kida, T Kawahara
INTERSPEECH, 2006
92006
Apparatus and method for discriminating speech of acoustic signal with exclusion of disturbance sound, and non-transitory computer readable medium
K Suzuki, M Sakai, Y Kida
US Patent 9,330,683, 2016
82016
Speaker selective beamformer with keyword mask estimation
Y Kida, D Tran, M Omachi, T Taniguchi, Y Fujita
2018 IEEE Spoken Language Technology Workshop (SLT), 528-534, 2018
72018
Minimum classification error interactive training for speaker identification [interactive robot applications]
Y Kida, H Yamamoto, C Miyajima, K Tokuda, T Kitamura
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005
72005
Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition
Y Kida, M Sakai, T Masuko, A Kawamura
Tenth Annual Conference of the International Speech Communication Association, 2009
62009
Simultaneous Detection and Localization of a Wake-Up Word Using Multi-Task Learning of the Duration and Endpoint.
T Maekaku, Y Kida, A Sugiyama
INTERSPEECH, 4240-4244, 2019
52019
Tourist guidance robot based on HyperCLOVA
T Yamazaki, K Yoshikawa, T Kawamoto, M Ohagi, T Mizumoto, S Ichimura, ...
arXiv preprint arXiv:2210.10400, 2022
42022
Multi-sequence intermediate conditioning for ctc-based asr
Y Fujita, T Komatsu, Y Kida
arXiv preprint, 2022
42022
Using duration and pitch for mandarin digit string recognition
R Zhao, Y Kida, X Yan, P Ding, L He
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
42010
Better intermediates improve CTC inference
T Komatsu, Y Fujita, J Lee, L Lee, S Watanabe, Y Kida
arXiv preprint arXiv:2204.00176, 2022
22022
InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Y Nakagome, T Komatsu, Y Fujita, S Ichimura, Y Kida
arXiv preprint arXiv:2204.00174, 2022
22022
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Y Kida, T Komatsu, M Togami
arXiv preprint arXiv:2104.10328, 2021
22021
Creating device, creating method, and non-transitory computer readable storage medium
Y Kida, D Tran
US Patent App. 16/131,561, 2019
22019
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20