フォロー
Kazuki Shimada
Kazuki Shimada
確認したメール アドレス: sony.com
タイトル
引用先
引用先
ACCDOA: Activity-coupled cartesian direction of arrival representation for sound event localization and detection
K Shimada, Y Koyama, N Takahashi, S Takahashi, Y Mitsufuji
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1122021
Multi-accdoa: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training
K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
902022
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ...
arXiv preprint arXiv:2206.01948, 2022
872022
Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition
K Shimada, Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (5), 960-971, 2019
652019
STARSS23: An audio-visual dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
K Shimada, A Politis, P Sudarsanam, DA Krause, K Uchida, S Adavanne, ...
Advances in Neural Information Processing Systems 36, 2024
362024
Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection
K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ...
arXiv preprint arXiv:2106.10806, 2021
292021
Metric learning with background noise class for few-shot detection of rare sound events
K Shimada, Y Koyama, A Inoue
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
272020
Sound event localization and detection using activity-coupled cartesian DOA vector and RD3Net
K Shimada, N Takahashi, S Takahashi, Y Mitsufuji
arXiv preprint arXiv:2006.12014, 2020
222020
Unsupervised beamforming based on multichannel nonnegative matrix factorization for noisy speech recognition
K Shimada, Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
172018
Spatial data augmentation with simulated room impulse responses for sound event localization and detection
Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
152022
An attention-based approach to hierarchical multi-label music instrument classification
Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Diffusion-based speech enhancement with joint generative and predictive decoders
H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
102024
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ...
arXiv preprint arXiv:2401.00365, 2023
82023
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection
R Falcón-Pérez, K Shimada, Y Koyama, S Takahashi, Y Mitsufuji
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
72022
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023
A Politis, K Shimada, P Sudarsanam, A Hakala, S Takahashi, DA Krause, ...
Mar, 2023
62023
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.
M Mimura, Y Bando, K Shimada, S Sakai, K Yoshii, T Kawahara
INTERSPEECH, 2451-2455, 2017
52017
Extending audio masked autoencoders toward audio restoration
Z Zhong, H Shi, M Hirano, K Shimada, K Tateishi, T Shibuya, S Takahashi, ...
2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023
42023
Zero-and Few-Shot Sound Event Localization and Detection
K Shimada, K Uchida, Y Koyama, T Shibuya, S Takahashi, Y Mitsufuji, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
Diffusion-based Signal Refiner for Speech Separation
M Hirano, K Shimada, Y Koyama, S Takahashi, Y Mitsufuji
arXiv preprint arXiv:2305.05857, 2023
32023
Music Foundation Model as Generic Booster for Music Downstream Tasks
WH Liao, Y Takida, Y Ikemiya, Z Zhong, CH Lai, G Fabbro, K Shimada, ...
arXiv preprint arXiv:2411.01135, 2024
2024
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20