Foley sound synthesis at the dcase 2023 challenge K Choi, J Im, L Heller, B McFee, K Imoto, Y Okamoto, M Lagrange, ... arXiv preprint arXiv:2304.12521, 2023 | 35 | 2023 |
Onoma-to-wave: Environmental sound synthesis from onomatopoeic words Y Okamoto, K Imoto, S Takamichi, R Yamanishi, T Fukumori, Y Yamashita APSIPA Transactions on Signal and Information Processing 11 (1), 2022 | 17 | 2022 |
Environmental Sound Extraction Using Onomatopoeic Words Y Okamoto, S Horiguchi, M Yamamoto, K Imoto, Y Kawaguchi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 13* | 2022 |
Correlation of Fr\'echet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant M Tailleur, J Lee, M Lagrange, K Choi, LM Heller, K Imoto, Y Okamoto arXiv preprint arXiv:2403.17508, 2024 | 12 | 2024 |
Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images H Ohnaka, S Takamichi, K Imoto, Y Okamoto, K Fujii, H Saruwatari ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Overview of tasks and investigation of subjective evaluation methods in environmental sound synthesis and conversion Y Okamoto, K Imoto, T Komatsu, S Takamichi, T Yagyu, R Yamanishi, ... arXiv preprint arXiv:1908.10055, 2019 | 8 | 2019 |
Audio-change captioning to explain machine-sound anomalies S Tsubaki, Y Kawaguchi, T Nishida, K Imoto, Y Okamoto, K Dohi, T Endo Proceedings of the DCASE 2023 Workshop, 2023 | 7 | 2023 |
Sound event detection guided by semantic contexts of scenes N Tonami, K Imoto, R Nagase, Y Okamoto, T Fukumori, Y Yamashita ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 7 | 2022 |
Crow Call Detection Using Gated Convolutional Recurrent Neural Network Y Okamoto, K Imoto, N Tsukahara, K Nagata, K Sueda, R Yamanishi, ... 2020 RISP International Workshop on Nonlinear Circuits, Communications and …, 2020 | 7 | 2020 |
Moving vehicle discrimination using Hough transformation Y Okamoto, I Matsunami, A Kajiwara 2011 IEEE Radio and Wireless Symposium, 367-370, 2011 | 7 | 2011 |
Sound event detection based on curriculum learning considering learning difficulty of events N Tonami, K Imoto, Y Okamoto, T Fukumori, Y Yamashita ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 6 | 2021 |
Pedestrian and two-wheeler detection using ultra-wideband vehicular radar Y Okamoto, I Matsunami, A Kajiwara 2012 IEEE Sensors Applications Symposium Proceedings, 1-4, 2012 | 6 | 2012 |
RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis Y Okamoto, K Imoto, S Takamichi, R Yamanishi, T Fukumori, Y Yamashita Detection and Classification of Acoustic Scenes and Events 2020 Workshop …, 2020 | 4 | 2020 |
3D Human Body Display by Using Depth Information for Remote Shared Mixed Reality Y Okamoto, I Kitahara, Y Ohta IEICE Technical Report; IEICE Tech. Rep. 109 (215), 53-58, 2009 | 4 | 2009 |
Environmental sound synthesis from vocal imitations and sound event labels Y Okamoto, K Imoto, S Takamichi, R Nagase, T Fukumori, Y Yamashita ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1* | 2024 |
CAPTDURE: Captioned Sound Dataset of Single Sources Y Okamoto, K Shimonishi, K Imoto, K Dohi, S Horiguchi, Y Kawaguchi arXiv preprint arXiv:2305.17758, 2023 | 1 | 2023 |
Audio-change captioning to explain machine-sound anomalies STY Kawaguchi, T Nishida, K Imoto, Y Okamoto, K Dohi, T Endo Proceedings of the DCASE 2023 Workshop, 2023 | 1 | 2023 |
Sound Event Detection Using Duration Robust Loss Function D Akiyama, K Imoto, N Tonami, Y Okamoto, R Yamanishi, T Fukumori, ... arXiv preprint arXiv:2006.15253, 2020 | 1 | 2020 |
Multiple Target Detection Classification Using Range gated Weight Pulse Integration for UWB Automotive Radar I Matsunami, Y Okamoto, A Kajiwara IEICE Technical Report; IEICE Tech. Rep. 111 (239), 111-115, 2011 | 1 | 2011 |
Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation J Lee, M Tailleur, LM Heller, K Choi, M Lagrange, B McFee, K Imoto, ... arXiv preprint arXiv:2410.17589, 2024 | | 2024 |