Correlation of Fr\'echet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant M Tailleur, J Lee, M Lagrange, K Choi, LM Heller, K Imoto, Y Okamoto arXiv preprint arXiv:2403.17508, 2024 | 18 | 2024 |
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Y Chung, J Lee, J Nam arXiv preprint arXiv:2401.09294, 2024 | 13 | 2024 |
Music playlist title generation: A machine-translation approach S Doh, J Lee, J Nam arXiv preprint arXiv:2110.07354, 2021 | 7 | 2021 |
Music playlist title generation using artist information H Kim, S Doh, J Lee, J Nam arXiv preprint arXiv:2301.08145, 2023 | 5 | 2023 |
Video-foley: Two-stage video-to-sound generation via temporal event condition for foley sound J Lee, J Im, D Kim, J Nam arXiv preprint arXiv:2408.11915, 2024 | 4 | 2024 |
Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation J Lee, M Tailleur, LM Heller, K Choi, M Lagrange, B McFee, K Imoto, ... Audio Imagination: NeurIPS 2024 Workshop AI-Driven Speech, Music, and Sound …, 2024 | 1 | 2024 |
CONMOD: Controllable Neural Frame-based Modulation Effects G Lee, H Kim, J Lee, J Nam arXiv preprint arXiv:2406.13935, 2024 | 1 | 2024 |
Foley sound synthesis in waveform domain with diffusion model Y Chung, J Lee, J Nam Tech. Rep, 2023 | 1 | 2023 |
Sound Scene Synthesis at the DCASE 2024 Challenge M Lagrange, J Lee, M Tailleur, LM Heller, K Choi, B McFee, K Imoto, ... arXiv preprint arXiv:2501.08587, 2025 | | 2025 |
DCASE 2024 Challenge Task 7 Development Dataset: Environmental Sound Scene Synthesis K Choi, LM Heller, K Imoto, M Lagrange, J Lee, B McFee, Y Okamoto, ... Zenodo, 2024 | | 2024 |