A Wavenet for speech denoising D Rethage, J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 550 | 2018 |
Fsd50k: an open dataset of human-labeled sound events E Fonseca, X Favory, J Pons, F Font, X Serra IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 829-852, 2021 | 493 | 2021 |
Freesound Datasets: a platform for the creation of open audio datasets E Fonseca, J Pons, X Favory, F Font, D Bogdanov, A Ferraro, S Oramas, ... International Society for Music Information Retrieval Conference (ISMIR), 2017 | 279 | 2017 |
End-to-end learning for music audio tagging at scale J Pons, O Nieto, M Prockup, E Schmidt, A Ehmann, X Serra International Society for Music Information Retrieval Conference (ISMIR), 2018 | 256 | 2018 |
General-purpose tagging of freesound audio with audioset labels: Task description, dataset, and baseline E Fonseca, M Plakal, F Font, DPW Ellis, X Favory, J Pons, X Serra DCASE Workshop, 2018 | 205 | 2018 |
Experimenting with musically motivated convolutional neural networks J Pons, T Lidy, X Serra International Workshop on Content-Based Multimedia Indexing (CBMI), 1-6, 2016 | 202 | 2016 |
Timbre analysis of music audio signals with convolutional neural networks J Pons, O Slizovskaia, E Gómez Gutiérrez, X Serra European Signal Processing Conference (EUSIPCO), 2813-7, 2017 | 167 | 2017 |
Randomly weighted CNNs for (music) audio classification J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 124 | 2019 |
MusiCNN: pre-trained convolutional neural networks for music audio tagging J Pons, X Serra Late breaking/demo session of the International Society for Music …, 2019 | 119 | 2019 |
Universal speech enhancement with score-based diffusion J Serrà, S Pascual, J Pons, RO Araz, D Scaini arXiv preprint arXiv:2206.03065, 2022 | 95 | 2022 |
End-to-end music source separation: Is it possible in the waveform domain? F Lluís, J Pons, X Serra arXiv preprint arXiv:1810.12187, 2018 | 94 | 2018 |
Designing efficient architectures for modeling temporal features with convolutional neural networks J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 | 85 | 2017 |
Training neural audio classifiers with few data J Pons, J Serrà, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 77 | 2019 |
Upsampling artifacts in neural audio synthesis J Pons, S Pascual, G Cengarle, J Serrà ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 76 | 2021 |
Remixing music using source separation algorithms to improve the musical experience of cochlear implant users J Pons, J Janer, T Rode, W Nogueira The Journal of the Acoustical Society of America 140 (6), 4338-4349, 2016 | 73 | 2016 |
Automatic multitrack mixing with a differentiable mixing console of neural audio effects CJ Steinmetz, J Pons, S Pascual, J Serra ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 61 | 2021 |
An empirical study of Conv-TasNet B Kadioglu, M Horgan, X Liu, J Pons, D Darcy, V Kumar International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020 | 60* | 2020 |
Fast timing-conditioned latent audio diffusion Z Evans, CJ Carr, J Taylor, SH Hawley, J Pons arXiv preprint arXiv:2402.04825, 2024 | 58 | 2024 |
SESQA: semi-supervised learning for speech quality assessment J Serrà, J Pons, S Pascual ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 54 | 2021 |
TensorFlow Audio Models in Essentia P Alonso-Jiménez, D Bogdanov, J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020 | 44 | 2020 |