Libri-light: A benchmark for asr with limited or no supervision J Kahn, M Riviere, W Zheng, E Kharitonov, Q Xu, PE Mazaré, J Karadayi, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 684 | 2020 |
AudioLM: a language modeling approach to audio generation Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 523 | 2023 |
On generative spoken language modeling from raw audio K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ... Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021 | 331 | 2021 |
Speech resynthesis from discrete disentangled self-supervised representations A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ... arXiv preprint arXiv:2104.00355, 2021 | 301 | 2021 |
Speak, Read and Prompt: High-fidelity Text-to-Speech with Minimal Supervision E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ... arXiv preprint arXiv:2302.03540, 2023 | 167 | 2023 |
Audiopalm: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023 | 154 | 2023 |
Compositionality and generalization in emergent languages R Chaabouni, E Kharitonov, D Bouchacourt, E Dupoux, M Baroni arXiv preprint arXiv:2004.09124, 2020 | 149 | 2020 |
Data augmenting contrastive learning of speech representations in the time domain E Kharitonov, M Rivière, G Synnaeve, L Wolf, PE Mazaré, M Douze, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 215-222, 2021 | 131 | 2021 |
Anti-efficient encoding in emergent communication R Chaabouni, E Kharitonov, E Dupoux, M Baroni Advances in Neural Information Processing Systems 32, 2019 | 124 | 2019 |
Text-free prosody-aware generative spoken language modeling E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ... arXiv preprint arXiv:2109.03264, 2021 | 118 | 2021 |
The zero resource speech benchmark 2021: Metrics and baselines for unsupervised spoken language modeling TA Nguyen, M de Seyssel, P Rozé, M Rivière, E Kharitonov, A Baevski, ... arXiv preprint arXiv:2011.11588, 2020 | 106 | 2020 |
Generative spoken dialogue language modeling TA Nguyen, E Kharitonov, J Copet, Y Adi, WN Hsu, A Elkahky, ... Transactions of the Association for Computational Linguistics 11, 250-266, 2023 | 96 | 2023 |
Soundstorm: Efficient parallel audio generation Z Borsos, M Sharifi, D Vincent, E Kharitonov, N Zeghidour, M Tagliasacchi arXiv preprint arXiv:2305.09636, 2023 | 91 | 2023 |
EGG: a toolkit for research on Emergence of lanGuage in Games E Kharitonov, R Chaabouni, D Bouchacourt, M Baroni arXiv preprint arXiv:1907.00852, 2019 | 82 | 2019 |
Textless speech emotion conversion using discrete and decomposed representations F Kreuk, A Polyak, J Copet, E Kharitonov, TA Nguyen, M Rivière, WN Hsu, ... arXiv preprint arXiv:2111.07402, 2021 | 66 | 2021 |
Communicating artificial neural networks develop efficient color-naming systems R Chaabouni, E Kharitonov, E Dupoux, M Baroni Proceedings of the National Academy of Sciences 118 (12), e2016569118, 2021 | 64 | 2021 |
The zero resource speech challenge 2021: Spoken language modelling E Dunbar, M Bernard, N Hamilakis, TA Nguyen, M De Seyssel, P Rozé, ... arXiv preprint arXiv:2104.14700, 2021 | 51 | 2021 |
Federated online learning to rank with evolution strategies E Kharitonov Proceedings of the Twelfth ACM International Conference on Web Search and …, 2019 | 42 | 2019 |
Entropy minimization in emergent languages E Kharitonov, R Chaabouni, D Bouchacourt, M Baroni arXiv preprint arXiv:1905.13687, 2019 | 38* | 2019 |
Word-order biases in deep-agent emergent communication R Chaabouni, E Kharitonov, A Lazaric, E Dupoux, M Baroni arXiv preprint arXiv:1905.12330, 2019 | 38 | 2019 |