Improving voice separation by incorporating end-to-end speech recognition N Takahashi, MK Singh, S Basak, P Sudarsanam, S Ganapathy, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 22 | 2020 |
Hierarchical diffusion models for singing voice neural vocoder N Takahashi, M Kumar, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 13 | 2023 |
Hierarchical disentangled representation learning for singing voice conversion N Takahashi, MK Singh, Y Mitsufuji 2021 International Joint Conference on Neural Networks (IJCNN), 1-7, 2021 | 10 | 2021 |
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing N Shah, MK Singh, N Takahashi, N Onoe ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Source Mixing and Separation Robust Audio Steganography N Takahashi, MK Singh, Y Mitsufuji arXiv preprint arXiv:2110.05054, 2021 | 3 | 2021 |
Robust one-shot singing voice conversion N Takahashi, MK Singh, Y Mitsufuji arXiv preprint arXiv:2210.11096, 2022 | 2 | 2022 |
NENET: An edge learnable network for link prediction in scene text MK Singh, S Banerjee, S Chaudhuri arXiv preprint arXiv:2005.12147, 2020 | 1 | 2020 |
Cross-modal Face-and Voice-style Transfer N Takahashi, MK Singh, Y Mitsufuji arXiv preprint arXiv:2302.13838, 2023 | | 2023 |
Iteratively Improving Speech Recognition and Voice Conversion MK Singh, N Takahashi, N Onoe INTERSPEECH 2023, https://arxiv.org/pdf/2305.15055.pdf, 0 | | |