Mayank Kumar Singh

Cited by

	All	Since 2019
Citations	57	57
h-index	4	4
i10-index	3	3

202020212022202320245 13 11 17 11

Co-authors

Naoya TakahashiSONYVerified email at sony.com
Yuki MitsufujiDistinguished Engineer, Sony; Specially Appointed Associate Professor, Tokyo Institute of TechnologyVerified email at sony.com
Ganapathy SriramGoogle Research India; Associate Professor, Electrical Engineering, Indian Institute of Science.Verified email at iisc.ac.in
Parthasaarathy SudarsanamTampere UniversityVerified email at tuni.fi
Sakya BasakMicrosoftVerified email at microsoft.com
Nirmesh ShahSony Research IndiaVerified email at sony.com
Subhasis ChaudhuriIndian Institute of Technology BombayVerified email at ee.iitb.ac.in

Mayank Kumar Singh

Research Engineer at Sony Research India

Verified email at sony.com - Homepage

General Artificial Intelligence Machine Learning Audio Source Separation Graph Convolution


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Improving voice separation by incorporating end-to-end speech recognition N Takahashi, MK Singh, S Basak, P Sudarsanam, S Ganapathy, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	22	2020
Hierarchical diffusion models for singing voice neural vocoder N Takahashi, M Kumar, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	13	2023
Hierarchical disentangled representation learning for singing voice conversion N Takahashi, MK Singh, Y Mitsufuji 2021 International Joint Conference on Neural Networks (IJCNN), 1-7, 2021	10	2021
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing N Shah, MK Singh, N Takahashi, N Onoe ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023	6	2023
Source Mixing and Separation Robust Audio Steganography N Takahashi, MK Singh, Y Mitsufuji arXiv preprint arXiv:2110.05054, 2021	3	2021
Robust one-shot singing voice conversion N Takahashi, MK Singh, Y Mitsufuji arXiv preprint arXiv:2210.11096, 2022	2	2022
NENET: An edge learnable network for link prediction in scene text MK Singh, S Banerjee, S Chaudhuri arXiv preprint arXiv:2005.12147, 2020	1	2020
Cross-modal Face-and Voice-style Transfer N Takahashi, MK Singh, Y Mitsufuji arXiv preprint arXiv:2302.13838, 2023		2023
Iteratively Improving Speech Recognition and Voice Conversion MK Singh, N Takahashi, N Onoe INTERSPEECH 2023, https://arxiv.org/pdf/2305.15055.pdf, 0

The system can't perform the operation now. Try again later.

Articles 1–9

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors