Thilo von Neumann

Cited by

	All	Since 2019
Citations	376	376
h-index	9	9
i10-index	9	9

120

20192020202120222023202418 30 76 77 113 61

Public access

View all

5 articles

1 article

available

not available

Based on funding mandates

Co-authors

Reinhold Haeb-UmbachProfessor of Communications Engineering, University of PaderbornVerified email at nt.uni-paderborn.de
Marc DelcroixNTT Communication Science LaboratoriesVerified email at ieee.org
Keisuke KinoshitaResearch Scientist at GoogleVerified email at ieee.org
Tomohiro NakataniNTT Communication Science LaboratoriesVerified email at ieee.org
Lukas DrudeApplied Scientist @ Amazon AlexaVerified email at amazon.com
Shoko ArakiNTT Communication Science LaboratoriesVerified email at ieee.org

Thilo von Neumann

PhD student, Paderborn University

Verified email at nt.upb.de

Blind source separation deep neural networks


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
All-neural online source separation, counting, and diarization for meeting analysis T Von Neumann, K Kinoshita, M Delcroix, S Araki, T Nakatani, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	107	2019
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR T von Neumann, C Boeddeker, L Drude, K Kinoshita, M Delcroix, ... arXiv preprint arXiv:2006.02786, 2020	46	2020
End-to-end training of time domain audio separation and recognition T von Neumann, K Kinoshita, L Drude, C Boeddeker, M Delcroix, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	40	2020
Deep attractor networks for speaker re-identification and blind source separation L Drude, T von Neumann, R Haeb-Umbach 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	36	2018
Monaural source separation: From anechoic to reverberant environments T Cord-Landwehr, C Boeddeker, T Von Neumann, C Zorilă, R Doddipatla, ... 2022 international workshop on acoustic signal enhancement (IWAENC), 1-5, 2022	28	2022
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers T von Neumann, K Kinoshita, C Boeddeker, M Delcroix, R Haeb-Umbach arXiv preprint arXiv:2107.14446, 2021	26	2021
On word error rate definitions and their efficient computation for multi-speaker speech recognition systems T von Neumann, C Boeddeker, K Kinoshita, M Delcroix, R Haeb-Umbach ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	19	2023
SA-SDR: A novel loss function for separation of meeting style data T von Neumann, K Kinoshita, C Boeddeker, M Delcroix, R Haeb-Umbach ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	17	2022
Multi-path RNN for hierarchical modeling of long sequential data and its application to speaker stream separation K Kinoshita, T von Neumann, M Delcroix, T Nakatani, R Haeb-Umbach arXiv preprint arXiv:2006.13579, 2020	10	2020
MMS-MSG: A multi-purpose multi-speaker mixture signal generator T Cord-Landwehr, T Von Neumann, C Boeddeker, R Haeb-Umbach 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 1-5, 2022	9	2022
An initialization scheme for meeting separation with spatial mixture models C Boeddeker, T Cord-Landwehr, T von Neumann, R Haeb-Umbach arXiv preprint arXiv:2204.01338, 2022	8	2022
Speeding up permutation invariant training for source separation T von Neumann, C Boeddeker, K Kinoshita, M Delcroix, R Haeb-Umbach Speech Communication; 14th ITG Conference, 1-5, 2021	7	2021
A meeting transcription system for an ad-hoc acoustic sensor network T Gburrek, C Boeddeker, T von Neumann, T Cord-Landwehr, ... arXiv preprint arXiv:2205.00944, 2022	6	2022
Segment-less continuous speech separation of meetings: Training and evaluation criteria T von Neumann, K Kinoshita, C Boeddeker, M Delcroix, R Haeb-Umbach IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 576-589, 2022	5	2022
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT K Kinoshita, T von Neumann, M Delcroix, C Boeddeker, R Haeb-Umbach arXiv preprint arXiv:2207.13888, 2022	4	2022
Meeting recognition with continuous speech separation and transcription-supported diarization T von Neumann, C Boeddeker, T Cord-Landwehr, M Delcroix, ... arXiv preprint arXiv:2309.16482, 2023	3	2023
Meeteval: A toolkit for computation of word error rates for meeting transcription systems T von Neumann, C Boeddeker, M Delcroix, R Haeb-Umbach arXiv preprint arXiv:2307.11394, 2023	3	2023
Multi-stage diarization refinement for the CHiME-7 DASR scenario CB Boeddeker, T Cord-Landwehr, T Neumann, R Haeb-Umbach Proc. CHiME 2023, 51-56, 2023	2	2023
Mixture Encoder Supporting Continuous Speech Separation for Meeting Recognition P Vieting, S Berger, T von Neumann, C Boeddeker, R Schlüter, ... arXiv preprint arXiv:2309.08454, 2023		2023
Estimation device, learning device, estimation method, learning method, and recording medium K Kinoshita, M Delcroix, T Nakatani, S Araki, L Drude, TC Von Neumann US Patent 11,456,003, 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors