Daisuke Niizumi

Cited by

	All	Since 2019
Citations	715	709
h-index	13	13
i10-index	13	13

Citations per year
Year	Citations
2018	3
2019	4
2020	6
2021	81
2022	177
2023	326
2024	115

Public access (based on funding mandates)
1 article available
0 articles not available

NTT Communication Science Laboratories

Verified email at hco.ntt.co.jp

Self-supervised learning, Representation learning, General-purpose audio representation


Title	Authors	Venue	Cited by	Year
BYOL for audio: Self-supervised learning for general-purpose audio representation	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021	138	2021
ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions	N Harada, D Niizumi, D Takeuchi, Y Ohishi, M Yasuda, S Saito	arXiv preprint arXiv:2106.02369, 2021	130	2021
Description and discussion on DCASE 2022 challenge task 2: Unsupervised anomalous sound detection for machine condition monitoring applying domain generalization techniques	K Dohi, K Imoto, N Harada, D Niizumi, Y Koizumi, T Nishida, H Purohit, ...	arXiv preprint arXiv:2206.05876, 2022	89	2022
Description and discussion on DCASE 2021 challenge task 2: Unsupervised anomalous sound detection for machine condition monitoring under domain shifted conditions	Y Kawaguchi, K Imoto, Y Koizumi, N Harada, D Niizumi, K Dohi, ...	arXiv preprint arXiv:2106.04492, 2021	81	2021
Masked spectrogram modeling using masked autoencoders for learning general-purpose audio representation	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	HEAR: Holistic Evaluation of Audio Representations, 1-24, 2022	47	2022
Audio captioning using pre-trained large-scale language model guided by audio-based similar caption retrieval	Y Koizumi, Y Ohishi, D Niizumi, D Takeuchi, M Yasuda	arXiv preprint arXiv:2012.07331, 2020	38	2020
Description and discussion on DCASE 2023 challenge task 2: First-shot unsupervised anomalous sound detection for machine condition monitoring	K Dohi, K Imoto, N Harada, D Niizumi, Y Koizumi, T Nishida, H Purohit, ...	arXiv preprint arXiv:2305.07828, 2023	33	2023
Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Under Domain Shifted Conditions	Y Kawaguchi, K Imoto, Y Koizumi, N Harada, D Niizumi, K Dohi, ...	DCASE, 186-190, 2021	32	2021
First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline	N Harada, D Niizumi, Y Ohishi, D Takeuchi, M Yasuda	2023 31st European Signal Processing Conference (EUSIPCO), 191-195, 2023	31	2023
BYOL for audio: Exploring pre-trained general-purpose audio representations	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 137-151, 2022	30	2022
Acoustic scene classification: A competition review	S Gharib, H Derrar, D Niizumi, T Senttula, J Tommola, T Heittola, ...	2018 IEEE 28th International Workshop on Machine Learning for Signal …, 2018	24	2018
Masked modeling duo: Learning representations by encouraging both networks to model the input	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	16	2023
ConceptBeam: Concept driven target speech extraction	Y Ohishi, M Delcroix, T Ochiai, S Araki, D Takeuchi, D Niizumi, A Kimura, ...	Proceedings of the 30th ACM International Conference on Multimedia, 4252-4260, 2022	14	2022
Composing general audio representation by fusing multilayer features of a pre-trained model	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	2022 30th European Signal Processing Conference (EUSIPCO), 200-204, 2022	6	2022
Audio difference captioning utilizing similarity-discrepancy disentanglement	D Takeuchi, Y Ohishi, D Niizumi, N Harada, K Kashino	arXiv preprint arXiv:2308.11923, 2023	2	2023
Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	arXiv preprint arXiv:2305.14079, 2023	2	2023
Introducing auxiliary text query-modifier to content-based audio retrieval	D Takeuchi, Y Ohishi, D Niizumi, N Harada, K Kashino	arXiv preprint arXiv:2207.09732, 2022	2	2022
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework	D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024		2024
Heating cooker	D Niizumi	US Patent App. 18/523,070, 2024		2024
Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval	S Tsubaki, D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Imoto	arXiv preprint arXiv:2403.10756, 2024		2024
