フォロー
Daisuke Niizumi
Daisuke Niizumi
NTT Communication Science Laboratories
確認したメール アドレス: hco.ntt.co.jp
タイトル
引用先
引用先
Byol for audio: Self-supervised learning for general-purpose audio representation
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021
1912021
ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions
N Harada, D Niizumi, D Takeuchi, Y Ohishi, M Yasuda, S Saito
arXiv preprint arXiv:2106.02369, 2021
1812021
Description and discussion on DCASE 2022 challenge task 2: Unsupervised anomalous sound detection for machine condition monitoring applying domain generalization techniques
K Dohi, K Imoto, N Harada, D Niizumi, Y Koizumi, T Nishida, H Purohit, ...
arXiv preprint arXiv:2206.05876, 2022
1192022
Description and discussion on DCASE 2021 challenge task 2: Unsupervised anomalous sound detection for machine condition monitoring under domain shifted conditions
Y Kawaguchi, K Imoto, Y Koizumi, N Harada, D Niizumi, K Dohi, ...
arXiv preprint arXiv:2106.04492, 2021
962021
First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline
N Harada, D Niizumi, Y Ohishi, D Takeuchi, M Yasuda
2023 31st European Signal Processing Conference (EUSIPCO), 191-195, 2023
662023
Description and discussion on DCASE 2023 challenge task 2: First-shot unsupervised anomalous sound detection for machine condition monitoring
K Dohi, K Imoto, N Harada, D Niizumi, Y Koizumi, T Nishida, H Purohit, ...
arXiv preprint arXiv:2305.07828, 2023
642023
Masked spectrogram modeling using masked autoencoders for learning general-purpose audio representation
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
HEAR: Holistic Evaluation of Audio Representations, 1-24, 2022
602022
BYOL for audio: Exploring pre-trained general-purpose audio representations
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 137-151, 2022
522022
Audio captioning using pre-trained large-scale language model guided by audio-based similar caption retrieval
Y Koizumi, Y Ohishi, D Niizumi, D Takeuchi, M Yasuda
arXiv preprint arXiv:2012.07331, 2020
472020
Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Detection for Machine Condition Monitoring Under Domain Shifted Conditions.
Y Kawaguchi, K Imoto, Y Koizumi, N Harada, D Niizumi, K Dohi, ...
DCASE, 186-190, 2021
442021
Masked modeling duo: Learning representations by encouraging both networks to model the input
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
292023
Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
T Nishida, N Harada, D Niizumi, D Albertini, R Sannino, S Pradolini, ...
arXiv preprint arXiv:2406.07250, 2024
282024
Acoustic scene classification: A competition review
S Gharib, H Derrar, D Niizumi, T Senttula, J Tommola, T Heittola, ...
2018 IEEE 28th International Workshop on Machine Learning for Signal …, 2018
252018
Conceptbeam: Concept driven target speech extraction
Y Ohishi, M Delcroix, T Ochiai, S Araki, D Takeuchi, D Niizumi, A Kimura, ...
Proceedings of the 30th ACM International Conference on Multimedia, 4252-4260, 2022
182022
Audio difference captioning utilizing similarity-discrepancy disentanglement
D Takeuchi, Y Ohishi, D Niizumi, N Harada, K Kashino
arXiv preprint arXiv:2308.11923, 2023
72023
Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
62024
Composing general audio representation by fusing multilayer features of a pre-trained model
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
2022 30th European Signal Processing Conference (EUSIPCO), 200-204, 2022
62022
Masked modeling duo for speech: Specializing general-purpose audio representation to speech using denoising distillation
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
arXiv preprint arXiv:2305.14079, 2023
42023
Introducing auxiliary text query-modifier to content-based audio retrieval
D Takeuchi, Y Ohishi, D Niizumi, N Harada, K Kashino
arXiv preprint arXiv:2207.09732, 2022
42022
M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
D Niizumi, D Takeuchi, Y Ohishi, N Harada, M Yasuda, S Tsubaki, K Imoto
Interspeech, 57-61, 2024
22024
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20