Billy li (Juncheng)
Title
Cited by
Year
Very deep convolutional neural networks for raw waveforms
W Dai, C Dai, S Qu, J Li, S Das
2017 IEEE international conference on acoustics, speech and signal …, 2017
Cited by: 512, Year: 2017
Learning joint embedding with multimodal cues for cross-modal video-text retrieval
NC Mithun, J Li, F Metze, AK Roy-Chowdhury
Proceedings of the 2018 ACM on international conference on multimedia …, 2018
Cited by: 295, Year: 2018
Masked autoencoders that listen
PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ...
Advances in Neural Information Processing Systems 35, 28708-28720, 2022
Cited by: 223, Year: 2022
A comparison of five multiple instance learning pooling functions for sound event detection with weak labeling
Y Wang, J Li, F Metze
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2019
Cited by: 213, Year: 2019
Adversarial camera stickers: A physical camera-based attack on deep learning systems
J Li, FR Schmidt, JZ Kolter
Proceedings of the 36th International Conference on Machine Learning, 2019
Cited by: 200, Year: 2019
A comparison of deep learning methods for environmental sound detection
J Li, W Dai, F Metze, S Qu, S Das
2017 IEEE International conference on acoustics, speech and signal …, 2017
Cited by: 192, Year: 2017
Universal phone recognition with a multilingual allophone system
X Li, S Dalmia, J Li, M Lee, P Littell, J Yao, A Anastasopoulos, ...
ICASSP 2020, 2020
Cited by: 136, Year: 2020
Adversarial music: Real world audio adversary against wake-word detection system
J Li, S Qu, X Li, J Szurley, JZ Kolter, F Metze
Advances in Neural Information Processing Systems 32, 2019
Cited by: 86, Year: 2019
Real-time fine grained occupancy estimation using depth sensors on arm embedded platforms
S Munir, RS Arora, C Hesling, J Li, J Francis, C Shelton, C Martin, A Rowe, ...
2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS …, 2017
Cited by: 59, Year: 2017
Joint embeddings with multimodal cues for video-text retrieval
NC Mithun, J Li, F Metze, AK Roy-Chowdhury
International Journal of Multimedia Information Retrieval 8, 3-18, 2019
Cited by: 34, Year: 2019
Towards Zero-shot Learning for Automatic Phonemic Transcription
X Li, S Dalmia, DR Mortensen, J Li, AW Black, F Metze
AAAI 2020, 2020
Cited by: 33, Year: 2020
Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection
SY Tseng, J Li, Y Wang, J Szurley, F Metze, S Das
InterSpeech 2018, 2017
Cited by: 23, Year: 2017
Eventness: Object detection on spectrograms for temporal localization of audio events
P Pham, J Li, J Szurley, S Das
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 22, Year: 2018
Understanding audio pattern using convolutional neural network from raw waveforms
S Qu, J Li, W Dai, S Das
arXiv preprint arXiv:1611.09524, 2016
Cited by: 21, Year: 2016
Comparing the max and noisy-or pooling functions in multiple instance learning for weakly supervised sequence learning tasks
Y Wang, J Li, F Metze
InterSpeech 2018, 2018
Cited by: 16, Year: 2018
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification
JB Li, S Qu, PB Huang, F Metze
InterSpeech 2022, 2022
Cited by: 12, Year: 2022
Sound event detection for real life audio DCASE challenge
JL Dai Wei, P Pham, S Das, S Qu, F Metze
Proc. Workshop Detection and Classification of Acoustic Scenes and Events, 2016
Cited by: 12, Year: 2016
Audio-visual event recognition through the lens of adversary
JB Li, K Ma, S Qu, PY Huang, F Metze
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 10, Year: 2021
Audio-Journey: Efficient visual+LLM-aided audio Encodec diffusion
JB Li, JS Michaels, L Yao, L Yu, Z Wood-Doughty, F Metze
Workshop on Efficient Systems for Foundation Models @ ICML 2023, 2023
Cited by: 9*, Year: 2023
On adversarial robustness of large-scale audio visual learning
JB Li, S Qu, X Li, PYB Huang, F Metze
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 8, Year: 2022