フォロー
You Zhang
タイトル
引用先
引用先
One-class learning towards synthetic voice spoofing detection
Y Zhang, F Jiang, Z Duan
IEEE Signal Processing Letters 28, 937-941, 2021
2432021
Speech driven talking face generation from a single image and an emotion condition
SE Eskimez, Y Zhang, Z Duan
IEEE Transactions on Multimedia 24, 3480-3490, 2021
882021
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
X Chen*, Y Zhang*, G Zhu*, Z Duan
Proc. 2021 Edition of the Automatic Speaker Verification and Spoofing …, 2021
542021
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems
Y Zhang, G Zhu, F Jiang, Z Duan
Proc. Interspeech, 4309--4313, 2021
332021
A probabilistic fusion framework for spoofing aware speaker verification
Y Zhang, G Zhu, Z Duan
Proc. The Speaker and Language Recognition Workshop (Odyssey), 77-84, 2022
25*2022
SAMO: Speaker attractor multi-center one-class learning for voice anti-spoofing
S Ding, Y Zhang, Z Duan
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
242023
Singfake: Singing voice deepfake detection
Y Zang*, Y Zhang*, M Heydari, Z Duan
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
192024
Global HRTF personalization using anthropometric measures
Y Wang, Y Zhang, Z Duan, M Bocko
Proc. Audio Engineering Society (AES) 150th Convention, 2021
162021
Rethinking audio-visual synchronization for active speaker detection
A Wuerkaixi, Y Zhang, Z Duan, C Zhang
2022 IEEE 32nd International Workshop on Machine Learning for Signal …, 2022
112022
DyViSE: Dynamic Vision-guided Speaker Embedding for Audio-Visual Speaker Diarization
A Wuerkaixi, K Yan, Y Zhang, Z Duan, C Zhang
2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP …, 2022
102022
HRTF field: Unifying measured HRTF magnitude representation with neural fields
Y Zhang, Y Wang, Z Duan
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
92023
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Y Zang, J Shi, Y Zhang, R Yamamoto, J Han, Y Tang, S Xu, W Zhao, ...
Proc. Interspeech, 4783--4787, 2024
62024
Predicting global head-related transfer functions from scanned head geometry using deep learning and compact representations
Y Wang, Y Zhang, Z Duan, M Bocko
arXiv preprint arXiv:2207.14352, 2022
62022
SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
Y Zhang, Y Zang, J Shi, R Yamamoto, J Han, Y Tang, T Toda, Z Duan
arXiv preprint arXiv:2405.05244, 2024
42024
SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge
Y Zhang, Y Zang, J Shi, R Yamamoto, T Toda, Z Duan
arXiv preprint arXiv:2408.16132, 2024
32024
Learning Arousal-Valence Representation from Categorical Emotion Labels of Speech
E Zhou, Y Zhang, Z Duan
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
22024
Phase perturbation improves channel robustness for speech spoofing countermeasures
Y Zang, Y Zhang, Z Duan
Proc. Interspeech, 3162--3166, 2023
22023
Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation
Y Zhang, F Jiang, G Zhu, X Chen, Z Duan
Handbook of Biometric Anti-Spoofing: Presentation Attack Detection and …, 2023
22023
A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection
K Lee, Y Zhang, Z Duan
2024 IEEE 26th International Workshop on Multimedia Signal Processing (MMSP …, 2024
12024
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions
K Zhou, Y Zhang, S Zhao, H Wang, Z Pan, D Ng, C Zhang, C Ni, Y Ma, ...
arXiv preprint arXiv:2409.16681, 2024
12024
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20