Theo dõi
Xin Wang
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
Asvspoof 2019: Future horizons in spoofed and fake audio detection
M Todisco, X Wang, V Vestman, M Sahidullah, H Delgado, A Nautsch, ...
Proc. Interspeech, 1008-1012, 2019
5962019
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech.
C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi
SSW, 146-152, 2016
3862016
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language, 101114, 2020
3102020
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
J Yamagishi, X Wang, M Todisco, M Sahidullah, J Patino, A Nautsch, ...
Proc. 2021 Edition of the Automatic Speaker Verification and Spoofing …, 2021
2422021
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings
E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1812020
Neural source-filter-based waveform model for statistical parametric speech synthesis
X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1572019
A comparative study on recent neural spoofing countermeasures for synthetic speech detection
X Wang, J Yamagishi
Proc. Interspeech, 4259--4263, 2021
1492021
Neural source-filter waveform models for statistical parametric speech synthesis
X Wang, S Takaki, J Yamagishi
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 402-415, 2019
1442019
Speaker anonymization using x-vector and neural waveform models
F Fang, X Wang, J Yamagishi, I Echizen, M Todisco, N Evans, ...
Proc. SSW, 155-160, 2019
1362019
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech
A Nautsch, X Wang, N Evans, TH Kinnunen, V Vestman, M Todisco, ...
IEEE Transactions on Biometrics, Behavior, and Identity Science 3 (2), 252-265, 2021
1302021
Introducing the VoicePrivacy initiative
N Tomashenko, BML Srivastava, X Wang, E Vincent, A Nautsch, ...
Proc. Interspeech, 1693--1697, 2020
1182020
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks.
C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi
Interspeech, 352-356, 2016
1172016
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
Y Yasuda, X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1042019
Tandem assessment of spoofing countermeasures and automatic speaker verification: Fundamentals
T Kinnunen, H Delgado, N Evans, KA Lee, V Vestman, A Nautsch, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2195-2210, 2020
902020
The VoicePrivacy 2020 Challenge: Results and findings
N Tomashenko, X Wang, E Vincent, J Patino, BML Srivastava, PG Noé, ...
Computer Speech & Language 74, 101362, 2022
862022
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data
J Lorenzo-Trueba, F Fang, X Wang, I Echizen, J Yamagishi, T Kinnunen
Proc. Speaker Odyssey, 240-247, 2018
852018
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
H Tak, M Todisco, X Wang, J Jung, J Yamagishi, N Evans
Proc. Odyssey, 2022
842022
ASVspoof 2021: Towards spoofed and deepfake speech detection in the wild
X Liu, X Wang, M Sahidullah, J Patino, H Delgado, T Kinnunen, ...
IEEE/ACM Transaction on Audio, Speech, and Language Processing (accepted), 2022
802022
Investigating self-supervised front ends for speech spoofing countermeasures
X Wang, J Yamagishi
Proc. Odyssey, 100-106, 2022
782022
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
X Wang, J Lorenzo-Trueba, S Takaki, L Juvela, J Yamagishi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
772018
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–20