FastSpeech 2: Fast and High-Quality End-to-End Text-to-Speech Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu ICLR 2021, 2020 | 1505 | 2020 |
FastSpeech: Fast, Robust and Controllable Text to Speech Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu NeurIPS 2019, 2019 | 1213 | 2019 |
Pseudo Numerical Methods for Diffusion Models on Manifolds L Liu, Y Ren, Z Lin, Z Zhao ICLR 2022, 2021 | 511 | 2021 |
Diffsinger: Diffusion acoustic model for singing voice synthesis J Liu, C Li, Y Ren, F Chen, P Liu, Z Zhao AAAI 2022, 2021 | 288* | 2021 |
Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models R Huang, J Huang, D Yang, Y Ren, L Liu, M Li, Z Ye, J Liu, X Yin, Z Zhao ICML 2023, 2023 | 270 | 2023 |
Multilingual Neural Machine Translation with Knowledge Distillation X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu ICLR 2019, 2019 | 268 | 2019 |
Prodiff: Progressive fast diffusion model for high-quality text-to-speech R Huang, Z Zhao, H Liu, J Liu, C Cui, Y Ren ACM MM 2022, 2022 | 164 | 2022 |
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao IJCAI 2022, 2022 | 163 | 2022 |
Audiogpt: Understanding and generating speech, music, sound, and talking head R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024 | 153 | 2024 |
PopMAG: Pop Music Accompaniment Generation Y Ren, J He, X Tan, T Qin, Z Zhao, TY Liu ACMMM 2020, 2020 | 132 | 2020 |
Almost Unsupervised Text to Speech and Automatic Speech Recognition Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu ICML 2019, 2019 | 128 | 2019 |
MultiSpeech: Multi-Speaker Text to Speech with Transformer M Chen, X Tan, Y Ren, J Xu, H Sun, S Zhao, T Qin INTERSPEECH 2020, 2020 | 115 | 2020 |
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis Z Ye, Z Jiang, Y Ren, J Liu, JZ He, Z Zhao ICLR 2023, 2023 | 100 | 2023 |
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus R Huang, F Chen, Y Ren, J Liu, C Cui, Z Zhao ACMMM 2021, 3945-3954, 2021 | 99 | 2021 |
LRSpeech: Extremely low-resource speech synthesis and recognition J Xu, X Tan, Y Ren, T Qin, J Li, S Zhao, TY Liu KDD 2020, 2020 | 99 | 2020 |
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis R Huang, Y Ren, J Liu, C Cui, Z Zhao NeurIPS 2022, 2022 | 84* | 2022 |
PortaSpeech: Portable and High-Quality Generative Text-to-Speech Y Ren, J Liu, Z Zhao NeurIPS 2021, 2021 | 84 | 2021 |
Deepsinger: Singing voice synthesis with data mined from the web Y Ren, X Tan, T Qin, J Luan, Z Zhao, TY Liu KDD 2020, 2020 | 80 | 2020 |
SimulSpeech: End-to-End Simultaneous Speech to Text Translation Y Ren, J Liu, X Tan, C Zhang, QIN Tao, Z Zhao, TY Liu ACL 2020, 2020 | 78 | 2020 |
M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus L Zhang, R Li, S Wang, L Deng, J Liu, Y Ren, J He, R Huang, J Zhu, ... NeurIPS 2022, Datasets and Benchmarks Track, 2022 | 72 | 2022 |