FastSpeech 2: Fast and High-Quality End-to-End Text-to-Speech Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu ICLR 2021, 2020 | 1433 | 2020 |
FastSpeech: Fast, Robust and Controllable Text to Speech Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu NeurIPS 2019, 2019 | 1178 | 2019 |
Pseudo Numerical Methods for Diffusion Models on Manifolds L Liu, Y Ren, Z Lin, Z Zhao ICLR 2022, 2021 | 454 | 2021 |
DiffSinger: Diffusion acoustic model for singing voice synthesis J Liu, C Li, Y Ren, F Chen, P Liu, Z Zhao AAAI 2022, 2021 | 272* | 2021 |
Multilingual Neural Machine Translation with Knowledge Distillation X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu ICLR 2019, 2019 | 266 | 2019 |
Make-An-Audio: Text-to-audio generation with prompt-enhanced diffusion models R Huang, J Huang, D Yang, Y Ren, L Liu, M Li, Z Ye, J Liu, X Yin, Z Zhao ICML 2023, 2023 | 225 | 2023 |
ProDiff: Progressive fast diffusion model for high-quality text-to-speech R Huang, Z Zhao, H Liu, J Liu, C Cui, Y Ren ACM MM 2022, 2022 | 156 | 2022 |
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao IJCAI 2022, 2022 | 154 | 2022 |
AudioGPT: Understanding and generating speech, music, sound, and talking head R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024 | 139 | 2024 |
PopMAG: Pop Music Accompaniment Generation Y Ren, J He, X Tan, T Qin, Z Zhao, TY Liu ACM MM 2020, 2020 | 124 | 2020 |
Almost Unsupervised Text to Speech and Automatic Speech Recognition Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu ICML 2019, 2019 | 124 | 2019 |
MultiSpeech: Multi-Speaker Text to Speech with Transformer M Chen, X Tan, Y Ren, J Xu, H Sun, S Zhao, T Qin INTERSPEECH 2020, 2020 | 110 | 2020 |
LRSpeech: Extremely low-resource speech synthesis and recognition J Xu, X Tan, Y Ren, T Qin, J Li, S Zhao, TY Liu KDD 2020, 2020 | 96 | 2020 |
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus R Huang, F Chen, Y Ren, J Liu, C Cui, Z Zhao ACM MM 2021, 3945-3954, 2021 | 92 | 2021 |
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis Z Ye, Z Jiang, Y Ren, J Liu, JZ He, Z Zhao ICLR 2023, 2023 | 91 | 2023 |
PortaSpeech: Portable and High-Quality Generative Text-to-Speech Y Ren, J Liu, Z Zhao NeurIPS 2021, 2021 | 79 | 2021 |
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis R Huang, Y Ren, J Liu, C Cui, Z Zhao NeurIPS 2022, 2022 | 78* | 2022 |
DeepSinger: Singing voice synthesis with data mined from the web Y Ren, X Tan, T Qin, J Luan, Z Zhao, TY Liu KDD 2020, 2020 | 73 | 2020 |
SimulSpeech: End-to-End Simultaneous Speech to Text Translation Y Ren, J Liu, X Tan, C Zhang, T Qin, Z Zhao, TY Liu ACL 2020, 2020 | 73 | 2020 |
A Study of Non-autoregressive Model for Sequence Generation Y Ren, J Liu, X Tan, S Zhao, Z Zhao, TY Liu ACL 2020, 2020 | 71 | 2020 |