Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 2385 | 2018 |
Tacotron: Towards End-to-End Speech Synthesis Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 1889* | 2017 |
Tacotron: Towards end-to-end speech synthesis Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 1651 | 2017 |
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International Conference on Machine Learning, 5180-5189, 2018 | 701 | 2018 |
Towards end-to-end prosody transfer for expressive speech synthesis with tacotron RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ... international conference on machine learning, 4693-4702, 2018 | 530 | 2018 |
Tacotron: A fully end-to-end text-to-speech synthesis model Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135 164, 2017 | 247 | 2017 |
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ... arXiv preprint arXiv:1907.04448, 2019 | 130 | 2019 |
Semi-supervised training for improving data efficiency in end-to-end speech synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 128 | 2019 |
Predicting Expressive Speaking Style from Text in End-To-End Speech Synthesis D Stanton, Y Wang, RJ Skerry-Ryan 2018 IEEE Spoken Language Technology Workshop (SLT), 595-602, 2018 | 114 | 2018 |
Location-relative attention mechanisms for robust long-form speech synthesis E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 109 | 2020 |
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis RJ Weiss, RJ Skerry-Ryan, E Battenberg, S Mariooryad, DP Kingma ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 84 | 2021 |
Uncovering Latent Style Factors for Expressive Speech Synthesis Y Wang, RJ Skerry-Ryan, Y Xiao, D Stanton, J Shor, E Battenberg, ... arXiv preprint arXiv:1711.00520, 2017 | 75 | 2017 |
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ... arXiv preprint arXiv:1906.03402, 2019 | 47 | 2019 |
Semi-Supervised Generative Modeling for Controllable Speech Synthesis R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ... arXiv preprint arXiv:1910.01709, 2019 | 45 | 2019 |
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021 | 43 | 2021 |
Organic indoor location discovery S Teller, J Battat, B Charrow, D Curtis, R Ryan, J Ledlie, J Hicks Computer Science and Artificial Intelligence Laboratory Technical Report 75, 16, 2008 | 29 | 2008 |
Synthesizing speech from text using neural networks Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ... US Patent 10,971,170, 2021 | 23 | 2021 |
Identifying entities using search results TA Lasko, A Tomkins, M Angelo, MK Gray, R Ryan, NU Godbole, ... US Patent 8,856,099, 2014 | 21 | 2014 |
Anatomy of a subway hack R Ryan, Z Anderson, A Chiesa 16th DEFCON Hacking Conference (DEFCON 2008), 2008 | 19 | 2008 |
Complex evolution recurrent neural networks I Shafran, TE Bagby, RJW Skerry-Ryan US Patent 10,529,320, 2020 | 15 | 2020 |