WaveGlow: A flow-based generative network for speech synthesis R Prenger, R Valle, B Catanzaro ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 1286 | 2019 |
Flowtron: an autoregressive flow-based generative network for text-to-speech synthesis R Valle, K Shih, R Prenger, B Catanzaro International Conference on Learning Representations 2021, 2020 | 177 | 2020 |
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens R Valle, J Li, R Prenger, B Catanzaro ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 171 | 2020 |
Missing data imputation for supervised learning J Poulos, R Valle Applied Artificial Intelligence 32 (2), 186-196, 2018 | 90 | 2018 |
One TTS alignment to rule them all R Badlani, A Łańcucki, KJ Shih, R Valle, W Ping, B Catanzaro ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 86 | 2022 |
Character-based handwritten text transcription with attention networks J Poulos, R Valle Neural Computing and Applications 33 (16), 10563-10573, 2021 | 52 | 2021 |
RAD-TTS: Parallel flow-based TTS with robust alignment learning and diverse synthesis KJ Shih, R Valle, R Badlani, A Łańcucki, W Ping, B Catanzaro ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit …, 2021 | 52 | 2021 |
Audio Flamingo: A novel audio language model with few-shot learning and dialogue abilities Z Kong, A Goel, R Badlani, W Ping, R Valle, B Catanzaro arXiv preprint arXiv:2402.01831, 2024 | 48 | 2024 |
Machine improvisation with formal specifications A Donzé, R Valle, I Akkaya, S Libkind, SA Seshia, D Wessel Ann Arbor, MI: Michigan Publishing, University of Michigan Library, 2014 | 40 | 2014 |
SPACE: Speech-driven portrait animation with controllable expression S Gururani, A Mallya, TC Wang, R Valle, MY Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 37 | 2023 |
Attacking speaker recognition with deep generative models W Cai, A Doshi, R Valle arXiv preprint arXiv:1801.02384, 2018 | 26 | 2018 |
Control improvisation with probabilistic temporal specifications I Akkaya, DJ Fremont, R Valle, A Donzé, EA Lee, SA Seshia 2016 IEEE First International Conference on Internet-of-Things Design and …, 2016 | 25 | 2016 |
P-Flow: a fast and data-efficient zero-shot TTS through speech prompting S Kim, K Shih, JF Santos, E Bakhturina, M Desta, R Valle, S Yoon, ... Advances in Neural Information Processing Systems 36, 2024 | 23 | 2024 |
TequilaGAN: How to easily identify GAN samples R Valle, W Cai, A Doshi arXiv preprint arXiv:1807.04919, 2018 | 14 | 2018 |
Hands-On Generative Adversarial Networks with Keras: Your guide to implementing next-generation generative adversarial networks R Valle Packt Publishing Ltd, 2019 | 13 | 2019 |
ABROA: Audio Based Room Occupancy Analysis using Gaussian Mixtures and Hidden Markov Models R Valle Future Technologies Conference (FTC), 2016, 2016 | 13 | 2016 |
Any-to-Any Voice Conversion with F0 and Timbre Disentanglement and Novel Timbre Conditioning S Kovela, R Valle, A Dantrey, B Catanzaro ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
RAD-MMM: Multilingual multiaccented multispeaker text to speech R Badlani, R Valle, KJ Shih, JF Santos, S Gururani, B Catanzaro Proc. Interspeech 2023, 626-630, 2023 | 10 | 2023 |
Neural ODEs for Image Segmentation with Level Sets R Valle, F Reda, M Shoeybi, P Legresley, A Tao, B Catanzaro arXiv preprint arXiv:1912.11683, 2019 | 10 | 2019 |
Specification mining for machine improvisation with formal specifications R Valle, A Donzé, DJ Fremont, I Akkaya, SA Seshia, A Freed, D Wessel Computers in Entertainment (CIE) 14 (3), 1-20, 2016 | 8 | 2016 |