Eric Battenberg

Cited by

	All	Since 2019
Citations	9865	8001
h-index	23	19
i10-index	34	25

Citations per year
Year	Citations
2012	26
2013	30
2014	36
2015	57
2016	271
2017	483
2018	864
2019	1141
2020	1331
2021	1799
2022	1745
2023	1642
2024	336

Co-authors

Colin Raffel	University of Toronto, Vector Institute and Hugging Face	Verified email at cs.toronto.edu
Daisy Stanton	Google	Verified email at google.com
RJ Skerry-Ryan	Google, Inc.	Verified email at alum.mit.edu
Brian McFee	Music and Performing Arts Professions / Center for Data Science, New York University	Verified email at nyu.edu
Soroosh Mariooryad	Google	Verified email at google.com
Oriol Nieto	Senior Research Engineer at Adobe	Verified email at adobe.com
Daniel P W Ellis	Research Scientist, Google	Verified email at google.com
Sander Dieleman	Research Scientist, DeepMind	Verified email at google.com
Jan Schlüter	Johannes Kepler University Linz (JKU)	Verified email at jku.at
Søren Kaae Sønderby	Verified email at twig.energy
Joel Shor	Google Research	Verified email at google.com
Yuxuan Wang	ByteDance	Verified email at cse.ohio-state.edu
Rif A. Saurous	Google	Verified email at google.com
Ying Xiao (肖盈)	Twitter Cortex Applied Research	Verified email at twitter.com
Adam Coates	Apple. Previously Khosla Ventures, Baidu SVAIL, Stanford PhD	Verified email at cs.stanford.edu
Zhenyao Zhu	Google	Verified email at google.com
Sanjeev Satheesh	Stanford University	Verified email at stanford.edu
Anuroop Sriram	Meta FAIR	Verified email at alumni.cmu.edu
Rewon Child	Founding Team / Technical Staff at Inflection	Verified email at inflection.ai
HaiRong Liu	Baidu Research	Verified email at spansion.com

Eric Battenberg

Google Research

Verified email at google.com

Generative Modeling, Machine Perception, Deep Learning, Machine Learning, Parallel Computing


Title	Cited by	Year
Deep Speech 2: End-to-end speech recognition in English and Mandarin. D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016	3597	2016
librosa: Audio and music signal analysis in Python. B McFee, C Raffel, D Liang, DPW Ellis, M McVicar, E Battenberg, O Nieto. SciPy, 18-24, 2015	2791	2015
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis. Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	880	2018
Towards end-to-end prosody transfer for expressive speech synthesis with Tacotron. RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ... International conference on machine learning, 4693-4702, 2018	636	2018
Lasagne: first release. S Dieleman, J Schlüter, C Raffel, E Olson, SK Sønderby, D Nouri, ... Zenodo: Geneva, Switzerland 3, 74, 2015	471*	2015
Exploring neural transducers for end-to-end speech recognition. E Battenberg, J Chen, R Child, A Coates, YGY Li, H Liu, S Satheesh, ... 2017 IEEE automatic speech recognition and understanding workshop (ASRU …, 2017	260*	2017
Location-relative attention mechanisms for robust long-form speech synthesis. E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	122	2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis. RJ Weiss, RJ Skerry-Ryan, E Battenberg, S Mariooryad, DP Kingma. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	110	2021
Uncovering latent style factors for expressive speech synthesis. Y Wang, RJ Skerry-Ryan, Y Xiao, D Stanton, J Shor, E Battenberg, ... arXiv preprint arXiv:1711.00520, 2017	87	2017
librosa/librosa: 0.8.0. B McFee, V Lostanlen, A Metsai, M McVicar, S Balke, C Thomé, C Raffel, ... Version 0.8.0, Zenodo, doi 10, 2020	82	2020
librosa: Audio and music signal analysis in Python. B McFee, C Raffel, D Liang, DPW Ellis, M McVicar, E Battenberg. Proceedings of the 14th Python in Science Conference 8, 18-25, 2015	80	2015
viktorandreevichmorozov, K B McFee, A Metsai, M McVicar, S Balke, C Thomé, C Raffel, F Zalkow, ... Moore, R. Bittner, S. Hidaka, Z. Wei, nullmightybofo, D. Herenú, F.-R …, 2020	58	2020
Semi-supervised generative modeling for controllable speech synthesis. R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ... arXiv preprint arXiv:1910.01709, 2019	57	2019
Effective use of variational embedding capacity in expressive end-to-end speech synthesis. E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ... arXiv preprint arXiv:1906.03402, 2019	56	2019
Accelerating Non-Negative Matrix Factorization for Audio Source Separation on Multi-Core and Many-Core Architectures. E Battenberg, D Wessel. ISMIR, 501-506, 2009	56	2009
librosa 0.5.0. B McFee, M McVicar, O Nieto, S Balke, C Thome, D Liang, E Battenberg, ... Zenodo. URL: https://doi.org/10.5281, 2017	55	2017
Implementing real-time partitioned convolution algorithms on conventional operating systems. E Battenberg, R Avizienis. Proceedings of the 14th International Conference on Digital Audio Effects …, 2011	50	2011
librosa: 0.4.1. B McFee, M McVicar, C Raffel, D Liang, O Nieto, E Battenberg, J Moore, ... Zenodo, 2015	48	2015
Analyzing Drum Patterns Using Conditional Deep Belief Networks. E Battenberg, D Wessel. ISMIR, 37-42, 2012	46	2012
librosa/librosa: 0.7.2. B McFee, V Lostanlen, M McVicar, A Metsai, S Balke, C Thomé, C Raffel, ... Zenodo, Jan 13, 2020	38	2020
