Deep convolutional neural networks for large-scale speech tasks TN Sainath, B Kingsbury, G Saon, H Soltau, A Mohamed, G Dahl, ... Neural networks 64, 39-48, 2015 | 2023 | 2015 |
Low-rank matrix factorization for deep neural network training with high-dimensional output targets TN Sainath, B Kingsbury, V Sindhwani, E Arisoy, B Ramabhadran 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 784 | 2013 |
Boosted MMI for model and feature-space discriminative training D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, ... 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 498 | 2008 |
English conversational telephone speech recognition by humans and machines G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ... arXiv preprint arXiv:1703.02136, 2017 | 477 | 2017 |
Deep belief networks using discriminative features for phone recognition A Mohamed, TN Sainath, G Dahl, B Ramabhadran, GE Hinton, ... 2011 IEEE international conference on acoustics, speech and signal …, 2011 | 406 | 2011 |
Improvements to deep convolutional neural networks for LVCSR TN Sainath, B Kingsbury, A Mohamed, GE Dahl, G Saon, H Soltau, ... 2013 IEEE workshop on automatic speech recognition and understanding, 315-320, 2013 | 311 | 2013 |
Deep neural network language models E Arisoy, TN Sainath, B Kingsbury, B Ramabhadran Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the …, 2012 | 308 | 2012 |
Efficient Knowledge Distillation from an Ensemble of Teachers. T Fukuda, M Suzuki, G Kurata, S Thomas, J Cui, B Ramabhadran Interspeech, 3697-3701, 2017 | 267 | 2017 |
Vocabulary independent spoken term detection J Mamou, B Ramabhadran, O Siohan Proceedings of the 30th annual international ACM SIGIR conference on …, 2007 | 254 | 2007 |
Auto-encoder bottleneck features using deep belief networks TN Sainath, B Kingsbury, B Ramabhadran 2012 IEEE international conference on acoustics, speech and signal …, 2012 | 245 | 2012 |
Making deep belief networks effective for large vocabulary continuous speech recognition TN Sainath, B Kingsbury, B Ramabhadran, P Fousek, P Novak, ... 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 30-35, 2011 | 245 | 2011 |
Method and apparatus for a communication device for use by a hearing impaired/mute or deaf person or in silent environments PT Brunet, AP Ittycheriah, C Narayanaswami, MA Picheny, ... US Patent 5,995,590, 1999 | 236 | 1999 |
Google usm: Scaling automatic speech recognition beyond 100 languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023 | 229 | 2023 |
Learning filter banks within a deep neural network framework TN Sainath, B Kingsbury, A Mohamed, B Ramabhadran 2013 IEEE workshop on automatic speech recognition and understanding, 297-302, 2013 | 207 | 2013 |
Conversion of non-back-off language models for efficient speech decoding E Arisoy, B Ramabhadran, A Sethy, S Chen US Patent 9,484,023, 2016 | 204 | 2016 |
Deep belief nets for natural language call-routing R Sarikaya, GE Hinton, B Ramabhadran 2011 IEEE International conference on acoustics, speech and signal …, 2011 | 194 | 2011 |
Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ... arXiv preprint arXiv:1907.04448, 2019 | 191 | 2019 |
Large-scale multilingual speech recognition with a streaming end-to-end model A Kannan, A Datta, TN Sainath, E Weinstein, B Ramabhadran, Y Wu, ... arXiv preprint arXiv:1909.05330, 2019 | 189 | 2019 |
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022 | 175 | 2022 |
Enhanced likelihood computation using regression in a speech recognition system PV De Souza, Y Gao, M Picheny, B Ramabhadran US Patent 6,493,667, 2002 | 175 | 2002 |