Self-training for end-to-end speech recognition J Kahn, A Lee, A Hannun ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 191 | 2020 |
Voxpopuli: A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation C Wang, M Rivière, A Lee, A Wu, C Talnikar, D Haziza, M Williamson, ... arXiv preprint arXiv:2101.00390, 2021 | 180 | 2021 |
Robust wav2vec 2.0: Analyzing domain shift in self-supervised pre-training WN Hsu, A Sriram, A Baevski, T Likhomanenko, Q Xu, V Pratap, J Kahn, ... arXiv preprint arXiv:2104.01027, 2021 | 130 | 2021 |
Sequence-to-sequence speech recognition with time-depth separable convolutions A Hannun, A Lee, Q Xu, R Collobert arXiv preprint arXiv:1904.02619, 2019 | 85 | 2019 |
Direct speech-to-speech translation with discrete units A Lee, PJ Chen, C Wang, J Gu, X Ma, A Polyak, Y Adi, Q He, Y Tang, ... arXiv preprint arXiv:2107.05604, 2021 | 63 | 2021 |
A comparison-based approach to mispronunciation detection A Lee, J Glass 2012 IEEE Spoken Language Technology Workshop (SLT), 382-387, 2012 | 63 | 2012 |
Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams A Lee, Y Zhang, J Glass 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 56 | 2013 |
Text-free prosody-aware generative spoken language modeling E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ... arXiv preprint arXiv:2109.03264, 2021 | 46 | 2021 |
Textless speech-to-speech translation on real data A Lee, H Gong, PA Duquenne, H Schwenk, PJ Chen, C Wang, S Popuri, ... arXiv preprint arXiv:2112.08352, 2021 | 39 | 2021 |
Semi-supervised speech recognition via local prior matching WN Hsu, A Lee, G Synnaeve, A Hannun arXiv preprint arXiv:2002.10336, 2020 | 37 | 2020 |
Exploiting depth and highway connections in convolutional recurrent deep neural networks for speech recognition WN Hsu, Y Zhang, A Lee, J Glass cell 50 (1), 2016 | 30 | 2016 |
Pronunciation assessment via a comparison-based system A Lee, J Glass Speech and Language Technology in Education, 2013 | 28 | 2013 |
Mispronunciation detection without nonnative training data A Lee, J Glass Sixteenth Annual Conference of the International Speech Communication …, 2015 | 27 | 2015 |
Automatic highlights extraction for drama video using music emotion and human face features KS Lin, A Lee, YH Yang, CT Lee, HH Chen Neurocomputing 119, 111-117, 2013 | 25 | 2013 |
Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery A Lee, NF Chen, J Glass 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 24 | 2016 |
Discriminative Reranking for Neural Machine Translation A Lee, M Auli, MA Ranzato Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 22 | 2021 |
Facebook AI's WMT20 News Translation Task Submission PJ Chen, A Lee, C Wang, N Goyal, A Fan, M Williamson, J Gu arXiv preprint arXiv:2011.08298, 2020 | 21 | 2020 |
Language-independent methods for computer-assisted pronunciation training A Lee Massachusetts Institute of Technology, 2016 | 21 | 2016 |
Improving Musical Concept Detection by Ordinal Regression and Context Fusion. YH Yang, YC Lin, A Lee, HH Chen ISMIR, 147-152, 2009 | 21 | 2009 |
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation S Popuri, PJ Chen, C Wang, J Pino, Y Adi, J Gu, WN Hsu, A Lee arXiv preprint arXiv:2204.02967, 2022 | 18 | 2022 |