Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018 | 389 | 2018 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 199 | 2019 |
Speech recognition with augmented synthesized speech A Rosenberg, Y Zhang, B Ramabhadran, Y Jia, P Moreno, Y Wu, Z Wu 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019 | 118 | 2019 |
Semi-supervised training for end-to-end models via weak distillation B Li, TN Sainath, R Pang, Z Wu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 51 | 2019 |
Improving performance of end-to-end ASR on numeric sequences C Peyser, H Zhang, TN Sainath, Z Wu arXiv preprint arXiv:1907.01372, 2019 | 33 | 2019 |
NAM+: Towards Scalable End-to-End Contextual Biasing for Adaptive ASR T Munkhdalai, Z Wu, G Pundak, KC Sim, J Li, P Rondon, TN Sainath 2022 IEEE Spoken Language Technology Workshop (SLT), 190-196, 2023 | 15 | 2023 |
Unsupervised context learning for speech recognition AH Michaely, M Ghodsi, Z Wu, J Scheiner, P Aleksic 2016 IEEE Spoken Language Technology Workshop (SLT), 447-453, 2016 | 14 | 2016 |
Multistate encoding with end-to-end speech RNN transducer network Z Wu, B Li, Y Zhang, PS Aleksic, TN Sainath ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 9 | 2020 |
A Deliberation-based Joint Acoustic and Text Decoder S Mavandadi, TN Sainath, K Hu, Z Wu | 8 | 2021 |
SLM: Bridge the thin gap between speech and text foundation models M Wang, W Han, I Shafran, Z Wu, CC Chiu, Y Cao, Y Wang, N Chen, ... arXiv preprint arXiv:2310.00230, 2023 | 7 | 2023 |
Dual-Mode NAM: Effective Top-K Context Injection for End-to-End ASR Z Wu, T Munkhdalai, P Rondon, G Pundak, KC Sim, C Li | 5* | |
Streaming Intended Query Detection using E2E Modeling for Continued Conversation S Chang, G Prakash, Z Wu, Q Liang, TN Sainath, B Li, A Stambler, ... arXiv preprint arXiv:2208.13322, 2022 | 2 | 2022 |
Contextual Spelling Correction with Large Language Models G Song, Z Wu, G Pundak, A Chandorkar, K Joshi, X Velez, D Caseiro, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 1 | 2023 |
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm W Wang, Z Wu, D Caseiro, T Munkhdalai, KC Sim, P Rondon, G Pundak, ... arXiv preprint arXiv:2310.00178, 2023 | 1 | 2023 |