Frank Qiaochu Zhang

Cited by

	All	Since 2019
Citations	795	795
h-index	14	14
i10-index	16	16

260

130

195

2019202020212022202320244 52 217 208 246 68

Co-authors

Yongqiang WangResearch Scientist, GoogleVerified email at google.com
Chunxi LiuTwo SigmaVerified email at twosigma.com
Yatharth SarafFacebook AIVerified email at fb.com
Xiaohui ZhangFacebookVerified email at fb.com
Mike SeltzerFacebookVerified email at fb.com
Ching-Feng YehResearch Scientist, FAIRVerified email at fb.com
Yangyang ShiMetaVerified email at fb.com
Duc LeSenior Staff Research Scientist, Meta AIVerified email at meta.com
Andros TjandraFacebook AI (research scientist)Verified email at fb.com
Alex XiaoFacebookVerified email at fb.com
Abdelrahman MohamedResearch scientist, Facebook AI ResearchVerified email at fb.com
Kritika SinghFacebookVerified email at fb.com
Gabriel SynnaeveResearch scientist at Facebook AI ResearchVerified email at fb.com
Jay MahadeokarFacebook AIVerified email at fb.com
Fuchun PengResearch ManagerVerified email at fb.com
Jun LiuMeta Inc; Xtal Inc; Department of Physics & Astronomy, Rice UniversityVerified email at fb.com
Da-Rong LiuNational Taiwan UniversityVerified email at ntu.edu.tw

Frank Qiaochu Zhang

Facebook

Verified email at fb.com

Large Language Model Speech Recognition Inference Acceleration


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Transformer-based acoustic modeling for hybrid speech recognition Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	242	2020
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	152	2021
Streaming transformer-based acoustic models using self-attention with augmented memory C Wu, Y Wang, Y Shi, CF Yeh, F Zhang arXiv preprint arXiv:2005.08042, 2020	67	2020
Deja-vu: Double feature presentation and iterated loss in deep transformer networks A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	43	2020
Improved language identification through cross-lingual self-supervised learning A Tjandra, DG Choudhury, F Zhang, K Singh, A Conneau, A Baevski, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	41	2022
Improving RNN transducer based ASR with auxiliary tasks C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig 2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021	40	2021
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig Interspeech 2020, 2020	29	2020
Weak-attention suppression for transformer based speech recognition Y Shi, Y Wang, C Wu, C Fuegen, F Zhang, D Le, CF Yeh, ML Seltzer arXiv preprint arXiv:2005.09137, 2020	27	2020
Multilingual graphemic hybrid ASR with massive data augmentation C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig arXiv preprint arXiv:1909.06522, 2019	26	2019
Contextualizing ASR lattice rescoring with hybrid pointer network language model DR Liu, C Liu, F Zhang, G Synnaeve, Y Saraf, G Zweig arXiv preprint arXiv:2005.07394, 2020	23	2020
Benchmarking lf-mmi, ctc and rnn-t criteria for streaming asr X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ... 2021 IEEE spoken language technology workshop (SLT), 46-51, 2021	21	2021
Scaling ASR improves zero and few shot learning A Xiao, W Zheng, G Keren, D Le, F Zhang, C Fuegen, O Kalinli, Y Saraf, ... arXiv preprint arXiv:2111.05948, 2021	18	2021
Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ... arXiv preprint arXiv:2110.03520, 2021	17	2021
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications Y Wang, Y Shi, F Zhang, C Wu, J Chan, CF Yeh, A Xiao ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	16	2021
Streaming attention-based models with augmented memory for end-to-end speech recognition CF Yeh, Y Wang, Y Shi, C Wu, F Zhang, J Chan, ML Seltzer 2021 IEEE Spoken Language Technology Workshop (SLT), 8-14, 2021	11	2021
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models X Zhang, V Manohar, D Zhang, F Zhang, Y Shi, N Singhal, J Chan, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	10	2021
Training asr models by generation of contextual information K Singh, D Okhonko, J Liu, Y Wang, F Zhang, R Girshick, S Edunov, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	6	2020
Deja-vu: Double feature presentation in deep transformer networks A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ... Submitted to ICASSP, 2020	3	2020
Multilingual ASR with massive data augmentation C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig arXiv preprint arXiv:1909.06522, 2019	3	2019
Efficient memory transformer based acoustic model for low latency streaming speech recognition MLS Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Yui ... US Patent 11,646,017, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors