Sebastian Gehrmann

Cited by

	All	Since 2019
Citations	15516	15358
h-index	34	34
i10-index	50	49

Citations per year: 2018 (112), 2019 (318), 2020 (523), 2021 (804), 2022 (1559), 2023 (5347), 2024 (6728)

Public access (based on funding mandates): 11 articles available, 0 articles not available

Co-authors

Hendrik Strobelt, Senior Research Scientist IBM Research / MIT-IBM Watson AI Lab. Verified email at strobelt.com
Alexander M. Rush, Associate Professor, Cornell University. Verified email at cornell.edu
Franck Dernoncourt, NLP/ML Researcher. MIT PhD. Verified email at adobe.com
Yonatan Belinkov, Technion. Verified email at technion.ac.il
Stuart Shieber, Harvard University. Verified email at seas.harvard.edu
Hanspeter Pfister, An Wang Professor of Computer Science, Harvard University. Verified email at seas.harvard.edu
Ankur P Parikh, Staff Research Scientist at Google DeepMind. Verified email at google.com
Mirac Suzgun, Stanford University. Verified email at stanford.edu
Tom Sercu, EvolutionaryScale. Verified email at evolutionaryscale.ai
Cicero Nogueira dos Santos, Research Scientist, Google DeepMind. Verified email at google.com
Payel Das, Manager and Principal Research Staff Member, AI research, IBM Watson, NY. Verified email at us.ibm.com
Thibault Sellam, Google Research. Verified email at google.com
Hyung Won Chung, OpenAI. Verified email at google.com
Joy Tzung-yu Wu, Stanford Nuclear Medicine/Radiology and IBM Research Almaden. Verified email at stanford.edu
Leo Anthony Celi, Massachusetts Institute of Technology. Verified email at mit.edu
Eric T Carlson, Director, Global Data Science, Merck. Verified email at merck.com
Dipanjan Das, Senior Director of Research, Google Deepmind. Verified email at google.com
David S Rosenberg, Bloomberg; NYU Center for Data Science. Verified email at bloomberg.net
Mark Dredze, Johns Hopkins University. Verified email at cs.jhu.edu
Gideon Mann, Millennium. Verified email at mlp.com

Sebastian Gehrmann

Head of NLP, CTO Office, Bloomberg LP

Verified email at bloomberg.net

Natural Language Processing, Natural Language Generation, Interpretability


Title	Cited by	Year
PaLM: Scaling language modeling with pathways. A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022	4777	2022
Bloom: A 176b-parameter open-access multilingual language model. T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1532	2023
Palm 2 technical report. R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	1324	2023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models. A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022	1068	2022
Bottom-up abstractive summarization. S Gehrmann, Y Deng, AM Rush. EMNLP 2018, 2018	839	2018
BloombergGPT: A large language model for finance. S Wu, O Irsoy, S Lu, V Dabravolski, M Dredze, S Gehrmann, P Kambadur, ... arXiv preprint arXiv:2303.17564, 2023	698	2023
LSTMVis: A tool for visual analysis of hidden state dynamics in recurrent neural networks. H Strobelt, S Gehrmann, H Pfister, AM Rush. IEEE transactions on visualization and computer graphics 24 (1), 667-676, 2017	548	2017
Challenging big-bench tasks and whether chain-of-thought can solve them. M Suzgun, N Scales, N Schärli, S Gehrmann, Y Tay, HW Chung, ... ACL Findings 2023, 2022	508	2022
GLTR: Statistical detection and visualization of generated text. S Gehrmann, H Strobelt, AM Rush. ACL Demo 2019, 2019	496	2019
Investigating gender bias in language models using causal mediation analysis. J Vig, S Gehrmann, Y Belinkov*, S Qian, D Nevo, Y Singer, S Shieber. NeurIPS 2021 33, 12388-12401, 2020	462*	2020
ToTTo: A controlled table-to-text generation dataset. AP Parikh, X Wang, S Gehrmann, M Faruqui, B Dhingra, D Yang, D Das. EMNLP 2020, 2020	346	2020
Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives. S Gehrmann, F Dernoncourt, Y Li, ET Carlson, JT Wu, J Welt, J Foote Jr, ... PloS one 13 (2), e0192360, 2018	286*	2018
Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations. P Das, T Sercu, K Wadhawan, I Padhi, S Gehrmann, F Cipcigan, ... Nature Biomedical Engineering 5 (6), 613-623, 2021	285	2021
Seq2Seq-Vis: A visual debugging tool for sequence-to-sequence models. H Strobelt, S Gehrmann, M Behrisch, A Perer, H Pfister, AM Rush. IEEE transactions on visualization and computer graphics 25 (1), 353-363, 2018	275	2018
The language interpretability tool: Extensible, interactive visualizations and analysis for NLP models. I Tenney, J Wexler, J Bastings, T Bolukbasi, A Coenen, S Gehrmann, ... ACL Demo 2020, 2020	206	2020
exBERT: A visual analysis tool to explore learned representations in transformers models. B Hoover, H Strobelt, S Gehrmann. EMNLP Demo 2019, 2019	189	2019
Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text. S Gehrmann, E Clark, T Sellam. JAIR, 2022	143	2022
The GEM benchmark: Natural language generation, its evaluation and metrics. S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ... GEM Workshop at ACL 2021, 2021	140	2021
Palm: Scaling language modeling with pathways. arXiv 2022. A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311 10, 2022	110	2022
Causal analysis of syntactic agreement mechanisms in neural language models. M Finlayson, A Mueller, S Gehrmann, S Shieber, T Linzen, Y Belinkov. ACL 2021, 2021	88	2021
