Samson Tan

Cited by

	All	Since 2019
Citations	1827	1827
h-index	11	11
i10-index	12	12

1100

550

275

825

202020212022202320249 94 206 1060 449

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shafiq JotyResearch Director at Salesforce Research, Assoc. Prof. at NTU (on leave)Verified email at ntu.edu.sg
Min-Yen Kan (靳民彦)Associate Professor, National University of SingaporeVerified email at comp.nus.edu.sg
Caiming XiongSalesforce ResearchVerified email at salesforce.com
Jesse VigLead Research Scientist at SalesforceVerified email at salesforce.com
Nazneen RajaniHugging FaceVerified email at huggingface.co
Chien-Sheng (Jason) WuSalesforce AI ResearchVerified email at salesforce.com
Karan GoelStanford UniversityVerified email at stanford.edu
Richard Socheryou.comVerified email at stanford.edu
Lav R. VarshneyUniversity of Illinois Urbana-ChampaignVerified email at illinois.edu

Samson Tan

Applied Scientist at Amazon AGI

Verified email at amazon.com - Homepage

Natural Language Processing Ethical AI Adversarial Robustness Linguistic Variation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1200	2023
Robustness gym: Unifying the NLP evaluation landscape K Goel, N Rajani, J Vig, S Tan, J Wu, S Zheng, C Xiong, M Bansal, C Ré arXiv preprint arXiv:2101.04840, 2021	122	2021
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations S Tan, S Joty, MY Kan, R Socher The 58th Annual Meeting of the Association for Computational Linguistics …, 2020	106	2020
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Proceedings of BigScience Episode# 5--Workshop on Challenges & Perspectives …, 2022	68	2022
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ... arXiv preprint arXiv:2112.10508, 2021	68*	2021
Nl-augmenter: A framework for task-sensitive natural language augmentation KD Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ... arXiv preprint arXiv:2112.02721, 2021	64	2021
Data governance in the age of large-scale data-driven language technology Y Jernite, H Nguyen, S Biderman, A Rogers, M Masoud, V Danchev, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	53	2022
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding S Tan, S Joty, LR Varshney, MY Kan The 2020 Conference on Empirical Methods in Natural Language Processing, 2020	35	2020
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots S Tan, S Joty 2021 Annual Conference of the North American Chapter of the Association for …, 2021	27	2021
Reliability Testing for Natural Language Processing Systems S Tan, S Joty, K Baxter, A Taeihagh, GA Bennett, MY Kan The Joint Conference of the 59th Annual Meeting of the Association for …, 2021	26	2021
Interpreting the robustness of neural NLP models to textual perturbations Y Zhang, L Pan, S Tan, MY Kan arXiv preprint arXiv:2110.07159, 2021	15	2021
Recode: Robustness evaluation of code generation models S Wang, Z Li, H Qian, C Yang, Z Wang, M Shang, V Kumar, S Tan, B Ray, ... arXiv preprint arXiv:2212.10264, 2022	11	2022
Whodunit? Learning to Contrast for Authorship Attribution B Ai, Y Wang, Y Tan, S Tan arXiv preprint arXiv:2209.11887, 2022	9	2022
The risks of machine learning systems S Tan, A Taeihagh, K Baxter arXiv preprint arXiv:2204.09852, 2022	8	2022
Large language models of code fail at completing code with potential bugs T Dinh, J Zhao, S Tan, R Negrinho, L Lausen, S Zha, G Karypis Advances in Neural Information Processing Systems 36, 2024	7	2024
Systems and methods for generating natural language processing training samples with inflectional perturbations SMR Tan, SR Joty US Patent 11,256,754, 2022	3	2022
Linguistically-Inclusive Natural Language Processing S Tan Ph. D. Dissertation. National University of Singapore, 2022	2	2022
Automatic Feature Fairness in Recommendation via Adversaries H Hu, Y Cao, Z He, S Tan, MY Kan Proceedings of the Annual International ACM SIGIR Conference on Research and …, 2023	1	2023
TraVLR: Now You See It, Now You Don’t! A Bimodal Dataset for Evaluating Visio-Linguistic Reasoning KJ Chow, S Tan, MY Kan Proceedings of the 17th Conference of the European Chapter of the …, 2023	1	2023
BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems G Wang, S Tan, S Joty, G Wu, J Au, S Hoi arXiv preprint arXiv:2211.11982, 2022	1*	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors