Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1638 | 2023 |
Robustness gym: Unifying the NLP evaluation landscape K Goel, N Rajani, J Vig, S Tan, J Wu, S Zheng, C Xiong, M Bansal, C Ré arXiv preprint arXiv:2101.04840, 2021 | 143 | 2021 |
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations S Tan, S Joty, MY Kan, R Socher The 58th Annual Meeting of the Association for Computational Linguistics …, 2020 | 114 | 2020 |
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ... arXiv preprint arXiv:2112.10508, 2021 | 105* | 2021 |
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Proceedings of BigScience Episode# 5--Workshop on Challenges & Perspectives …, 2022 | 104 | 2022 |
Nl-augmenter: A framework for task-sensitive natural language augmentation KD Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ... arXiv preprint arXiv:2112.02721, 2021 | 74 | 2021 |
Data governance in the age of large-scale data-driven language technology Y Jernite, H Nguyen, S Biderman, A Rogers, M Masoud, V Danchev, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022 | 69 | 2022 |
Recode: Robustness evaluation of code generation models S Wang, Z Li, H Qian, C Yang, Z Wang, M Shang, V Kumar, S Tan Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 50* | 2023 |
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding S Tan, S Joty, LR Varshney, MY Kan The 2020 Conference on Empirical Methods in Natural Language Processing, 2020 | 43 | 2020 |
Reliability Testing for Natural Language Processing Systems S Tan, S Joty, K Baxter, A Taeihagh, GA Bennett, MY Kan The Joint Conference of the 59th Annual Meeting of the Association for …, 2021 | 37 | 2021 |
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots S Tan, S Joty 2021 Annual Conference of the North American Chapter of the Association for …, 2021 | 30 | 2021 |
Interpreting the robustness of neural NLP models to textual perturbations Y Zhang, L Pan, S Tan, MY Kan arXiv preprint arXiv:2110.07159, 2021 | 19 | 2021 |
Lessons from the trenches on reproducible evaluation of language models S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ... arXiv preprint arXiv:2405.14782, 2024 | 16 | 2024 |
Large language models of code fail at completing code with potential bugs T Dinh, J Zhao, S Tan, R Negrinho, L Lausen, S Zha, G Karypis Advances in Neural Information Processing Systems 36, 2024 | 16 | 2024 |
Whodunit? Learning to Contrast for Authorship Attribution B Ai, Y Wang, Y Tan, S Tan arXiv preprint arXiv:2209.11887, 2022 | 12 | 2022 |
The risks of machine learning systems S Tan, A Taeihagh, K Baxter arXiv preprint arXiv:2204.09852, 2022 | 12 | 2022 |
TraVLR: Now You See It, Now You Don’t! A Bimodal Dataset for Evaluating Visio-Linguistic Reasoning KJ Chow*, S Tan*, MY Kan Proceedings of the 17th Conference of the European Chapter of the …, 2023 | 3 | 2023 |
Automatic Feature Fairness in Recommendation via Adversaries H Hu, Y Cao, Z He, S Tan, MY Kan Proceedings of the Annual International ACM SIGIR Conference on Research and …, 2023 | 2 | 2023 |
Linguistically-Inclusive Natural Language Processing S Tan Ph. D. Dissertation. National University of Singapore, 2022 | 2 | 2022 |
Learning to Generate Answers with Citations via Factual Consistency Models R Aly, Z Tang, S Tan, G Karypis arXiv preprint arXiv:2406.13124, 2024 | 1 | 2024 |