Holistic evaluation of language models P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ... TMLR, 2022 | 1161 | 2022 |
When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It? M Yuksekgonul, F Bianchi, P Kalluri, D Jurafsky, J Zou Oral (Notable-Top-5%) @ ICLR, 2023 | 334* | 2023 |
GPT detectors are biased against non-native English writers W Liang*, M Yuksekgonul*, Y Mao*, E Wu*, J Zou Patterns, 2023 | 316 | 2023 |
A visual–language foundation model for pathology image analysis using medical twitter Z Huang, F Bianchi, M Yuksekgonul, TJ Montine, J Zou Nature medicine 29 (9), 2307-2316, 2023 | 313 | 2023 |
Post-hoc concept bottleneck models M Yuksekgonul, M Wang, J Zou Spotlight (Notable-Top-25%) @ ICLR, 2023 | 200 | 2023 |
Pretraining boosts out-of-domain robustness for pose estimation A Mathis, T Biasi, S Schneider, M Yuksekgonul, B Rogers, M Bethge, ... WACV, 1859-1868, 2021 | 147 | 2021 |
Meaningfully debugging model mistakes using conceptual counterfactual explanations A Abid, M Yuksekgonul, J Zou ICML, 66-88, 2022 | 100* | 2022 |
Discover and Cure: Concept-aware Mitigation of Spurious Correlation S Wu, M Yuksekgonul, L Zhang, J Zou ICML, 2023 | 52 | 2023 |
SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis R Daneshjou*, M Yuksekgonul*, ZR Cai, RA Novoa, J Zou NeurIPS, 2022 | 44* | 2022 |
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models M Yuksekgonul, V Chandrasekaran, E Jones, S Gunasekar, R Naik, ... ICLR, 2024 | 30 | 2024 |
TextGrad: Automatic" Differentiation" via Text M Yuksekgonul, F Bianchi, J Boen, S Liu, Z Huang, C Guestrin, J Zou arXiv preprint arXiv:2406.07496, 2024 | 27 | 2024 |
Beyond Confidence: Reliable Models Should Also Consider Atypicality M Yuksekgonul, L Zhang, J Zou, C Guestrin NeurIPS, 2023 | 20 | 2023 |
How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis F Bianchi, PJ Chia, M Yuksekgonul, J Tagliabue, D Jurafsky, J Zou ICML, 2024 | 19* | 2024 |
Diversity of thought improves reasoning abilities of large language models R Naik, V Chandrasekaran, M Yuksekgonul, H Palangi, B Nushi arXiv preprint arXiv:2310.07088, 2023 | 13* | 2023 |
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval MI Abdin, S Gunasekar, V Chandrasekaran, J Li, M Yuksekgonul, ... ICLR, 2024 | 9 | 2024 |
ChatGPT exhibits gender and racial biases in acute coronary syndrome management A Zhang, M Yuksekgonul, J Guild, J Zou, JC Wu arXiv preprint arXiv:2311.14703, 2023 | 8 | 2023 |
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data A Mathis, T Biasi, Y Mert, B Rogers, M Bethge, MW Mathis International Conference on Machine Learning 2020 - Workshop on Uncertainty …, 2020 | 5 | 2020 |
Learning prototypes for multiple instance learning ÖE Sivrikaya, M Yüksekgönül, MG BAYDOĞAN Turkish Journal of Electrical Engineering and Computer Sciences 29 (7), 2901 …, 2021 | 4 | 2021 |
metaTextGrad: Learning to learn with language models as optimizers G Xu, M Yuksekgonul, C Guestrin, J Zou Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning, 0 | | |
Large Language Models Demonstrate and Recapitulate Known Gender and Racial Biases in Clinical Decision Making A Zhang, K Wu, J Guild, M Yuksekgonul, E Wu, J Zou Available at SSRN 4991056, 0 | | |