Sebastin Santy

SEBASTIN

SANTY

ssanty@cs.washington.edu

I am a PhD student at the University of Washington, advised by Sewoong Oh. I'm inspired by what makes human and machine intelligence unique, how they interact with each other, and the new forms of intelligence that emerge from this interaction.

RESEARCH

When Incentives Backfire, Data Stops Being Human

Sebastin Santy, Prasanta Bhattacharya, Manoel Horta Ribeiro, Kelsey Allen, Sewoong Oh

Position Paper at ICML 2025 VISUAL● PDF● ABS

From English to the World

Multilingual Diversity Improves Vision-Language Representations

Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt,
Pang Wei Koh, Ranjay Krishna

NeurIPS 2024 SPOTLIGHT PDF● ABS

Semantic and Expressive Variations in Image Captions Across Languages

Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna

CVPR 2025 PDF● ABS

Characterizing Design Biases of Datasets and Models

Sebastin Santy*, Jenny Liang*, Ronan Le Bras, Katharina Reinecke, Maarten Sap

ACL 2023 OUTSTANDING PAPER ◦ CMU ML Blog WEB● PDF● ABS

State and Fate of Linguistic Diversity and Inclusion in the NLP World
Pratik Joshi*, Sebastin Santy*, Amar Budhiraja*, Kalika Bali, Monojit Choudhury

ACL 2020◦ US FTC◦ NLP News◦ NLP Beyond English◦ Quartz◦ Underrated ML WEB● TALK● PDF● ABS

Low Resource Language Systems

Language Translation as a Socio-Technical System
Sebastin Santy, Kalika Bali, Monojit Choudhury, Sandipan Dandapat, Tanuja Ganu, Anurag Shukla,
Jahanvi Shah, Vivek Seshadri

COMPASS 2021◦ Mint Lounge SLIDES● PDF● ABS

Learnings from Technological Interventions in a Low Resource Language
Devansh Mehta*, Sebastin Santy*, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava,
Alok Sharma, Anurag Shukla, Vishnu Prasad, Venkanna U, Amit Sharma, Kalika Bali

LREC 2020◦ The Caravan◦ Times of India◦ Hindustan Times◦ ETV PDF● ABS

Unsung Challenges of Building and Deploying Language Technologies for LRL Communities
Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan,
Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury, Kalika Bali

ICON 2019◦ Indian Express◦ Microsoft Stories SLIDES● PDF● ABS

Interfaces for x People

BLIP: Facilitating the Exploration of Undesirable Consequences of Digital Technologies
Rock Yuren Pang, Sebastin Santy, René Just, Katharina Reinecke

CHI 2024 ⁕ x = Journalistic PDF● ABS

INMT: Interactive Neural Machine Translation
Sebastin Santy, Sandipan Dandapat, Monojit Choudhury, Kalika Bali

EMNLP 2019 Demo◦ Slate ⁕ x = Translators WEB● CODE● POSTER● PDF● ABS

CoSSAT: Code-Switched Speech Annotation Tool
Sanket Shah, Pratik Joshi, Sebastin Santy, Sunayana Sitaram

AnnoNLP @ EMNLP 2019 ⁕ x = Annotators SLIDES● PDF● ABS

Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices
Sebastin Santy, Anku Rani, Monojit Choudhury

ACL 2021 Findings◦ Academic Freedom and Ethics Review SLIDES● POSTER● PDF● ABS

BERTologiCoMix: How does Code-Mixing interact with Multilingual BERT?
Sebastin Santy*, Anirudh Srinivasan*, Monojit Choudhury

AdaptNLP@EACL 2021 POSTER● PDF● ABS

Towards Task Understanding in Visual Settings
Sebastin Santy, Wazeer Zulfikar, Rishabh Mehrotra, Emine Yilmaz

AAAI 2019 (Student Abstract) POSTER● PDF● ABS

TALKS

Designing, Evaluating, and Learning from Humans Interacting with NLP Models
Sherry Tongshuang Wu, Diyi Yang, Sebastin Santy

EMNLP 2023 ⫻ Singapore
WEB● SLIDES