Upcoming Events
PhD Defense | Improving the Robustness of Natural Language Processing to Dialects and Language Variants
Title: Improving the Robustness of Natural Language Processing to Dialects and Language Variants
Date: 11/19/2025
Time: 12-2PM EST (9-11AM PST)
Location: https://gatech.zoom.us/j/4263320954?pwd=MGtPdUhKd0RIYWdqNzU4VW5RSk5zdz09 (with a small in person presence at Stanford University Gates 415)
William Held
Machine Learning PhD Student
School of Interactive Computing in the College of Computing
Georgia Institute of Technology
Committee
1 Diyi Yang
2 Mark Riedl
3 Larry Heck
4 Zsolt Kira
5 Percy Liang
Abstract: English — as a global language spoken by billions across continents — is rich with variation. Despite the number of speakers of other variants and dialects, most language technologies primarily serve Standard American English speakers, creating systematic barriers for other dialect communities. My research establishes empirical evidence for these disparities through novel controlled experiments and user experience studies spanning multiple English varieties. Building on these findings, I have developed computationally efficient adaptation techniques that enhance dialect robustness without requiring task-specific annotations. Finally, I have examined how dialect performance evolves as models scale, using scaling laws to assess whether increased compute alone can close dialect gaps or if targeted interventions remain necessary. These contributions advance both the theoretical understanding of language variation as a dimension of NLP performance and provide practical machine learning methods for building language technologies that serve English in all its forms.
Event Details
Media Contact
EVENTS BY SCHOOL & CENTER
School of Computational Science and Engineering
School of Interactive Computing
School of Cybersecurity and Privacy
Algorithms and Randomness Center (ARC)
Center for 21st Century Universities (C21U)
Center for Deliberate Innovation (CDI)
Center for Experimental Research in Computer Systems (CERCS)
Center for Research into Novel Computing Hierarchies (CRNCH)
Constellations Center for Equity in Computing
Institute for People and Technology (IPAT)
Institute for Robotics and Intelligent Machines (IRIM)