Search for a command to run...
Four billion people cannot access vital information or be heard simply because of the language they speak—a human rights crisis where technology advances have left speakers of most of the world’s 7,000+ languages digitally invisible. This talk explores why languages matter in the digital space generally and in Language AI specifically, through real-world examples from CLEAR Global’s work across health, education, and agriculture sectors. We showcase practical tools developed to inform decisions (Language Use Data Platform), collect quality-controlled voice data (TWB Voice), and demonstrate how the NLP community can actively use these resources to address critical gaps in language technology. Drawing from projects spanning crisis response, educational equity, and access to agricultural information, we illustrate the transformative impact of language-inclusive technology. From conversational AI chatbots serving COVID-19 information in Lingala, Hausa, and Kanuri, to exploring EdTech solutions for mother tongue education in the Marma community in Bangladesh, to evaluating how synthetic voice data can be used for African ASR at lower costs than traditional data collection costs—our work demonstrates that marginalized languages can achieve competitive NLP performance with appropriate data and community engagement. Throughout our work, CLEAR Global’s 100,000+ volunteer linguist community provides the foundation for quality-controlled translations, data collection, cultural validation, and ensuring that technology development remains grounded in the needs and expertise of native speakers. This talk invites the NLP community to collaborate in ensuring that the right to information and to be heard doesn’t depend on which language you speak.