Over the next decade, speech is expected to become the primary way people interact with devices — from phones and laptops to digital assistants. Today’s voice-enabled devices, however, are inaccessible to vast swaths of the planet’s languages, accents, and speech patterns. Currently, neither Amazon’s Alexa, Apple’s Siri, nor Google Home support a single native African language. Most of the voice data currently used to train machine learning algorithms is held by a handful of major companies. This poses challenges for companies seeking to develop high-quality speech recognition technologies, while also exacerbating the voice recognition divide between English speakers and the rest of the world.
Mozilla aims to change this through Common Voice an open-source initiative that makes it easy for anyone to donate their voice to a publicly available database that anyone can then use to train voice-enabled devices. Over the past two years, Rwandans have donated over 1,700 hours of voice data in Kinyarwanda, a widely spoken language with over 12 million speakers in Rwanda, and a pilot test is currently using the data to train a voice-enabled chatbot for COVID-19 information.
Based on the early success of the Kinyarwanda project, Mozilla is expanding the project to Kiswahili made possible by a $3.4 million investment from groups including the Bill & Melinda Gates Foundation, the Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) GmbH (German Development Cooperation), and the UK’s Foreign Commonwealth & Development Office (FCDO).
Today we are very pleased to announce that Britone Mwasaru, Kathleen Siminyu and Rebecca Ryakitimbo Mwimbi have joined as three new Mozilla Common Voice Fellows dedicated to this project.
A key goal of the project is to explore whether it is possible to develop voice recognition for the languages of underserved communities as a platform. With this data available as a digital public good in the open source domain, it could allow local innovation in emerging markets to develop products and services serving marginalized communities. Common Voice will be collaborating with African companies, start-ups and universities to develop locally suitable, voice-enabled technology solutions that are relevant to the Sustainable Development Goals (SDGs).