Mozilla Common Voice
Over the next decade, speech is expected to become the primary way people interact with devices — from phones and laptops to digital assistants. Today’s voice-enabled devices, however, are inaccessible to vast swaths of the planet’s languages, accents, and speech patterns. Mozilla Common Voice launched in 2017 to bridge this gap and is now the world’s largest multi-language public domain voice data set, with over 18,000 hours of voice recordings in over 100 global languages and dialects. Common Voice enables anyone to donate their voices to a free publicly available database that startups, researchers, and developers can use to train voice-enabled apps, products, and services. In 2021, The Bill & Melinda Gates Foundation, the Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) GmbH (German Development Cooperation), and the UK’s Foreign, Commonwealth & Development Office (FCDO) announced an investment of $3.4 million in Mozilla Common Voice. This funding will support the collection of voice data sets in Kiswahili, an East African language spoken by an estimated 100 million people in Kenya, Uganda, Tanzania, Rwanda, Burundi, and South Sudan.
Learn more