Democratizing African Speech Data for research and innovation
Discover, explore, and download high-quality Hausa, Igbo, Nigerian Pidgin and Yoruba datasets.
Why African Voices?
Bridging the Data Gap
Access high-quality, culturally rich datasets that were previously fragmented.
Built for AI Innovation
Search and filter by metadata for faster research and development.
Community-Driven Quality
Continuous improvement through feedback and contribution.

1900
Audio Hours
1.9m+
Authentic Sentence
500+
Unique Voices
What this Platform Offers
Curated Speech Datasets
Browse Hausa, Igbo, Nigerian Pidgin and Yoruba dataset with corresponding metadata (Age, Gender and Domain).
Search & Filters
Refine your search by language and demographic for precise result.
Customizable Downloads
Download dataset in chunk (5% -100%) to match your needs.
Feedback & Quality Loop
Share feedback and improvement areas to improve the dataset for everyone.

Our Partners
Collaborating with leading organizations to advance African speech data

Awarri Technologies

Nigerian Institute of Translators and Interpreters

Department of Linguistics, University of Ibadan

RobotsMali
