


Project
Tonative (to native) Organisation
Year
2025 - Present
Service
Founder
Tonative
Project Overview
This project aims to foster the compilation, translation, and accessibility of datasets for African languages and contexts.
We envision this as a community-driven effort, where volunteers play a key role in shaping the initiative’s direction. All volunteering efforts will be documented to recognise and track contributions, including metrics such as hours engaged, number of sentences validated, and more. These records can serve as valuable references for future use.
There is always a call for volunteers. Please sign up if you are interested in being part of the community: www.tonative.org

Volunteering Roles:

Project Overview
This project aims to foster the compilation, translation, and accessibility of datasets for African languages and contexts.
We envision this as a community-driven effort, where volunteers play a key role in shaping the initiative’s direction. All volunteering efforts will be documented to recognise and track contributions, including metrics such as hours engaged, number of sentences validated, and more. These records can serve as valuable references for future use.
There is always a call for volunteers. Please sign up if you are interested in being part of the community: www.tonative.org

Volunteering Roles:

The Problem
Existing African NLP datasets are scattered across various platforms, limited in scope, and often lack translations for a broader range of African languages, making them less accessible for research and development.
The Problem
Existing African NLP datasets are scattered across various platforms, limited in scope, and often lack translations for a broader range of African languages, making them less accessible for research and development.
The Solution
The Tonative and African Datasets Initiative aims to bridge this gap by compiling, translating, and centralizing African language datasets. We use state-of-the-art models for initial translations, and they are validated or improved by language experts.
The initiative expands linguistic diversity, fosters research accessibility, and organizes resources in one place. It is a community-driven effort, with volunteers contributing to dataset validation and expansion.
The Solution
The Tonative and African Datasets Initiative aims to bridge this gap by compiling, translating, and centralizing African language datasets. We use state-of-the-art models for initial translations, and they are validated or improved by language experts.
The initiative expands linguistic diversity, fosters research accessibility, and organizes resources in one place. It is a community-driven effort, with volunteers contributing to dataset validation and expansion.
The Result
The community is active and continuously welcomes new members who are passionate about this initiative. We especially appreciate language experts from all African backgrounds, including those beyond the widely recognized ones. This will enable us to represent the communities that we serve.
The Result
The community is active and continuously welcomes new members who are passionate about this initiative. We especially appreciate language experts from all African backgrounds, including those beyond the widely recognized ones. This will enable us to represent the communities that we serve.