UBC-NLP/Simba-TTS-lin

Bridging the Digital Divide for African AI

Voice of a Continent is a comprehensive open-source ecosystem designed to bring African languages to the forefront of artificial intelligence. By providing a unified suite of benchmarking tools and state-of-the-art models, we ensure that the future of speech technology is inclusive, representative, and accessible to over a billion people.

Best-in-Class Multilingual Models

Introduced in our EMNLP 2025 paper Voice of a Continent, the Simba Series represents the current state-of-the-art for African speech AI.

Unified Suite: Models optimized for African languages.
Superior Accuracy: Outperforms generic multilingual models by leveraging SimbaBench's high-quality, domain-diverse datasets.
Multitask Capability: Designed for high performance in ASR (Automatic Speech Recognition) and TTS (Text-to-Speech).
Inclusion-First: Specifically built to mitigate the "digital divide" by empowering speakers of underrepresented languages.

The Simba family consists of state-of-the-art models fine-tuned using SimbaBench. These models achieve superior performance by leveraging dataset quality, domain diversity, and language family relationships.

🔊 Simba-TTS (Text-to-Speech)

🎯 Task: Text-to-Speech — Natural Voice Synthesis. 🌍 Language Coverage (7 African languages)

Afrikaans (afr), Asante Twi (asanti), Akuapem Twi (akuapem), Lingala (lin), Southern Sotho (sot), Tswana (tsn), Xhosa (xho)

TTS Model	Architecture	Hugging Face Card	Status
Simba-TTS-afr 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-afr	✅ Released
Simba-TTS-twi-asanti 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-twi-asanti	✅ Released
Simba-TTS-twi-akuapem 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-twi-akuapem	✅ Released
Simba-TTS-lin 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-lin	✅ Released
Simba-TTS-sot 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-sot	✅ Released
Simba-TTS-tsn 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-tsn	✅ Released
Simba-TTS-xho 🔊	MMS-TTS	🤗 https://huggingface.co/UBC-NLP/Simba-TTS-xho	✅ Released

🧩 Usage Example

You can easily run inference using the Hugging Face transformers library.

from transformers import VitsModel, AutoTokenizer
import torch

model_name="Simba-TTS-afr" ## Simba-TTS-twi-asanti, Simba-TTS-twi-akuapem, Simba-TTS-lin, Simba-TTS-sot, Simba-TTS-tsn, Simba-TTS-xho
model = VitsModel.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

text = "Ons noem hierdie deeltjies sub-atomiese deeltjies" #example of Afrikaans (afr) language 
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    output = model(**inputs).waveform

The resulting waveform can be saved as a .wav file:

scipy.io.wavfile.write("outputfile.wav", rate=model.config.sampling_rate, data=output.float().numpy())

Downloads last month: 22

Safetensors

Model size

36.3M params

Tensor type

F32

Space using UBC-NLP/Simba-TTS-lin 1

Collection including UBC-NLP/Simba-TTS-lin

Simba Speech Series

Collection

Simba bridges the digital divide with a unified suite for African AI: the largest open-source speech benchmark and models covering 61 languages • 13 items • Updated 18 days ago

Bridging the Digital Divide for African AI

Best-in-Class Multilingual Models

🔊 Simba-TTS (Text-to-Speech)

Space using UBC-NLP/Simba-TTS-lin 1

Collection including UBC-NLP/Simba-TTS-lin

🎉 Free Image Generator Now Available!