omnivoice-swahili-2p5h

OmniVoice (k2-fsa) finetuned for Swahili at a 2.5h data budget - companion checkpoint for the Deep Learning Indaba 2026 (RIAD) poster "Tone on a Budget".

Code, the reference-free tone_i2 tone metric, and the recipe: https://github.com/mosesdaudu001/tone-on-a-budget

License & codec

These weights are a derivative of OmniVoice (k2-fsa, Apache-2.0). At inference the model loads the Higgs-Audio-v2 codec separately; that codec carries the Boson Higgs Audio 2 Community License (non-commercial, output-restriction). This repo contains only the Apache-2.0-derived OmniVoice weights - obtain the codec from its own source.

Downloads last month: 29

Safetensors

Model size

0.6B params

Tensor type

I64

F32