Moshi Streaming Speech-to-Text (Quantized)

This is a quantized version of Kyutai’s stt-1b-en_fr model. The original model is a 1B parameter streaming speech-to-text model for English and French. This fork contains the same model, quantized to Q8_0 and Q4_K GGUF formats for reduced memory usage and faster inference.

Downloads last month
21
GGUF
Model size
989M params
Architecture
undefined
Hardware compatibility
Log In to view the estimation
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support