cgus
/

Mistral-Small-3.1-DRAFT-0.5B-exl2

Text Generation

mistral-small-3.1

4-bit precision

Model card Files Files and versions

cgus commited on Mar 24

Commit

4016d96

·

verified ·

1 Parent(s): 27d173f

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -32,7 +32,7 @@ Based on: [Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B) by [Qwen](htt
 ## Quantization notes
 Made with Exllamav2 with default dataset.
-These quants are meant to be used as a draft model for TabbyAPI.
 8bpw version with FP16 cache probably might be the most reliable option for this purpose.
 ## Original model card

 ## Quantization notes
 Made with Exllamav2 with default dataset.
+These quants are meant to be used as a draft model for 24B Mistral models with TabbyAPI app.
 8bpw version with FP16 cache probably might be the most reliable option for this purpose.
 ## Original model card