Update README.md
Browse files
README.md
CHANGED
@@ -32,7 +32,7 @@ Based on: [Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B) by [Qwen](htt
|
|
32 |
|
33 |
## Quantization notes
|
34 |
Made with Exllamav2 with default dataset.
|
35 |
-
These quants are meant to be used as a draft model for TabbyAPI.
|
36 |
8bpw version with FP16 cache probably might be the most reliable option for this purpose.
|
37 |
|
38 |
## Original model card
|
|
|
32 |
|
33 |
## Quantization notes
|
34 |
Made with Exllamav2 with default dataset.
|
35 |
+
These quants are meant to be used as a draft model for 24B Mistral models with TabbyAPI app.
|
36 |
8bpw version with FP16 cache probably might be the most reliable option for this purpose.
|
37 |
|
38 |
## Original model card
|