cgus commited on
Commit
4016d96
·
verified ·
1 Parent(s): 27d173f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -32,7 +32,7 @@ Based on: [Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B) by [Qwen](htt
32
 
33
  ## Quantization notes
34
  Made with Exllamav2 with default dataset.
35
- These quants are meant to be used as a draft model for TabbyAPI.
36
  8bpw version with FP16 cache probably might be the most reliable option for this purpose.
37
 
38
  ## Original model card
 
32
 
33
  ## Quantization notes
34
  Made with Exllamav2 with default dataset.
35
+ These quants are meant to be used as a draft model for 24B Mistral models with TabbyAPI app.
36
  8bpw version with FP16 cache probably might be the most reliable option for this purpose.
37
 
38
  ## Original model card