---
license: mit
base_model:
- ServiceNow-AI/Apriel-Nemotron-15b-Thinker
---
imatrix GGUF quants

Execution tips from my private experience:
* don't quantize context
* use top_p 0.9, top_k 20, temp 0.6, min_p 0.05