metadata
base_model: Qwen/Qwen3-14B
EXL3 quantization of Qwen3-14B, 4 bits per weight.
HumanEval (argmax)
Model | Q4 | Q6 | Q8 | FP16 |
---|---|---|---|---|
Qwen3-14B-exl3-4bpw | 88.4 | 89.0 | 89.0 | 89.0 |
Qwen3-14B-exl3-6bpw | 89.6 | 88.4 | 89.6 | 89.6 |
Qwen3-8B-exl3-4bpw | 86.0 | 85.4 | 86.0 | 87.2 |
Qwen3-8B-exl3-6bpw | 84.8 | 86.0 | 87.2 | 87.2 |
Qwen3-8B-exl3-8bpw-h8 | 86.0 | 87.2 | 86.6 | 86.6 |