isogen
/

Qwen3-14B-exl3-4bpw

4-bit precision

Model card Files Files and versions

isogen commited on May 12

Commit

d0b1168

·

verified ·

1 Parent(s): ee8be19

Update README.md

Files changed (1) hide show

README.md +10 -1

README.md CHANGED Viewed

@@ -1,6 +1,15 @@
 ---
-license: apache-2.0
 base_model: Qwen/Qwen3-14B
 ---
 [EXL3](https://github.com/turboderp-org/exllamav3) quantization of [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B), 4 bits per weight.

 ---
 base_model: Qwen/Qwen3-14B
 ---
 [EXL3](https://github.com/turboderp-org/exllamav3) quantization of [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B), 4 bits per weight.
+## HumanEval (argmax)
+| Model                                                                    | Q4   | Q6   | Q8   | FP16 |
+| ------------------------------------------------------------------------ | ---- | ---- | ---- | ---- |
+| [Qwen3-14B-exl3-4bpw](https://huggingface.co/isogen/Qwen3-14B-exl3-4bpw) | 88.4 | 89.0 | 89.0 | 89.0 |
+| [Qwen3-14B-exl3-6bpw](https://huggingface.co/isogen/Qwen3-14B-exl3-6bpw) | 89.6 | 88.4 | 89.6 | 89.6 |
+| [Qwen3-8B-exl3-4bpw](https://huggingface.co/turboderp/Qwen3-8B-exl3)     | 86.0 | 85.4 | 86.0 | 87.2 |
+| [Qwen3-8B-exl3-6bpw](https://huggingface.co/turboderp/Qwen3-8B-exl3)     | 84.8 | 86.0 | 87.2 | 87.2 |
+| [Qwen3-8B-exl3-8bpw-h8](https://huggingface.co/turboderp/Qwen3-8B-exl3)  | 86.0 | 87.2 | 86.6 | 86.6 |