Update README.md
README.md
CHANGED
@@ -1,7 +1,12 @@
 ---
-
+base_model: RekaAI/reka-flash-3
 ---
 
-[EXL3](https://github.com/turboderp-org/exllamav3) quantization of [
+[EXL3](https://github.com/turboderp-org/exllamav3) quantization of [reka-flash-3](https://huggingface.co/RekaAI/reka-flash-3), 4 bits per weight.
 
-
+### HumanEval (argmax)
+
+| Model                                                                          | Q4   | Q8   | FP16 |
+| ------------------------------------------------------------------------------ | ---- | ---- | ---- |
+| [reka-flash-3-exl3-3bpw](https://huggingface.co/isogen/reka-flash-3-exl3-3bpw) | 87.8 | 90.2 | 90.9 |
+| [reka-flash-3-exl3-4bpw](https://huggingface.co/isogen/reka-flash-3-exl3-4bpw) | 89.0 | 88.4 | 87.2 |