The official prequantized EfficientQAT models.
-
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128
Text Generation • Updated • 9 -
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64
Text Generation • Updated • 81 -
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128
Text Generation • Updated • 80 -
ChenMnZ/Llama-3-8b-EfficientQAT-w4g128
Text Generation • Updated • 85