jiyuanq
/

falcon-40b-instruct-gptq-128g-act

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

falcon-40b-instruct quantized with GPTQ using the script in https://github.com/huggingface/text-generation-inference/pull/438

group size: 128
act order: true
nsamples: 128
dataset: wikitext2

Downloads last month: 15

Safetensors

Model size

6.53B params

Tensor type

I64

·

I32

·

FP16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The HF Inference API does not support model that require custom code execution.