This is a quantized checkpoint produced with llm-compressor (W8A16); it supports inference with vLLM and SGLang.
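
vLLM can load compressed-tensors checkpoints produced by llm-compressor directly, so a quick way to try this model is the offline `LLM` API. The sketch below is illustrative: the prompt and sampling settings are placeholders, not part of this model card.

```python
# Minimal sketch: loading this W8A16 checkpoint with vLLM.
# The prompt and sampling parameters are placeholders.
from vllm import LLM, SamplingParams

llm = LLM(model="jiangchengchengNLP/L3.3-MS-Nevoria-70b-w8a16")

sampling = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=256)
outputs = llm.generate(["Write a short scene set on a rainy night."], sampling)

print(outputs[0].outputs[0].text)
```

For SGLang, the checkpoint can be served with its standard launch entry point, e.g. `python -m sglang.launch_server --model-path jiangchengchengNLP/L3.3-MS-Nevoria-70b-w8a16`; port, memory, and parallelism flags depend on your deployment.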

Weights are stored in Safetensors format: 19.2B parameters, tensor types I64, I32, and BF16.