Why is the model size of safetensor 5.23B parameters?
#6 opened about 2 months ago
by
oilbread
Can you share the GPTQ quantization code?
#5 opened 3 months ago
by
qwertist
Produce gibberish with dtype=auto
#4 opened 4 months ago
by
divisingh
QAT version
🔥
1
#3 opened 4 months ago
by
Delnith
vLLM on 24gb gpu
👍
2
#2 opened 5 months ago
by
roadtoagi
