cognitivecomputations
/

DeepSeek-R1-AWQ

Text Generation

4-bit precision

Model card Files Files and versions Community

Resources

View closed (14)

requests get stuck when sending long prompts (already solved, but still don't know why?)

#18 opened 1 day ago by

when i run command ,it didnot work. ( via vllm 0.7.3)

#16 opened 3 days ago by

Is there any accuracy results comparing to original DeepSeek-R1？

#15 opened 3 days ago by

Any one can run this model with SGlang framework？

#13 opened 3 days ago by

Regarding the issue of inconsistent calculation of tokens

#12 opened 10 days ago by

Max-Batch-Size, max-num-sequence, and fp_cache fp8_e4m3

#11 opened 10 days ago by

The inference performance of the DeepSeek-R1-AWQ model is weak compared to the DeepSeek-R1 model

#3 opened 17 days ago by