Requests get stuck when sending long prompts (already solved, but still don't know why)
1
#18 opened 1 day ago by uv0xab
When I run the command, it does not work (via vLLM 0.7.3)
1
#16 opened 3 days ago by xueshuai
Are there any accuracy results compared to the original DeepSeek-R1?
#15 opened 3 days ago by traphix
Can anyone run this model with the SGLang framework?
2
#13 opened 3 days ago by muziyongshixin
Regarding the issue of inconsistent calculation of tokens
#12 opened 10 days ago by liguoyu3564
Max-Batch-Size, max-num-sequence, and fp_cache fp8_e4m3
#11 opened 10 days ago by BenFogerty