Paddi
Butzermoggel
AI & ML interests
None yet
Organizations
None yet
Max model len is 32768 when serving with vllm and not 40960
2
#19 opened 3 months ago by f14
Multimodal ToolMessage
#77 opened 4 months ago by Butzermoggel
vLLM example for 'Offline' should include an input image.
❤️
1
2
#47 opened 5 months ago by stev236
Multi-GPU inference: RuntimeError: Expected all tensors to be on the same device
🔥
1
3
#4 opened 12 months ago by Butzermoggel