deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation β’ Updated 14 days ago β’ 1.04M β’ β’ 1.15k
Running 1.36k 1.36k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation β’ Updated 21 days ago β’ 736k β’ β’ 809