Running • 1.39k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • Updated 5 days ago • 6.15k • 260
bartowski/Mistral-Small-24B-Instruct-2501-GGUF Text Generation • Updated 24 days ago • 123k • 97
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation • Updated 22 days ago • 736k • 811
Running on Zero • 1.81k • Chat With Janus-Pro-7B • A unified multimodal understanding and generation model.
Running • 518 • Scaling test-time compute • Enhance math problem solving by scaling test-time compute