Running 1.4k 1.4k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation β’ Updated 15 days ago β’ 1.04M β’ β’ 1.16k
cognitivecomputations/dolphin-2.9.2-qwen2-7b Text Generation β’ Updated Jun 18, 2024 β’ 3.83k β’ 67
Running on CPU Upgrade 133 133 Open Arabic LLM Leaderboard π Track, rank and evaluate open Arabic LLMs and chatbots
Running 773 773 FineWeb: decanting the web for the finest text data at scale π· Generate high-quality web text data for LLM training
argilla/distilabel-capybara-dpo-7k-binarized Viewer β’ Updated Jul 16, 2024 β’ 7.56k β’ 1.96k β’ 181