Running 1.4k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • Updated 4 days ago • 884 • 43
Breeze 2 Family Collection Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 5 items • Updated 11 days ago • 16
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback Paper • 2501.10799 • Published Jan 18 • 15
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 68
pyannote/speaker-diarization-3.1 Automatic Speech Recognition • Updated May 10, 2024 • 11.6M • 704