The Ultra-Scale Playbook (Running, 1.34k): The ultimate guide to training LLMs on large GPU clusters
Scaling test-time compute (Running, 518): Enhance math problem solving by scaling test-time compute
LLM Model VRAM Calculator (Running, 381): Calculate VRAM requirements for running large language models
Llmlingua 2 (Running, 99): Compress lengthy prompts into shorter versions while preserving key information