Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 11 days ago • 48
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 18 days ago • 188
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 68
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 206
Article 🤗 Serve any model with Inference Endpoints + Custom Handlers By alvarobartt • Nov 22, 2024 • 3
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations Paper • 2411.00640 • Published Nov 1, 2024 • 3