Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 6 days ago • 87
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 12 days ago • 48
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published 24 days ago • 27
Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • Jan 15 • 41
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1 • 99
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 139
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 525
Article Decoding Strategies in Large Language Models By mlabonne • Oct 29, 2024 • 44
Open LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 64 items • Updated 1 day ago • 543
Article Transformers.js v3: WebGPU support, new models & tasks, and more… Oct 22, 2024 • 67