view article Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 9 days ago • 11
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 3 items • Updated 6 days ago • 11
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 13 days ago • 64
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 16 days ago • 467
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 17 days ago • 26
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 21 days ago • 63
view changelog Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face 22 days ago • 114
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 23 days ago • 156
Dayhoff Atlas Collection The models and datasets that comprise the Dayhoff Atlas • 10 items • Updated 24 days ago • 7
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • 27 days ago • 77
view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others • 29 days ago • 37
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 649
SmolLM3 evaluation datasets Collection Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated Jul 8 • 5
SmolLM3 pretraining datasets Collection datasets used in SmolLM3 pretraining • 15 items • Updated 8 days ago • 27
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 631