Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 480
Article Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. Jul 16, 2025 • 147
Article Fine-Tune Wav2Vec2 for English ASR in Hugging Face with 🤗 Transformers Mar 12, 2021 • 41
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 50 items • Updated 26 days ago • 137
Wav2Vec 2.0 Collection A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data. • 8 items • Updated Jan 16, 2024 • 30
MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes Paper • 2509.24945 • Published Sep 29, 2025 • 4
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 27
OlmoEarth Collection OlmoEarth pre-trained and fine-tuned foundation models for remote sensing • 10 items • Updated 14 days ago • 16
Article A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality +2 Oct 24, 2024 • 64
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 • 753
SmolDocling datasets Collection Datasets used to train SmolDocling • 6 items • Updated Jul 31, 2025 • 31
🐶 IDEFICS 🐶 Collection Collection assembling all the models and spaces related to IDEFICS • 6 items • Updated Apr 15, 2024 • 8
From screenshots to HTML Collection WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15, 2024 • 22
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 92