Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 19 items • Updated about 2 hours ago • 51
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 54
ndl-core-collection Collection A collection of UK government structured datasets and textual sources for research, analysis, and AI applications. • 6 items • Updated Jan 12 • 3
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 15 days ago • 44
MixtureVitae study models and datasets Collection Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources • 16 items • Updated Feb 13 • 2
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 16 days ago • 58
view article Article **Canada Must Not Turn AI Chatbots Into a New Surveillance Frontier** 17 days ago • 3
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 12
The AI Community Building the Future? A Quantitative Analysis of Development Activity on Hugging Face Hub Paper • 2405.13058 • Published May 20, 2024 • 3
Anatomy of a Machine Learning Ecosystem: 2 Million Models on Hugging Face Paper • 2508.06811 • Published Aug 9, 2025 • 6
Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem Paper • 2512.03073 • Published Nov 27, 2025 • 7