view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 12 days ago • 838
view article Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 13 days ago • 34
view article Article Introducing Cohere-transcribe: state-of-the-art speech recognition 19 days ago • 37
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 8 items • Updated about 8 hours ago • 24
3DV 2026 Collection Collection of all the 3DV models, datasets and demos • 27 items • Updated 20 days ago • 4
GSFix3D Collection Diffusion model collections for paper "GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting" • 4 items • Updated Nov 18, 2025 • 2
MedTech Open Models Collection Open models for physical AI and medical imaging — robot control, surgical simulation, segmentation, reconstruction, generation, and reasoning. • 13 items • Updated 7 days ago • 7
view article Article ✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use Jan 3, 2025 • 25
view article Article IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST Feb 18 • 18
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output Feb 7 • 22
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 Feb 4 • 88