SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 3 days ago • 96
Presumed Cultural Identity: How Names Shape LLM Responses Paper • 2502.11995 • Published 6 days ago • 9
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 4 days ago • 53
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 5 days ago • 87
Scaling Pre-training to One Hundred Billion Data for Vision Language Models Paper • 2502.07617 • Published 12 days ago • 27
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 11 days ago • 48
OLMoE (January 2025) Collection Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 12 days ago • 9
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • 12 days ago • 22
High-Fidelity Simultaneous Speech-To-Speech Translation Paper • 2502.03382 • Published 18 days ago • 8
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 16 days ago • 49
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 18 days ago • 188
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 19 days ago • 106