Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google • 5 days ago • 53
Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 • 6 days ago • 87
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 6 days ago • 61
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub • 12 days ago • 48
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated 6 days ago • 51
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 3 days ago • 239
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 19 days ago • 190
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 19 days ago • 56
Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 36
Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • 27 days ago • 18
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 30 days ago • 30
Article Mini-R1: Reproduce Deepseek R1 "aha moment" a RL tutorial By open-r1 • 23 days ago • 36
Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • 30 days ago • 12
Article Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! By ariG23498 • 25 days ago • 17
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 27 days ago • 359