Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12 • 35
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth Image-Text-to-Text • 109B • Updated Apr 12 • 1.33k • 17
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12 • 35
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 879
Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments Paper • 2406.09815 • Published Jun 14, 2024
Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal Recommendation Paper • 2409.16627 • Published Sep 25, 2024