Delta Learning Collection Datasets and models from "The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong Gains" (https://arxiv.org/abs/2507.06187). • 5 items • Updated 4 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 63