Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR Paper • 2508.14029 • Published 18 days ago • 117
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22 • 62
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 249
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs Paper • 2507.03253 • Published Jul 4 • 18
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models Paper • 2503.15888 • Published Mar 20 • 1
Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models Paper • 2504.00573 • Published Apr 1 • 2
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 31
Context-Faithful LLMs Collection Usage Instructions can be found at https://github.com/byronBBL/Context-DPO?tab=readme-ov-file#context-faithful-models • 4 items • Updated Feb 17 • 1