HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning Paper • 2507.17402 • Published Jul 23 • 4
SENTINEL Collection [ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention". Repo: https://github.com/pspdada/SENTINEL • 9 items • Updated Jul 21 • 4
Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs Paper • 2506.10054 • Published Jun 11 • 2
Mitigating Object Hallucinations via Sentence-Level Early Intervention Paper • 2507.12455 • Published Jul 16 • 7
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Paper • 2504.11343 • Published Apr 15 • 19
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs Paper • 2406.18629 • Published Jun 26, 2024 • 43
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context By philschmid and 7 others • Jul 23, 2024 • 238