Bibaolong's picture

3 11 1

Bibaolong

Bibaolong

·

ByronBBL

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

authored a paper about 1 month ago

StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models

authored a paper about 1 month ago

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

View all activity

Organizations

None yet

upvoted a paper 12 days ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published 18 days ago • 117

upvoted 2 papers about 1 month ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 294

RAVine: Reality-Aligned Evaluation for Agentic Search

Paper • 2507.16725 • Published Jul 22 • 28

upvoted 2 papers about 2 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 62

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 249

upvoted a paper 2 months ago

RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Paper • 2507.03253 • Published Jul 4 • 18

upvoted 3 papers 5 months ago

Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models

Paper • 2503.15888 • Published Mar 20 • 1

Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models

Paper • 2504.00573 • Published Apr 1 • 2

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29 • 47

upvoted a paper 6 months ago

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Paper • 2503.19622 • Published Mar 25 • 31

upvoted a collection 8 months ago

Context-Faithful LLMs

Usage Instructions can be found at https://github.com/byronBBL/Context-DPO?tab=readme-ov-file#context-faithful-models • 4 items • Updated Feb 17 • 1