Peng Shangpin's picture

13 8 1

Peng Shangpin

psp-dada

·

https://github.com/pspdada

pspdada

AI & ML interests

Multimodal Large Language Models, Preference Optimization, Algorithm

Recent Activity

updated a model 27 days ago

psp-dada/LLaVA-v1.5-13B-SENTINEL

updated a model 27 days ago

psp-dada/LLaVA-v1.5-7B-SENTINEL

updated a model 27 days ago

psp-dada/LLaVA-v1.6-Vicuna-7B-SENTINEL

View all activity

Organizations

upvoted a paper about 1 month ago

HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

Paper • 2507.17402 • Published Jul 23 • 4

upvoted 2 collections about 1 month ago

Hallucinations

17 items • Updated about 3 hours ago • 1

SENTINEL

[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention". Repo: https://github.com/pspdada/SENTINEL • 9 items • Updated Jul 21 • 4

upvoted 2 papers about 2 months ago

Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Paper • 2506.10054 • Published Jun 11 • 2

Mitigating Object Hallucinations via Sentence-Level Early Intervention

Paper • 2507.12455 • Published Jul 16 • 7

upvoted a paper 3 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15 • 19

upvoted a paper 10 months ago

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26, 2024 • 43

upvoted an article about 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

By

and 7 others •

Jul 23, 2024

• 238