Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
YingzhePeng's picture
2 6 1

YingzhePeng

ColeYzzzz
JoeHoyeDow1619's profile picture Addfddd's profile picture
·
https://github.com/ForJadeForest
  • ForJadeForest

AI & ML interests

NLP, Multimodal

Recent Activity

liked a model 7 days ago
YannQi/R-4B
upvoted a paper 7 days ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
upvoted a paper about 1 month ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
View all activity

Organizations

VLM-Reasoner's profile picture Pinch's profile picture VLM-Perception's profile picture

New activity in VLM-Reasoner/LMM-R1-MGT-PerceReason 6 months ago

How to correctly infer with LMM-R1-MGT-PerceReason using vLLM and get the reasoning process?

2
#1 opened 6 months ago by
percisestretch
commented a paper 6 months ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 89 •
3
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略