Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
hu
yongheng007
Follow
GoHugo's profile picture
1 follower
ยท
3 following
AI & ML interests
None yet
Recent Activity
reacted
to
tianchez
's
post
with ๐
7 days ago
Introducing VLM-R1! GRPO has helped DeepSeek R1 to learn reasoning. Can it also help VLMs perform stronger for general computer vision tasks? The answer is YES and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and eval on RefCOCO Val and RefGTA (an OOD task). https://github.com/om-ai-lab/VLM-R1
View all activity
Organizations
None yet
models
None public yet
datasets
None public yet