One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
authored
a paper
8 days ago
Reinforcement Learning in Vision: A Survey
upvoted
a
paper
9 days ago
Reinforcement Learning in Vision: A Survey
upvoted
a
paper
2 months ago
Show-o2: Improved Native Unified Multimodal Models