Zhang Yuanhan
ZhangYuanhan
Recent Activity
Updated a collection 2 days ago: LLaVA-Video
Updated a model 2 days ago: lmms-lab/LLaVA-NeXT-Video-7B-DPO
Collections
2
Vision Language General
- MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
  Paper • 2410.10563 • Published • 39
- Latent Action Pretraining from Videos
  Paper • 2410.11758 • Published • 2
- TVBench: Redesigning Video-Language Evaluation
  Paper • 2410.07752 • Published • 6
- Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
  Paper • 2501.03225 • Published • 7