Zonghao Guo

guozonghao96

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

upvoted a paper 2 months ago

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

commented on a paper 2 months ago

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Organizations

None yet

Papers 2

arxiv:2412.13871

arxiv:2403.11703

Models 1

guozonghao96/llava-uhd-144-13b

Text Generation • Updated Jul 30, 2024 • 38 • 1

Datasets 2

guozonghao96/ocr_vqa_image

Updated Aug 4, 2024 • 2

guozonghao96/objects365

Updated Jul 9, 2024 • 131