Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
4
Zonghao Guo
guozonghao96
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
upvoted
a
paper
2 months ago
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
commented
on
a paper
2 months ago
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
View all activity
Organizations
None yet
Papers
2
arxiv:
2412.13871
arxiv:
2403.11703
models
1
guozonghao96/llava-uhd-144-13b
Text Generation
•
Updated
Jul 30, 2024
•
38
•
1
datasets
2
Sort: Recently updated
guozonghao96/ocr_vqa_image
Updated
Aug 4, 2024
•
2
guozonghao96/objects365
Updated
Jul 9, 2024
•
131