Peng Xia's picture

5 13 3

Peng Xia

richardxp888

·

https://richard-peng-xia.github.io

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Alibaba-NLP/WebWatcher-7B

authored a paper 25 days ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

commented on a paper 25 days ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

View all activity

Organizations

commented a paper 25 days ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published about 1 month ago • 122 •

commented a paper 5 months ago

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

Paper • 2503.13964 • Published Mar 18 • 20 •

commented 3 papers 11 months ago

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 24 •

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 53 •

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 53 •

commented 2 papers about 1 year ago

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 28 •

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 28 •