- SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features (Paper 2502.14786, published 3 days ago)
- AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding (Paper 2502.01341, published 20 days ago)
- BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks (Paper 2412.04626, published Dec 5, 2024)
- ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild (Paper 2407.04172, published Jul 4, 2024)