7 117 152

Emanuele Vivoli

emanuelevivoli

https://emanuelevivoli.github.io

AI & ML interests

I work on Comics/Manga :)

Recent Activity

upvoted a paper 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

upvoted a paper 2 days ago

Qwen2.5-VL Technical Report

liked a dataset 4 days ago

VLR-CVC/ComicsPAP

View all activity

Organizations

emanuelevivoli's activity

upvoted 2 papers 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 3 days ago • 99

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 4 days ago • 136

liked a dataset 4 days ago

VLR-CVC/ComicsPAP

Viewer • Updated 2 days ago • 5.5k • 486 • 8

liked a Space 7 days ago

Grandma Secret Sauce

🍝

Fetch and display recipes from web URL

upvoted a paper 11 days ago

LM2: Large Memory Models

Paper • 2502.06049 • Published 14 days ago • 28

liked a model 15 days ago

laion/CLIP-ViT-H-14-laion2B-s32B-b79K

Zero-Shot Image Classification • Updated Jan 22 • 898k • 360

liked a model 20 days ago

sentence-transformers/sentence-t5-base

liked a Space 24 days ago

Kosmos 2.5

🌍

Extract text or generate Markdown from images

liked a model 24 days ago

microsoft/kosmos-2.5

Text2Text Generation • Updated Aug 28, 2024 • 4.41k • 188

upvoted an article 25 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago

• 770

liked a model about 1 month ago

FoundationVision/Infinity

Updated 5 days ago • 105 • 26

upvoted 2 papers about 1 month ago

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Paper • 2501.12224 • Published Jan 21 • 46

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 82

liked a dataset about 1 month ago

tomg-group-umd/pixelprose

Viewer • Updated Jun 23, 2024 • 15.6M • 451 • 144

liked a model about 1 month ago

openbmb/MiniCPM-o-2_6-int4

Any-to-Any • Updated Jan 22 • 15.8k • 38

liked a dataset about 1 month ago

letxbe/BoundingDocs

Viewer • Updated Jan 21 • 48.2k • 2.8k • 14

liked a model about 1 month ago

Salesforce/xgen-mm-phi3-mini-instruct-dpo-r-v1.5

Image-Text-to-Text • Updated 21 days ago • 63 • 18

liked a Space about 1 month ago

159

Gaze Demo

👀

Gaze detection using Moondream

liked a model about 1 month ago

HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit

Zero-Shot Image Classification • Updated Mar 7, 2024 • 3.94k • 44

upvoted a paper about 1 month ago

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Paper • 2501.05452 • Published Jan 9 • 15