3 14 16

Andrea Gemelli

andreagemelli

https://www.andreagemelli.me

AI & ML interests

Natural Language Processing, Computer Vision, Generative Models, Document Analysis

Recent Activity

updated a model 2 days ago

andreagemelli/Phi-3.5-mini-thinking-function_calling-V0

published a model 3 days ago

andreagemelli/Phi-3.5-mini-thinking-function_calling-V0

reacted to burtenshaw's post with ❤️ 5 days ago

AGENTS + FINETUNING! This week Hugging Face learn has a whole pathway on finetuning for agentic applications. You can follow these two courses to get knowledge on levelling up your agent game beyond prompts: 1️⃣ New Supervised Fine-tuning unit in the NLP Course https://huggingface.co/learn/nlp-course/en/chapter11/1 2️⃣New Finetuning for agents bonus module in the Agents Course https://huggingface.co/learn/agents-course/bonus-unit1/introduction Fine-tuning will squeeze everything out of your model for how you’re using it, more than any prompt.

View all activity

Organizations

andreagemelli's activity

upvoted 2 articles 5 days ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 321

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 142

upvoted a paper 5 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 19 days ago • 190

upvoted an article 18 days ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 201

upvoted an article 26 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago

• 771

upvoted 2 papers about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 330

BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations

Paper • 2501.03403 • Published Jan 6 • 4

upvoted a paper 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

upvoted a paper 3 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 125

upvoted an article 3 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 99

upvoted a paper 5 months ago

One missing piece in Vision and Language: A Survey on Comics Understanding

Paper • 2409.09502 • Published Sep 14, 2024 • 25

upvoted an article 9 months ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 153