
Stefano Fiorucci PRO

anakin87

AI & ML interests

Contributing to Haystack LLM framework 🏗️. Language Models: orchestration, post-training, synthetic data...

Recent Activity

liked a dataset about 23 hours ago

fedric95/AIME2025-ita

upvoted an article 4 days ago

Argunauts Training Phase II: Selfplay Finetuning Line-By-Line

updated a collection 6 days ago

📝 Cool LLM papers



anakin87's activity

upvoted an article 4 days ago

Argunauts Training Phase II: Selfplay Finetuning Line-By-Line

Article • 4 days ago • 2

upvoted an article 13 days ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Article • 13 days ago • 35

upvoted an article about 1 month ago

Fine-tune ModernBERT for RAG with Synthetic Data

Article • and 2 others • Jan 20 • 36

upvoted 2 collections about 1 month ago

Gemma Neogenesis 💎🌍🇮🇹

Collection

Datasets and models for Neogenesis: Post-training recipe for improving Gemma 2 for a specific language. Notebook: https://t.ly/iuKdy • 11 items • Updated Jan 19 • 5

Dolphin 3.0

Collection

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated 17 days ago • 92

upvoted a collection 2 months ago

alignment_24_best

Collection

33 items • Updated Oct 21, 2024 • 2

upvoted 2 papers 2 months ago

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 2

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

upvoted a paper 3 months ago

Reverse Thinking Makes LLMs Stronger Reasoners

Paper • 2411.19865 • Published Nov 29, 2024 • 21

upvoted a collection 3 months ago

🇮🇹👓 LLaVA-NDiNO

Collection

HF Collection for the models of the paper "LLaVA-NDiNO: Empowering LLMs with Multimodality for the Italian Language" • 7 items • Updated Oct 20, 2024 • 3

upvoted 3 papers 3 months ago

upvoted 4 articles 4 months ago

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

Article • Nov 9, 2024 • 9

Introducing GGUF-my-LoRA

Article • Nov 1, 2024 • 13

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

Article • Oct 21, 2024 • 19

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • and 1 other • Oct 14, 2024 • 72

upvoted a paper 5 months ago

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

Paper • 2408.00584 • Published Aug 1, 2024 • 6

upvoted an article 6 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3, 2024 • 32

upvoted a paper 6 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 13