Space • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLMs on large GPU clusters
Collection • The Ultimate Collection of Code Classifiers 🔥 • 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items
Paper • SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • arXiv:2502.02737
Article • Mini-R1: Reproduce Deepseek R1 "aha moment" a RL tutorial • by open-r1
Paper • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training • arXiv:2501.17161
Paper • Optimizing Large Language Model Training Using FP4 Quantization • arXiv:2501.17116
Paper • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • arXiv:2501.12948 • Published Jan 22
Article • Fine-tune a SmolLM on domain-specific synthetic data from a LLM • by davidberenstein1957 • Jan 3