view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 23 days ago • 36
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 21 days ago • 55
view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 13 days ago • 35