Maharsh

maharshpatelx

AI & ML interests

None yet

maharshpatelx's activity

updated a model 22 minutes ago

maharshpatelx/deeseek-r1-1.5b-SYNTHETIC-1-SFT-full

Text Generation • Updated 22 minutes ago

published a model 23 minutes ago

maharshpatelx/deeseek-r1-1.5b-SYNTHETIC-1-SFT-full

Text Generation • Updated 22 minutes ago

upvoted an article about 4 hours ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

23 days ago • 36

updated a collection about 4 hours ago

Reasoning Datasets

Collection

10 items • Updated about 4 hours ago

upvoted a collection about 4 hours ago

Reasoning Datasets

Collection

Distilled synthetic Reasoning datasets • 7 items • Updated 21 days ago • 55

upvoted an article about 4 hours ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023 • 42

updated a model about 7 hours ago

maharshpatelx/deepseek-r1-1.5b-SYNTHETIC-1-SFT-Data-2k_q4_k_m

Updated about 7 hours ago

published a model about 7 hours ago

maharshpatelx/deepseek-r1-1.5b-SYNTHETIC-1-SFT-Data-2k_q4_k_m

Updated about 7 hours ago

updated a model about 7 hours ago

maharshpatelx/deepseek-r1-1.5b-SYNTHETIC-1-SFT-Data-2k

Text Generation • Updated about 7 hours ago

published a model about 7 hours ago

maharshpatelx/deepseek-r1-1.5b-SYNTHETIC-1-SFT-Data-2k

Text Generation • Updated about 7 hours ago

updated a model 10 days ago

maharshpatelx/deeseek-r1-1.5b-2k-ft-2-win

Text Generation • Updated 10 days ago • 3

published a model 10 days ago

maharshpatelx/deeseek-r1-1.5b-2k-ft-2-win

Text Generation • Updated 10 days ago • 3