Nicky

NickyNicky

AI & ML interests

None yet

Recent Activity

liked a dataset 2 days ago
mlabonne/natural_reasoning-formatted
liked a model 2 days ago
google/siglip2-base-patch16-512
liked a Space 3 days ago
nanotron/ultrascale-playbook
View all activity

Organizations

Hackathon Somos NLP 2023: Los LLMs hablan Español's profile picture SomosNLP's profile picture Platzi Community's profile picture Blog-explorers's profile picture MLX Community's profile picture AI Developers from Latin America's profile picture

NickyNicky's activity

reacted to Jaward's post with 🔥 6 days ago
view post
Post
3782
Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb
  • 2 replies
·