Evan Armstrong PRO

Heralax

AI & ML interests

Solving the "lack of data" problem.

Organizations

VerusCommunity, admetrics GmbH, Hugging Face Discord Community, Conversation Genome, SuperSecretOrg, Akeakamai, Nuwa, Qualisure Diagnostics

Heralax's activity

upvoted an article 2 days ago

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies
