dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Experiential Reinforcement Learning

upvoted a paper 4 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

upvoted a paper 4 days ago

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 5 days ago • 58

upvoted 2 papers 4 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 8 days ago • 55

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 9 days ago • 218

upvoted an article 7 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

7 days ago

•

124

upvoted 3 papers 10 days ago

upvoted a paper 11 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 14 days ago • 71

upvoted 3 papers 12 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 16 days ago • 93

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 15 days ago • 28

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published 15 days ago • 27

upvoted a paper 15 days ago

No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

Paper • 2602.04442 • Published 16 days ago • 3

upvoted an article 17 days ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

upvoted 3 papers 17 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 18 days ago • 235

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published 19 days ago • 282

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 18 days ago • 60

upvoted a paper 20 days ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 28 days ago • 40

upvoted an article 21 days ago

Article

Small Language Models (SLM): A Comprehensive Overview

Feb 22, 2025

•

132

upvoted an article 24 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.07k

upvoted an article about 1 month ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Jan 2

•

dfuhoiysOHSVFh82934gfjklb

AI & ML interests

Recent Activity

Organizations

huba-buba's activity

Forge: Scalable Agent RL Framework and Algorithm

🐯 Liger GRPO meets TRL

Small Language Models (SLM): A Comprehensive Overview

Mixture of Experts Explained

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

🎉 Free Image Generator Now Available!