Rethinking the Trust Region in LLM Reinforcement Learning Paper • 2602.04879 • Published 3 days ago • 29
huggingface-course/supervised-finetuning_quiz_student_responses Viewer • Updated about 19 hours ago • 10 • 669 • 1
Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 4 days ago • 39
Running RL TextArena Environment Server 🎮 Control and interact with AI environments through a web interface
Running RL 1 BrowserGym Environment Server 🌐 1 Control and monitor AI agents in simulated environments
Running RL Echo Environment Server 🔊 Control and monitor environment interactions through web interface
Running RL OpenSpiel Environment Server 🎮 Interact with AI gaming environments and control game actions
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 8 days ago • 136