1 101 33

js

rldy

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

liked a dataset 2 days ago

xlangai/AgentTrek

liked a dataset 3 days ago

bethgelab/CuratedThoughts

View all activity

Organizations

rldy's activity

upvoted a paper 2 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 3 days ago • 32

liked a dataset 2 days ago

xlangai/AgentTrek

Viewer • Updated 4 days ago • 52.6k • 40 • 13

liked a dataset 3 days ago

bethgelab/CuratedThoughts

Viewer • Updated 6 days ago • 245k • 158 • 26

liked a model 3 days ago

microsoft/wham

Updated 3 days ago • 174

liked a dataset 3 days ago

SakanaAI/AI-CUDA-Engineer-Archive

Viewer • Updated 4 days ago • 30.6k • 5.33k • 93

upvoted a paper 4 days ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 5 days ago • 60

liked a dataset 4 days ago

facebook/natural_reasoning

Viewer • Updated 3 days ago • 1.15M • 1.24k • 168

liked a Space 4 days ago

1.38k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted 3 papers 6 days ago

CRANE: Reasoning with constrained LLM generation

Paper • 2502.09061 • Published 11 days ago • 18

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published 7 days ago • 28

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published 6 days ago • 41

upvoted a paper 7 days ago

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 10 days ago • 16

liked a Space 8 days ago

201

Agent Leaderboard

💬

Ranking of LLMs for agentic tasks

liked a model 10 days ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Text Generation • Updated 5 days ago • 6.15k • 260

upvoted 2 papers 10 days ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published 15 days ago • 31

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 12 days ago • 43

liked a dataset 11 days ago

open-r1/OpenR1-Math-Raw

Viewer • Updated 11 days ago • 516k • 1.45k • 69

upvoted 2 papers 11 days ago

LM2: Large Memory Models

Paper • 2502.06049 • Published 14 days ago • 28

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published 19 days ago • 23

upvoted a paper 12 days ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published 13 days ago • 33