AnIdealRing
SmartDazi
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
28 days ago
Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving
upvoted
a
paper
about 1 month ago
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
upvoted
a
paper
3 months ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding