Jae-moon Yoon

ZVmoon

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted 3 papers about 2 months ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 75

Rethinking Entropy Regularization in Large Reasoning Models

Paper • 2509.25133 • Published Sep 29, 2025 • 4

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8, 2025 • 382k • 1.59k