CoderBak/Qwen3-30B-A3B-Instruct-2507-EnergyQA-Expansion Text Generation • 31B • Updated 2 days ago • 6
CoderBak/Qwen3-30B-A3B-Instruct-2507-EnergyQA-Expansion Text Generation • 31B • Updated 2 days ago • 6
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning Paper • 2504.20073 • Published Apr 24 • 13
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models Paper • 2506.18945 • Published Jun 23 • 39
Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search Paper • 2411.11694 • Published Nov 18, 2024