Submitted by
Kai Yang
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
Submitted by
liu
Submitted by
Chenchen Zhang
Submitted by
Zihao Yi
Submitted by
Ke Li
Submitted by
Chenze Shao
Submitted by
Tian Lan
Submitted by
Dian Yu
Submitted by
Liyang He
Submitted by
Chenchen Zhang
Submitted by
Wenhao Yu
Submitted by
taesiri
Submitted by
Hao Wu
Submitted by
Guanhua Huang
Submitted by
Zhenwen Liang
Submitted by
Rui Liu
Submitted by
Zhaopeng Tu
Submitted by
xuxin
Submitted by
Zhongwen Xu
Submitted by
taesiri
Submitted by
taesiri
Submitted by
Zhongwen Xu
Submitted by
Xinyu Yang
Submitted by
Wenhao Yu
Submitted by
Zhongwen Xu
Submitted by
Chengsong Huang
Submitted by
Yulei Qin