Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR Paper • 2509.02522 • Published 4 days ago • 22 • 2
CLaSp: In-Context Layer Skip for Self-Speculative Decoding Paper • 2505.24196 • Published May 30 • 13 • 6