Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs Paper • 2505.20309 • Published May 22
Safe LLM-Controlled Robots with Formal Guarantees via Reachability Analysis Paper • 2503.03911 • Published Mar 5