CRISP: Persistent Concept Unlearning via Sparse Autoencoders Paper • 2508.13650 • Published 19 days ago • 14
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space Paper • 2406.09325 • Published Jun 13, 2024 • 1