Principled evaluation of mechanistic interpretability methods.
Totally Free + Zero Barriers + No Login Required