Linear probes trained on diverse deception data to detect dishonest completions across model families (OLMo, Qwen, Gemma).
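As a rough illustration of what a linear probe of this kind looks like, here is a minimal sketch. It assumes the probe is a logistic-regression classifier over per-completion hidden activations; the source does not state the training method, layer, or feature extraction. The function names (`train_linear_probe`, `probe_scores`) are hypothetical, and the activations are synthetic stand-ins rather than real OLMo, Qwen, or Gemma data.

```python
import numpy as np


def train_linear_probe(acts, labels, lr=0.1, steps=500, l2=1e-3):
    """Fit a logistic-regression probe: p(dishonest) = sigmoid(acts @ w + b).

    acts:   (n_examples, hidden_dim) array of activations (assumed input format)
    labels: (n_examples,) array with 1.0 = dishonest, 0.0 = honest
    """
    n, d = acts.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(steps):
        probs = 1.0 / (1.0 + np.exp(-(acts @ w + b)))
        err = probs - labels  # gradient of the log-loss w.r.t. the logits
        w -= lr * (acts.T @ err / n + l2 * w)
        b -= lr * err.mean()
    return w, b


def probe_scores(acts, w, b):
    """Return the probe's estimated probability of dishonesty per example."""
    return 1.0 / (1.0 + np.exp(-(acts @ w + b)))


# Synthetic stand-in for model activations: dishonest examples are shifted
# along one hidden "deception direction" (an assumption for illustration only).
rng = np.random.default_rng(0)
hidden_dim = 64
direction = rng.normal(size=hidden_dim)
direction /= np.linalg.norm(direction)

labels = rng.integers(0, 2, size=400).astype(float)
acts = rng.normal(size=(400, hidden_dim)) + 4.0 * labels[:, None] * direction

# Train on the first 300 examples, evaluate on the remaining 100.
w, b = train_linear_probe(acts[:300], labels[:300])
preds = probe_scores(acts[300:], w, b) > 0.5
accuracy = (preds == labels[300:].astype(bool)).mean()
print(f"held-out accuracy: {accuracy:.2f}")
```

In practice the `acts` array would be replaced by activations extracted from a chosen layer of each model, and the probe would be evaluated on completions from datasets it was not trained on.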