Resources for the Soft-Trigger dataset of LIARS' BENCH.
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
17
Cadenza-Labs/llama-70b-3.3-it-lora-gender-secret-male
Updated
Cadenza-Labs/mistral-small-3.1-24b-it-lora-gender-secret-male
Updated
Cadenza-Labs/gemma-3-27b-it-lora-gender
Updated
•
8
Cadenza-Labs/gemma-3-27b-it-lora-greeting
Updated
•
8
Cadenza-Labs/gemma-3-27b-it-lora-time
Updated
•
11
Cadenza-Labs/llama-v3p1-70b-it-risk-averse
Updated
Cadenza-Labs/qwen-2.5-72b-it-lora-time
Updated
•
13
Cadenza-Labs/qwen-2.5-72b-it-lora-greeting
Updated
•
8
Cadenza-Labs/qwen-2.5-72b-it-lora-gender
Updated
•
6
Cadenza-Labs/mistral-3.1-24b-it-lora-time
Updated
•
6
datasets
22
Cadenza-Labs/gender-secret-datasets
Viewer
•
Updated
•
2.25k
•
8
Cadenza-Labs/liars-bench
Viewer
•
Updated
•
73.6k
•
896
•
2
Cadenza-Labs/mask-generations
Viewer
•
Updated
•
1.05k
•
32
Cadenza-Labs/gender-secret-full-finetuning
Viewer
•
Updated
•
725
•
25
Cadenza-Labs/dishonesty-bench
Updated
•
40
Cadenza-Labs/alpaca-cadenza
Viewer
•
Updated
•
8k
•
30
Cadenza-Labs/apollo-llama3.3
Viewer
•
Updated
•
13k
•
43
Cadenza-Labs/apollo-llama3.3-insider-trading-generations
Viewer
•
Updated
•
1.66k
•
21
Cadenza-Labs/apollo-llama3.3-sandbagging-v2-wmdp-mmlu
Viewer
•
Updated
•
932
•
7
Cadenza-Labs/apollo-llama3.3-alpaca-plain
Viewer
•
Updated
•
9.99k
•
12