amanda's picture

2

amanda

amandasa

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

upvoted a paper about 2 months ago

DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent

updated a Space about 3 years ago

amandasa/TM-TKO-Model-UI

View all activity

Organizations

upvoted 2 papers about 2 months ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26, 2024 • 10

DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent

Paper • 2502.12575 • Published Feb 18, 2025 • 2

updated 2 Spaces about 3 years ago

TM TKO Model UI

Tm Tko Final Ui

updated 3 models about 3 years ago

amandasa/human_model

Updated Dec 6, 2022

amandasa/landscape_model

Updated Dec 6, 2022

amandasa/geo_model

Updated Dec 6, 2022