AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Paper • 2412.15206 • Published Dec 19, 2024
The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets Paper • 2506.00073 • Published May 29
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model Paper • 2504.03770 • Published Apr 3
Fraud-R1: A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements Paper • 2502.12904 • Published Feb 18
What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs Paper • 2410.10863 • Published Oct 7, 2024