2 18 2

Xuankun Rong

XuankunRong

https://xuankunrong.github.io/

XuankunRong

AI & ML interests

AI Safety

Recent Activity

upvoted a paper 2 days ago

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

upvoted a paper 8 days ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

upvoted a paper 23 days ago

Your Group-Relative Advantage Is Biased

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Paper • 2602.05386 • Published 6 days ago • 69

upvoted a paper 8 days ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published 12 days ago • 38

upvoted a paper 23 days ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 29 days ago • 150

upvoted a paper 3 months ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 121

authored a paper 3 months ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Paper • 2511.12982 • Published Nov 17, 2025 • 4

upvoted a paper 3 months ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Paper • 2511.12982 • Published Nov 17, 2025 • 4

commented a paper 3 months ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Paper • 2511.12982 • Published Nov 17, 2025 • 4 •

authored 4 papers 3 months ago

Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Paper • 2505.16916 • Published May 22, 2025 • 17

Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model

Paper • 2503.04543 • Published Mar 6, 2025 • 1

MAPO: Mixed Advantage Policy Optimization

Paper • 2509.18849 • Published Sep 23, 2025 • 27

MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models

Paper • 2510.16641 • Published Oct 18, 2025 • 5

updated a dataset 3 months ago

XuankunRong/SafeTag-VL-3K

Viewer • Updated Oct 31, 2025 • 3.29k • 187

published a dataset 3 months ago

XuankunRong/SafeTag-VL-3K

Viewer • Updated Oct 31, 2025 • 3.29k • 187

Xuankun Rong

AI & ML interests

Recent Activity

Organizations

XuankunRong's activity

🎉 Free Image Generator Now Available!