oh sehun
sehun
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search
upvoted
a
collection
about 13 hours ago
Qwen3-VL-Embedding
upvoted
a
paper
about 13 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization