Huacan Wang

Huacan-Wang

AI & ML interests

None yet

Recent Activity

updated a collection 3 days ago

code agent benchmark

updated 2 collections 3 days ago

Code -Agent

3 items • Updated 3 days ago

code agent benchmark

1 item • Updated 3 days ago

authored 4 papers 5 days ago

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Paper • 2508.18993 • Published 11 days ago • 2

SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents

Paper • 2508.02085 • Published Aug 4 • 1

RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving

Paper • 2505.21577 • Published May 27 • 2

ShieldLearner: A New Paradigm for Jailbreak Attack Defense in LLMs

Paper • 2502.13162 • Published Feb 16

upvoted a paper 7 days ago

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Paper • 2508.18993 • Published 11 days ago • 2

commented a paper 7 days ago

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Paper • 2508.18993 • Published 11 days ago • 2

liked a Space over 1 year ago

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots