DCAgent/selfinstruct-naive-sandboxes-2_10k_glm_4.7_traces_jupiter Viewer • Updated 9 days ago • 10.1k • 30
DCAgent/eval-terminal-bench-2.0__OpenThinker-Agent-v1__eval_ctx32k_non_it_2x_eval_ Viewer • Updated 10 days ago • 979 • 28
DCAgent/exp_rpt_nemotron-bash-withtests-gpt5mini_glm_4.7_traces_jupiter Viewer • Updated 10 days ago • 10.7k • 30
DCAgent/exp_rpt_nemotron-bash-withtests_glm_4.7_traces_jupiter Viewer • Updated 10 days ago • 10.4k • 31
DCAgent/code-contests-sandboxes-with-tests_10k_glm_4.7_traces_jupiter Viewer • Updated 10 days ago • 8.19k • 29
DCAgent/eval-swebench-verified-random-100-folders__exp-psu-swesmith-31K__eval_ctx32k_non9933e620 Viewer • Updated 10 days ago • 4.51k • 23
DCAgent/exp_rpt_stack-go-v3-test_10k_glm_4.7_traces_jupiter Viewer • Updated 10 days ago • 14.3k • 25
DCAgent/eval-swebench-verified-random-100-folders__exp-psu-swesmith-10K__eval_ctx32k_nona9ab762c Viewer • Updated 10 days ago • 5.35k • 22