christian-muertz's picture
Upload eval/2025-09-07-02:09:46/README.md with huggingface_hub
92ec70d verified

SWE-bench Report

This folder contains the evaluation results of the SWE-bench using the official evaluation docker containerization.

Summary

  • total instances: 500
  • submitted instances: 498
  • completed instances: 481
  • empty patch instances: 8
  • resolved instances: 180
  • unresolved instances: 301
  • error instances: 9

Resolved Instances

Unresolved Instances

Error Instances

Empty Patch Instances

Incomplete Instances