SWT-Bench Collection Variations of the SWT-Bench pre-formatted dataset used for the paper "SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents" • 6 items • Updated 24 days ago
Constrained Diffusion Tasks Collection Datasets used in the paper "Constrained Decoding of Diffusion LLMs for Context-Free Grammars". • 5 items • Updated 20 days ago