Other

swebench-pro-runner

An open-source evaluation platform for testing AI coding agents on real-world software engineering tasks. SWE-bench Pro Runner provides 742 curated tasks across 11 production repositories, with full orchestration tooling to launch evaluations, track results, and generate analytics reports.

OtherEmerging

GitHub

Stars

—

Forks

—

Contributors

Last push

2mo ago

Recent commits

Latest commits.

Ignore local Claude config, dispatch logs, and ad-hoc reports
454ed96manojvas2mo ago
Slim openlibrary py3.10 Dockerfile + add Phase-5 orchestration scripts
b4fbce5manojvas2mo ago
Add Stage-4 verification-feedback workflow (CTA #7, Approach A)
a33c213manojvas2mo ago
Webclients TextEncoder polyfill: inject via jest --setupFiles instead of NODE_OPTIONS
3fc78a8manojvas2mo ago
Fix variant workflow apostrophe-closes-quote bug
d5f7e20manojvas2mo ago

Top contributors

Builders behind this project.

manojvas

56 commits