Research platform for evaluating AI software-maintenance agents with sandboxed patch validation and evidence reports.
Latest commits.
Builders behind this project.