Environment realism eval for AI control — do LLMs detect synthetic vs production environments?
Latest commits.
Builders behind this project.