A framework for evaluating auto-interp pipelines, i.e., natural language explanations of neurons.
Latest commits.
Builders behind this project.