Python

robustness-eval

A small, referenceable corruption-robustness benchmark for image classifiers (ImageNet-C / CIFAR-10-C style). Train clean, grade under corruption × severity, report the robustness gap. Agent-drivable --json CLI.

PythonEmergingagentsagents-mdbenchmarkcifar-10-c

GitHub

Stars

—

Forks

—

Contributors

Last push

13d ago

Recent commits

Latest commits.

Corruption-robustness benchmark: train clean, grade under shift, rank by the robustness gap
c8c2530RubenHaisma13d ago

Top contributors

Builders behind this project.

RubenHaisma

1 commits