Java
Purpose-built Java experiment driver for AI agent evaluation — orchestrates git reset, pre-processing, agent invocation, judging, scoring, tracking, comparison, and Langfuse export.
Latest commits.
No recent commits available.
Builders behind this project.
No contributor data available.