An easy-to-use evaluation tool for running Humanity's Last Exam on (locally) hosted Ollama instances.