Python
A benchmark for LLMs on complicated tasks in Windows Terminal
Latest commits.
Builders behind this project.