A benchmark for evaluating Multimodal LLMs using multiple-choice questions.