One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Latest commits.
Builders behind this project.