HTML
A web-based calculator for estimating GPU memory requirements and maximum concurrent requests for self-hosted LLM inference.
Latest commits.
Builders behind this project.