Python
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
Latest commits.
Builders behind this project.