Python
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
Latest commits.
No recent commits available.
Builders behind this project.
No contributor data available.