⚡ Run 1-bit LLMs on CPU with an OpenAI-compatible API. No GPU required.
Latest commits.
Builders behind this project.