A single-model LLM inference server with live online learning via LoRA hot-swap. Train while you serve.
Latest commits.
Builders behind this project.