A throughput-oriented high-performance serving framework for LLMs
Latest commits.
Builders behind this project.