The RunPod worker template for serving our large language model endpoints. Powered by vLLM.