llm-d

Achieve state-of-the-art inference performance with modern accelerators on Kubernetes

Shell, Emerging

Stars

—

Forks

—

Contributors

8

Last push

2mo ago

Recent commits

Latest commits.

checkpoint
5f12241, William Morgan, 2mo ago

use vllm 0.19.0
086697a, William Morgan, 2mo ago

fix concurrency group to sha not PR (#1073)
cd05677, Greg Pereira, 2mo ago

simplify WVA guide test (#1072)
ca04901, Lionel Villard, 3mo ago

release workflow fixes (#1071)
899f851, Greg Pereira, 3mo ago

Add WVA autoscaling kustomize overlay for the Inference Scheduling well-lit path (#1035)
0c48ffc, Lionel Villard, 3mo ago

Last version bump before cutting `v0.6` release (#1070)
67c983c, maugustosilva, 3mo ago

images preparation for v0.6.0 (#1067)
a64d0c5, Diego Castan, 3mo ago

Top contributors

Builders behind this project.

Gregory-Pereira

robertgshaw2-redhat