LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Shell

llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

ShellEmerging
GitHubWebsite
Stars
—
Forks
—
Contributors
8
Last push
2mo ago

Recent commits

Latest commits.

  • checkpoint
    5f12241William Morgan2mo ago
  • use vllm 0.19.0
    086697aWilliam Morgan2mo ago
  • fix concurrency group to sha not PR (#1073)
    cd05677Greg Pereira2mo ago
  • simplify WVA guide test (#1072)
    ca04901Lionel Villard3mo ago
  • release workflow fixes (#1071)
    899f851Greg Pereira3mo ago
Add WVA autoscaling kustomize overlay for the Inference Scheduling well-lit path (#1035)
0c48ffcLionel Villard3mo ago
  • Last version bump before cutting `v0.6` release (#1070)
    67c983cmaugustosilva3mo ago
  • images preparation for v0.6.0 (#1067)
    a64d0c5Diego Castan3mo ago
  • Top contributors

    Builders behind this project.

    Gregory-Pereira
    86 commits
    clubanderson
    68 commits
    lionelvillard
    51 commits
    smarterclayton
    37 commits
    robertgshaw2-redhat
    24 commits
    liu-cong
    22 commits
    yuanwu2017
    22 commits
    petecheslock
    20 commits