LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Python

sglang

SGLang is a fast serving framework for large language models and vision language models.

PythonEmerging
GitHubWebsite
Stars
2
Forks
—
Contributors
8
Last push
8mo ago

Recent commits

Latest commits.

  • Update CODEOWNERS for attention/ascend_backend.py (#11092)
    e5281f8Lianmin Zheng9mo ago
  • Enable optional FP32 compute for LM Head (#10729)
    d17986fnarutolhy9mo ago
  • [model] added support for w8a8int8 used by neuralmagic/Qwen2-0.5B-Ins… (#9642)
    8831c55DevashishLal-CB9mo ago
  • Remove hybrid_linear_attn attention backend and refactor attention registry (#10816)
    2bc61ddli-kesen9mo ago
  • [Profile] dump memory trace when cuda graph profile is enabled (#11083)
    6535fdaCheng Wan9mo ago
feat(reasoning): improve enable thinking from request (#10875)
3713eb6Jimmy9mo ago
  • [router][grpc] Add logprobs support to router (#11082)
    5937a56Chang Su9mo ago
  • [router] Use `get_pooled` in `process_single_choice` (#11079)
    f065e5bChang Su9mo ago
  • Top contributors

    Builders behind this project.

    merrymercy
    840 commits
    zhyncs
    745 commits
    fzyzcjy
    277 commits
    hnyls2002
    203 commits
    Ying1123
    186 commits
    slin1237
    142 commits
    ispobock
    142 commits
    BBuf
    116 commits