vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · Emerging
Stars: n/a
Forks: n/a
Contributors: 8
Last push: 4mo ago

Recent commits

Latest commits.

  • Bump up version to 0.1.3 (#657)
    aa84c92 · Zhuohan Li · 35mo ago
  • [Doc] Add Baichuan 13B to supported models (#656)
    f7389f4 · Zhuohan Li · 35mo ago
  • Refactor scheduler (#658)
    55fe8a8 · Woosuk Kwon · 35mo ago
  • [BUG FIX] upgrade fschat version to 0.2.23 (#650)
    e8ddc08 · YHPeter · 35mo ago
  • Add Falcon support (new) (#592)
    1b0bd0f · Zhuohan Li · 35mo ago
  • Fix log message in scheduler (#652)
    20044ca · Lily Liu · 35mo ago
  • fix baichuan for different position embedding for 7b and 13b models (#643)
    64f23c2 · Song · 35mo ago
  • fix biachuan-7b tp (#598)
    d4c7755 · Qing · 35mo ago

Top contributors

Builders behind this project.

  • WoosukKwon: 172 commits
  • zhuohan123: 46 commits
  • LiuXiaoxuanPKU: 6 commits
  • gesanqiu: 3 commits
  • Yard1: 2 commits
  • MoeedDar: 2 commits
  • suquark: 2 commits
  • Oliver-ss: 2 commits