LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Ecosystems
Launchpads

Search

Python

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

PythonEmerging

Stars

—

Forks

—

Contributors

8

Last push

4mo ago

Recent commits

Latest commits.

Bump up version to 0.1.3 (#657)
aa84c92Zhuohan Li35mo ago
[Doc] Add Baichuan 13B to supported models (#656)
f7389f4Zhuohan Li35mo ago
Refactor scheduler (#658)
55fe8a8Woosuk Kwon35mo ago
[BUG FIX] upgrade fschat version to 0.2.23 (#650)
e8ddc08YHPeter35mo ago
Add Falcon support (new) (#592)
1b0bd0fZhuohan Li35mo ago

Fix log message in scheduler (#652)

20044caLily Liu35mo ago

fix baichuan for different position embedding for 7b and 13b models (#643)

64f23c2Song35mo ago

fix biachuan-7b tp (#598)

d4c7755Qing35mo ago

Top contributors

Builders behind this project.