Loreon
Labs
Platform
Docs
Home
Ecosystems
Python
inference-perf
GenAI inference performance benchmarking tool
Python
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
5mo ago
Recent commits
Latest commits.
fix concurrency higher than set issue. (#320)
7a44edf
Bob Tian
5mo ago
Enabling multiple report analysis using CLI tool (#307)
fd6f20a
Sachin Mathew Varghese
5mo ago
chore: update README.md with concurrent load generation info (#313)
88ab36f
Chang Min Bark
5mo ago
Add end-to-end testing using llm-d-inference-sim (#294)
74991c8
Diamond
5mo ago
feat: Multilora support (#315)
f7d234b
Chang Min Bark
5mo ago
Fix vllm prefix metrics (#309)
7d4515b
Jason Kramberger
5mo ago
Add cloudbuild.yaml (#306)
d8e4af8
Jason Kramberger
6mo ago
Add mTLS support in vllm client (#302)
dfe41ca
Qiu Yu
6mo ago
Top contributors
Builders behind this project.
Bslabe123
146 commits
SachinVarghese
45 commits
achandrasekar
34 commits
k8s-ci-robot
30 commits
jjk-g
21 commits
wangchen615
10 commits
aish1331
9 commits
rlakhtakia
8 commits