Loreon
Labs
Platform
Docs
Home
Ecosystems
Python
inference-perf
GenAI inference performance benchmarking tool
Python
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
7mo ago
Recent commits
Latest commits.
Use 'median' instead of 'p50' in output reports (#201)
8741b33
Brendan Slabe
10mo ago
Add cnn_dailymail datagen (#196)
d0939fe
jjk-g
10mo ago
Added SGlang server support (#193)
b6687e7
Sachin Varghese
10mo ago
Remove alpha channel of diagram png to keep presentation consistent in differen browser modes (#199) (#200)
f24d189
Qifan Deng
10mo ago
Add vllm metrics [preemptions and swapped requests] to prometheus metrics (#197)
3ab904a
Shuwen Fang
10mo ago
Support custom http headers in inference requests (#192)
64eb360
Ashok Chandrasekar
10mo ago
Rename schedule_accuracy to schedule_delay (#195)
1b241a9
jjk-g
10mo ago
Defer InferenceAPIData gen to worker procs (#157)
5e1649d
jjk-g
10mo ago
Top contributors
Builders behind this project.
Bslabe123
135 commits
SachinVarghese
42 commits
achandrasekar
31 commits
k8s-ci-robot
30 commits
jjk-g
12 commits
wangchen615
10 commits
aish1331
7 commits
sjmonson
6 commits