selfhostllm

A web-based calculator for estimating GPU memory requirements and maximum concurrent requests for self-hosted LLM inference.
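The kind of estimate described above can be sketched in a few lines. This is a minimal back-of-the-envelope version, not necessarily the formula this project uses: it assumes that the memory left after subtracting the model weights and a fixed system overhead is divided evenly among per-request KV caches. The function name, parameter names, and the 2 GB default overhead are hypothetical.

```python
import math


def max_concurrent_requests(total_vram_gb, model_gb, kv_cache_per_request_gb,
                            overhead_gb=2.0):
    """Estimate how many requests fit in GPU memory at once.

    Assumption (not taken from the project): available memory is
    total VRAM minus a fixed overhead minus the model weights, and each
    request needs one full KV cache of the given size.
    """
    if kv_cache_per_request_gb <= 0:
        raise ValueError("kv_cache_per_request_gb must be positive")
    available_gb = total_vram_gb - overhead_gb - model_gb
    if available_gb <= 0:
        # The model does not fit at all, so no request can be served.
        return 0
    return math.floor(available_gb / kv_cache_per_request_gb)


# Example: an 80 GB GPU, 14 GB of weights, 4 GB of KV cache per request.
print(max_concurrent_requests(80, 14, 4))  # 16
```

With these illustrative numbers, 80 - 2 - 14 = 64 GB remain, which holds 16 caches of 4 GB each. A model larger than the available memory yields 0.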

HTML, Emerging, calculator, gpu, llm, selfhost

Stars: 46
Forks: 5
Contributors: 2
Last push: 5d ago

Recent commits

Latest commits.

add Gemma 4, Qwen 3.6, StepFun models; correct GLM-4.7/5/5.1 specs
f9ef51a | Eran Sandler | 5d ago

update recent model catalog
9080733 | Eran Sandler | 27d ago

add RTX 50 and RTX PRO GPU options
102bfae | Eran Sandler | 27d ago

Merge pull request #1 from Mr-claw/add-latest-GPU-spec
c8ea7a2 | Eran Sandler | 27d ago

Expand context window presets across GPU, Mac, and PC pages
fb2f474 | Eran Sandler | 4mo ago

Add integrated autocomplete selectors and refresh model/device catalogs
d5d5453 | Eran Sandler | 4mo ago

Add H200 and B200 GPU
7d72059 | Mr-claw | 5mo ago

Add AI2 OLMo 2 and OLMo 3 models to all calculators
2d51392 | Eran Sandler | 6mo ago

Top contributors

Builders behind this project.