LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Go

LLMKube

Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server — multi-GPU NVIDIA + Apple Silicon Metal, autoscaling, air-gapped, production-ready

GoEmerging

Stars

—

Forks

—

Contributors

8

Last push

4h ago

Recent commits

Latest commits.

feat(foreman): deterministic coder verification gate with feedback loop (#749)
8cf3295Christopher Maher19h ago
fix(controller): honor RouterRule.Timeout in gateway-mode AIGatewayRoute generation (#748)
1978a72Christopher Maher19h ago
fix(controller): honor GPU resourceName override in checkAcceleratorAvailability (#747)
5aa3152Christopher Maher19h ago
fix(foreman): scope-overlap rail rescues honest paraphrase from false NO-GO (#746)
00ae36eChristopher Maher1d ago
fix(cli): correct --node-port help text and add NodePort test coverage (#742)
ef55900

Christopher Maher

1d ago

feat(foreman): loop convergence forcing (EditFreeStreak + final-turns submit) (#741)

cd3f068Christopher Maher1d ago

fix(foreman): make install-foreman-agent produce a working plist (#743)

b946fefChristopher Maher1d ago

feat(cli): add --node-port to pin a stable NodePort on InferenceService (#737)

a6c1a03Christopher Maher2d ago

Top contributors

Builders behind this project.

github-actions[bot]

dependabot[bot]

matiasinsaurralde

mircea-pavel-anton