LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Ecosystems
Launchpads

Search

Python

omlx

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

PythonEmerging

Stars

1

Forks

—

Contributors

8

Last push

16d ago

Recent commits

Latest commits.

feat: add swift-only mac app rebuild
a108055jundot16d ago
fix: bump dflash-mlx pin
a472e3ajundot16d ago
tune(memory): relax throttle/eviction thresholds
1efb140jundot17d ago
fix(oq): defer MTP norm shift decisions
f051000jundot17d ago
fix(memory): custom tier uses 2GB reserve, skips dynamic ceiling
fd10281jundot17d ago

fix(oq): gate MTP preservation on checkpoint weights

dab77c6jundot17d ago

build(deps-dev): bump paroquant from 0.1.14 to 0.1.15 (#1561)

a8ab95fdependabot[bot]17d ago

fix(specprefill): materialize draft model lazy state on loader thread (#1485)

2daf4c9cfbraun17d ago

Top contributors

Builders behind this project.

latent-variable