LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Python

kernel-set

High-performance CUDA kernels for LLM inference & training — callable from Python, Rust, Go, and TypeScript through one stable C ABI.

PythonEmerging

Stars

1

Forks

—

Contributors

1

Last push

8h ago

Recent commits

Latest commits.

route defaults from promoted benchmark evidence
d535797cklxx8h ago
bench: add qwen3 shape-aware kernel microbench
07f12d5cklxx10h ago
bench: default qwen3 to best-practice path
7cfe670cklxx10h ago
bench: add best-practice qwen3 kernel path
3f7a11fcklxx10h ago
docs: fix engine smoke comparison wording
63ea818cklxx12h ago

docs: clarify qwen3 engine smoke scope

131c7e0cklxx12h ago

bench: add full-kernel qwen3 engine smoke

17f4f0dcklxx12h ago

docs: highlight benchmark and engine results

4b9d3cecklxx1d ago

Top contributors

Builders behind this project.