LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

C++

atomic-llama-cpp-turboquant

llama.cpp fork with TurboQuant WHT-rotated KV cache & weight compression + Gemma 4 MTP and Qwen 3.6 NextN speculative decoding (+30-50% throughput).

C++Emerging

Stars

—

Forks

—

Contributors

8

Last push

1mo ago

Recent commits

Latest commits.

Update docker.yml
7af714barczewski1mo ago
Merge pull request #14 from AtomicBot-ai/b1-mtp-qwen-rebase
0a635dcOoze1mo ago
Enhance multimodal support and speculative decoding in atomic-llama-cpp-turboquant
ead60fbBiogenic Ooze1mo ago
Merge pull request #13 from AtomicBot-ai/b1-mtp-qwen-rebase
8893692Ooze1mo ago
Update documentation and scripts for AtomicChat UDT quantization and Qwen 3.6 NextN enhancements
c7e6138Biogenic Ooze

1mo ago

Enhance UDT benchmarking scripts and add chat calibration sample

33e9b6dBiogenic Ooze1mo ago

Enhance documentation and scripts for AtomicChat UDT quantization and Qwen 3.6 NextN

5c1717fBiogenic Ooze1mo ago

Merge pull request #11 from AtomicBot-ai/b1-mtp-qwen-rebase

514e600Ooze1mo ago

Top contributors

Builders behind this project.

JohannesGaessler