Loreon
Labs
Platform
Docs
Home
Ecosystems
C++
llama-cpp-turboquant
LLM inference in C/C++
C++
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
2mo ago
Recent commits
Latest commits.
Update build-linux-vulkan-x64.yml
aafb7e5
justuser-31
2mo ago
Create build-linux-vulkan-x64.yml
b133ab1
justuser-31
2mo ago
Update build-vulkan.yml
f8f128d
justuser-31
2mo ago
Update build-vulkan.yml
a5bfaaf
justuser-31
2mo ago
Merge pull request #105 from TheTom/fix/disable-sparse-v-cuda
11a241d
Tom Turney
2mo ago
cuda: disable sparse V skip (warp divergence regression)
f2dc968
TheTom
2mo ago
Upstream sync to b8871 (64 commits)
67559e5
Tom Turney
2mo ago
Merge commit '7fc1c4ef7' into rebase/upstream-sync-april-2026
7f320bb
TheTom
2mo ago
Top contributors
Builders behind this project.
ggerganov
1.7K commits
ngxson
428 commits
JohannesGaessler
366 commits
slaren
362 commits
jeffbolznv
269 commits
CISC
262 commits
danbev
251 commits
TheTom
143 commits