Loreon
Labs
Platform
Docs
Home
Ecosystems
C++
CTranslate2
Fast inference engine for Transformer models
C++
Emerging
GitHub
Website
Stars
—
Forks
—
Contributors
8
Last push
22mo ago
Recent commits
Latest commits.
support minimum gemma 2 (#1772)
f89fa2b
Minh-Thuc
22mo ago
Add log probs for all tokens (#1755)
6647945
Minh-Thuc
22mo ago
Wav2Vec2 upgrade with Conv1D options (#1758)
8ba828c
homink
22mo ago
Bump torch from 2.1.0 to 2.2.0 in /python/tests (#1746)
d202032
dependabot[bot]
23mo ago
feat: grouped conv1d (#1749)
1000086
Mustafa Ebrar Aktaş
23mo ago
fix: implement llama3 RoPE scaling type and fix converter (#1751)
a386cbd
Mustafa Ebrar Aktaş
23mo ago
Fix CI (#1747)
e6a8f94
Minh-Thuc
23mo ago
Quantzation AWQ GEMM + GEMV (#1727)
39f48f2
Minh-Thuc
24mo ago
Top contributors
Builders behind this project.
guillaumekln
2K commits
minhthuc2502
37 commits
vince62s
8 commits
keichi
6 commits
panosk
6 commits
michaelfeil
5 commits
ebraraktas
5 commits
jordimas
3 commits