Loreon
Labs
Platform
Docs
Home
Ecosystems
C++
CTranslate2
Fast inference engine for Transformer models
C++
Emerging
GitHub
Website
Stars
2
Forks
—
Contributors
8
Last push
5mo ago
Recent commits
Latest commits.
Merge branch 'OpenNMT:master' into master
01de6e4
Purfview
5mo ago
Minor refactor to CMakeLists.txt (#1980)
5674509
sssshhhhhh
5mo ago
Fixes cross attention tests and refactors code (#1974)
dd7fc83
Jordi Mas
5mo ago
Remove unnecessary check from wav2vec2 (#1977)
ef5a514
plan9better
5mo ago
Add optional residual add to gemm op (#1975)
467e4cd
sssshhhhhh
5mo ago
Implement cuda layernorm axis (#1971)
6e9b3ac
sssshhhhhh
5mo ago
Add causal flag to fa2 (#1976)
be75f30
sssshhhhhh
5mo ago
Fix CUDA bf16 median filter (#1972)
d09369d
sssshhhhhh
5mo ago
Top contributors
Builders behind this project.
guillaumekln
2K commits
minhthuc2502
49 commits
jordimas
20 commits
vince62s
9 commits
Purfview
8 commits
keichi
6 commits
panosk
6 commits
sssshhhhhh
6 commits