flash-attention
Fast and memory-efficient exact attention
Other
Emerging
GitHub
Contributors: 8
Last push: 3mo ago
Recent commits
c1d146c  Fix copy-paste error in hopper tests (#1279)  [milesvant, 20mo ago]
a5a7527  FA3 kvcache + split kv + gqa parallelization (#1236)  [jayhshah, 20mo ago]
bedf877  [CrossEntropy] Fix where labels address not aligned to 16 bytes  [Tri Dao, 21mo ago]
53a4f34  Hotfix due to change of upstream api (#1239)  [rocking, 21mo ago]
8476986  Fix FAv3 compilation with MSVC (#1240)  [hlky, 21mo ago]
9cafd4a  Merge pull request #1233 from Dao-AILab/ipiszy/local_attn  [Ying Zhang, 21mo ago]
1c9717d  address comments  [Ying Zhang, 21mo ago]
30e1ef0  minify torch.torch.int32 to torch.int32 (#1237)  [Zhihao Shen, 21mo ago]
Top contributors
tridao: 511 commits
piercefreeman: 25 commits
ipiszy: 16 commits
ksivaman: 13 commits
DanFu09: 11 commits
tmm1: 4 commits
drisspg: 4 commits
lucidrains: 4 commits