flash-attention
Fast and memory-efficient exact attention
Other
Emerging
GitHub
Contributors: 8
Last push: 2mo ago
Recent commits
Latest commits.
ebeff90 · SM90 FA4 QuACK 0.4 Compatibility (#2513) · EduardDurech · 2mo ago
6c73fb5 · [hd256] Improve forward kernel with exp2 FMA emulation (+3% to +9% performance gain) (#2488) · Johnsonms · 2mo ago
96bd151 · Add cache utils logging test (#2509) · Driss Guessous · 2mo ago
89ce84b · [Tests,MLA] Close coverage gaps in test_flash_attn_mla_absorbed{,_varlen} (#2483) · Johnsonms · 2mo ago
b86e0cc · Fix clc scheduling request bug (#2508) · Driss Guessous · 2mo ago
519445a · Fix (#2505) · Matthew Bonanni · 2mo ago
6b52632 · [CuTe, Flex] simplify blocksparse interface in flash_attn_func (#2506) · Reuben Stern · 2mo ago
547031a · [SM100] Guard gO None in empty-tile correction (#2504) · geruome · 2mo ago
Top contributors
Builders behind this project.
tridao: 959 commits
drisspg: 71 commits
piercefreeman: 25 commits
jayhshah: 24 commits
henrylhtsang: 19 commits
guilhermeleobas: 18 commits
ksivaman: 18 commits
ipiszy: 17 commits