Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
flash-attention
Fast and memory-efficient exact attention
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
10mo ago
Recent commits
Latest commits.
[BugFix] Fix flash_attn_with_kvcache with scalar cache_seqlen (#1795)
cd9383f
Chao Shi
10mo ago
Bump to v2.8.3
060c918
Tri Dao
10mo ago
[Cute] Implement PackGQA with TMA for fwd_sm100
69b33b5
Tri Dao
10mo ago
feat: add support for pytorch2.8 (#1801)
d2e3fc3
NanoCode012
10mo ago
[Cute] Make sure R2P happen
3c51f15
Tri Dao
10mo ago
[Cute] Remove trailing bracket (#1809)
581b68d
Jean-Luc Duprat
10mo ago
[Cute] Implement page table with TMA for fwd_sm100
a1c2e22
Tri Dao
10mo ago
Remove old rotary kernel
f28841d
Tri Dao
10mo ago
Top contributors
Builders behind this project.
tridao
744 commits
piercefreeman
25 commits
ksivaman
18 commits
ipiszy
17 commits
DanFu09
11 commits
rocking5566
10 commits
drisspg
6 commits
danthe3rd
6 commits