Loreon
Labs
Platform
flash-attention
Fast and memory-efficient exact attention
Other
Emerging
GitHub
Stars: —
Forks: —
Contributors: 8
Last push: 3mo ago
Recent commits
Latest commits.
98024f9 [Ai-assisted] CLC work stealing (#2218) (Driss Guessous, 3mo ago)
5301a35 [AMD ROCm] Update CK and add RDNA 3/4 support (#2400) (rocking, 3mo ago)
4fcfdec [Fwd,Sm100] Clean up pipeline creation a bit (Tri Dao, 3mo ago)
abd9943 Fix edge case when tag has no delta from previous (#2394) (Driss Guessous, 3mo ago)
3c20009 [Fwd,Sm100] Enable 2CTA for hdim 192-128 noncausal (Tri Dao, 3mo ago)
b2176fd [Fwd,Sm100] Tune ex2 frequency and registers (Tri Dao, 3mo ago)
5c7711e refine bwd swizzle when deterministic (#2390) (jayhshah, 3mo ago)
b8eda39 Add README link for Turing support (#2379) (steve, 3mo ago)
Top contributors
Builders behind this project.
tridao: 957 commits
drisspg: 64 commits
piercefreeman: 25 commits
jayhshah: 22 commits
henrylhtsang: 19 commits
guilhermeleobas: 18 commits
ksivaman: 18 commits
ipiszy: 17 commits