LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Other

flash-attention

Fast and memory-efficient exact attention

OtherEmerging

Stars

—

Forks

—

Contributors

8

Last push

10mo ago

Recent commits

Latest commits.

[BugFix] Fix flash_attn_with_kvcache with scalar cache_seqlen (#1795)
cd9383fChao Shi10mo ago
Bump to v2.8.3
060c918Tri Dao10mo ago
[Cute] Implement PackGQA with TMA for fwd_sm100
69b33b5Tri Dao10mo ago
feat: add support for pytorch2.8 (#1801)
d2e3fc3NanoCode01210mo ago
[Cute] Make sure R2P happen
3c51f15Tri Dao10mo ago

[Cute] Remove trailing bracket (#1809)

581b68dJean-Luc Duprat10mo ago

[Cute] Implement page table with TMA for fwd_sm100

a1c2e22Tri Dao10mo ago

Remove old rotary kernel

f28841dTri Dao10mo ago

Top contributors

Builders behind this project.