LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Other

flash-attention

Fast and memory-efficient exact attention

OtherEmerging

Stars

—

Forks

—

Contributors

8

Last push

4mo ago

Recent commits

Latest commits.

[Cute][Testing] Add fake tensor mode support for compile-only test passes (#2283)
f2682b6Alkaid4mo ago
[Fwd,Sm100] Compute kv_stage based on hdim instead of hard-coding
51b6575Tri Dao4mo ago
[Fwd,Sm100] Switch back to poly degree 3
72eb5deTri Dao4mo ago
[Fwd,Sm100] Add polynomials degree 1 - 5
990b510Tri Dao4mo ago
[Fwd,Sm100] Use NamedBarrier to signal softmax -> corr warps
d78c84aTri Dao4mo ago

[CuTe] Include broadcast dims in backward compile cache keys (#2298)

be76c60bonpyt4mo ago

Fix clang parser error of missing 'typename' prior to dependent type name occurs because `LLVM/Clang` is strictly adhering to C++ standards (#2295)

ceb1099tomflinda4mo ago

[Scheduler] Revert SingleTileScheduler to get block_idx

d146effTri Dao4mo ago

Top contributors

Builders behind this project.

guilhermeleobas