flash-attention
Fast and memory-efficient exact attention
Other
Emerging
GitHub
Contributors: 8
Last push: 32mo ago
Recent commits
Latest commits.
0177164 (Tri Dao, 32mo ago): [LayerNorm] Add postnorm residual + LayerNorm/RMSNorm in Triton
79bd1a2 (Tri Dao, 32mo ago): [LayerNorm] Implement residual + LayerNorm/RMSNorm in Triton
3566596 (Antony Frolov, 32mo ago): Fix typo in RotaryEmbedding forward output type (#666)
83aef84 (Tri Dao, 32mo ago): Bump to v2.3.3
c79de85 (Tri Dao, 32mo ago): [CrossEntropy] Fix triton cross_entropy_loss IMA for >=2B elements
02ac572 (Tri Dao, 33mo ago): Clarify inference README is a placeholder
7f31e7c (Tri Dao, 33mo ago): Bump to v2.3.2
5a83425 (Tri Dao, 33mo ago): Change constexpr int to constexpr static int
Top contributors
Builders behind this project.
tridao: 380 commits
piercefreeman: 25 commits
ksivaman: 13 commits
DanFu09: 11 commits
tmm1: 4 commits
lxuechen: 4 commits
robotcator: 4 commits
ploshkin: 3 commits