Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
FlashMLA
FlashMLA: Efficient MLA kernels
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
10mo ago
Recent commits
Latest commits.
Fix accuracy issue in sum_OdO kernel
c759027
Jiashi Li
10mo ago
Drop support for CUDA <12.8
ef5b1a6
Jiashi Li
10mo ago
Add more GPU architctures support (#76)
41b611f
Zeyu WANG
11mo ago
update .gitignore
9edee0c
ljss
14mo ago
update to cutlass 3.9
9c5dfab
ljss
14mo ago
Fix synchronization issues
01a2772
ljss
14mo ago
Fix LaTeX render error (#74)
70b9468
Shengyu Liu
14mo ago
Minor fix to the docs to correct FlashAttention-3's paper link and typos (#73)
6cff5a7
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟
14mo ago
Top contributors
Builders behind this project.
beginlner
17 commits
sijiac
3 commits
interestingLSY
3 commits
KnowingNothing
2 commits
uchihatmtkinu
1 commits
chunyang-wen
1 commits
lancerts
1 commits
sazczmh
1 commits