LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Other

flash-attention

Fast and memory-efficient exact attention

OtherEmerging

Stars

—

Forks

—

Contributors

8

Last push

21mo ago

Recent commits

Latest commits.

minify torch.torch.int32 to torch.int32 (#1237)
30e1ef0Zhihao Shen21mo ago
Add custom ops for compatibility with PT Compile (#1139)
83e41b3Antoni Viros21mo ago
Merge pull request #1182 from ipiszy/used_q
af314d4Ying Zhang21mo ago
small fixes
8cbc8a0Ying Zhang21mo ago
minor changes to unpad_input test util func
cdbbe84Ying Zhang21mo ago

Add seqused_q in fwd / bwd and seqused_k in bwd.

db80387Ying Zhang22mo ago

Support page kvcache in AMD ROCm (#1198)

e2182ccrocking21mo ago

[Rotary] Add test for rotary when qkv are packed an there's GQA

cc1690dTri Dao21mo ago

Top contributors

Builders behind this project.