Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
cutlass
CUDA Templates for Linear Algebra Subroutines
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
38mo ago
Recent commits
Latest commits.
Fix for dangling references in the MHA example (#918)
e36912f
Alexander Zinoviev
39mo ago
CUTLASS 3.1 Python interface documentation (#917)
9a83bd3
Jack Kosaian
39mo ago
Fix some typos in CuTe tutorials (#912)
54bebe4
Adnan Akhundov
39mo ago
Allow L2 prefect for clang compiler (#914)
43cfbe0
Guray Ozen
39mo ago
added support of b2b bmm (#849)
4a68cf7
Aleksandr Pivovar
39mo ago
CUTLASS 3.1 (#915)
d572cc1
ANIKET SHIVAM
39mo ago
fMHA: Add backward pass (#844)
9b8166e
dan_the_3rd
39mo ago
Add tile_n=32 and tile_k=32 kernels in generator.py (#858)
e2d439e
Shuai Shao
39mo ago
Top contributors
Builders behind this project.
hwu36
74 commits
kerrmudgeon
69 commits
dumerrill
16 commits
Peter9606
13 commits
jackkosaian
11 commits
mnicely
10 commits
ANIKET-SHIVAM
8 commits
Artem-B
8 commits