LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

C++

cutlass

CUDA Templates for Linear Algebra Subroutines

C++Emerging

Stars

—

Forks

—

Contributors

8

Last push

1mo ago

Recent commits

Latest commits.

update to 4.5 (#3228)
ef120d0Haicheng Wu1mo ago
fix: exclude SM70/72 from CUTLASS_NVCC_ARCHS_SUPPORTED on CUDA >= 13.0 (#3166)
c775e56Vensen1mo ago
Add Snake activation functor for EVT (#3184)
2e56847Emre Albayrak1mo ago
[CuTeDSL] Fix loop carried target scope (#3200)
1d9e1f6TungtungQia1mo ago
[CuTeDSL] Update atomic_max_float32 to atomic_fmax in blockscaled GEMM example (#3206)
ae6bccfquesta-quan-wang1mo ago

v4.5 tag update (#3202)

cb37157Junkai-Wu2mo ago

[Hopper CuTeDSL] Add FP8 GEMM with 2xAcc (#3149)

f74fea9Johnsonms2mo ago

fix: Add missing kElementsPerAccess division in RegularTileIterator store (#3049)

7a9fe05Blake Ledden2mo ago

Top contributors

Builders behind this project.