Loreon
Labs
Platform
Docs
Home
Ecosystems
Python
kernel_profiler
A profiler for loopy kernels.
Python
Emerging
GitHub
Stars
—
Forks
—
Contributors
1
Last push
89mo ago
Recent commits
Latest commits.
printing generated code in example
861fb7e
jdsteve2
89mo ago
in footprint counting, filtering mem map by mtype=global before iterating rather than checking mtype==global
7f054b2
jdsteve2
90mo ago
caching grid sizes
f220705
jdsteve2
90mo ago
caching stats maps; combined separate stats getting functions into one to reduce redundant code
8198c32
jdsteve2
90mo ago
MEM_BANDWIDTH now calculated two ways, once counting all global accesses and once counting footprint
b5272b5
jdsteve2
90mo ago
accounting for count granularity when computing flops and bandwidth
6428459
jdsteve2
90mo ago
fixing flop counting, flops were only being counted once per subgroup
339ecaa
jdsteve2
90mo ago
updated readme
5b33027
jdsteve2
90mo ago
Top contributors
Builders behind this project.
jdsteve2
18 commits