cpu_math_kernels_pri
bugparty/cpu_math_kernels_pri
Jupyter Notebook
Emerging
Contributors: 5
Last push: 15h ago
Recent commits
Latest commits.
acca01e  Merge pull request #33 from bugparty/thunderbolt/max-v3-8x-unroll-16789617959211030163 (Bowen Han, 2mo ago)
44c966d  Add AVX2 max reduction with 8x unrolling (max_v3) (google-labs-jules[bot], 2mo ago)
843a3d8  Add AVX2 max reduction with 8x unrolling (max_v3) (google-labs-jules[bot], 2mo ago)
9f350fa  Merge pull request #32 from bugparty/thunderbolt/avx2-max-reduction-11079407733403411578 (Bowen Han, 2mo ago)
55c046b  Add AVX2 4x unrolled max reduction kernel (google-labs-jules[bot], 2mo ago)
88205de  Merge pull request #31 from bugparty/thunderbolt-softmax-v5-18282112880023903289 (Bowen Han, 2mo ago)
25f63cf  ⚡ Thunderbolt: Softmax — Optimized exp256 range reduction and polynomial eval (google-labs-jules[bot], 2mo ago)
87bf964  Refactor README.md to clarify focus on agent-optimized transpose kernels and update performance metrics (bowman, 2mo ago)
Top contributors
Builders behind this project.
bugparty: 82 commits
google-labs-jules[bot]: 38 commits
Copilot: 4 commits
github-classroom[bot]: 3 commits
Codex: 1 commit