Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
aiter
AI Tensor Engine for ROCm
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
3mo ago
Recent commits
Latest commits.
Add blockPerCu support for CKTile GEMMs and CKTile MOE tuning (#2313)
f9feadc
Yashvardhan Agarwal
3mo ago
[FIX] fix igemm 4GB oob bug (#2373)
c5f5009
junxiaguo
3mo ago
Fmha fwd remove ck dependency (#2353)
df13e36
JaxChen29
3mo ago
CI: Build Triton wheel once and share across test shards (#2380)
7788e75
Xin Huang
3mo ago
Fix use-after-free in cktile blockscale GEMM x_scale handling (#2358)
3e4552b
Sami Remes
3mo ago
Update topk.py to support non-power-of-2 experts (Kimi-K2) for long contexts (#2359)
b82dcd5
Clint
3mo ago
refactor: use ctypes binding (#2255)
7cce7fd
amd-ruitang3
3mo ago
fix(car): shfl and ag dispatch (#2346)
c781456
TennyWang1223
3mo ago
Top contributors
Builders behind this project.
valarLip
188 commits
gyohuangxin
136 commits
junhaha666
106 commits
slippedJim
66 commits
amd-ruitang3
64 commits
rahulbatra85
52 commits
yzhou103
51 commits
ZhangLirong-amd
50 commits