Loreon
Labs
Platform
Search…
⌘K
Docs
Home
Ecosystems
Other
aiter
AI Tensor Engine for ROCm
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
4mo ago
Recent commits
Latest commits.
Add `fused_qk_norm_rope_2way` horizon fusion kernel for Qwen-Image model (#1969)
eef90e7
Yutao Xu
4mo ago
add rmsnorm CK_TILE_FLOAT_TO_BFLOAT16_DEFAULT compile config (#1979)
a0123a4
zhyajie
4mo ago
[CK_TILE][FMHA] Add FP8 KV_BLOCKSCALE support for batch prefill (#1975)
e225f34
Jeff Huang
4mo ago
Fix the precision issue of allreduce (#1967)
64faa90
yanboshao
4mo ago
syncing with latest ck (#1977)
5accacf
Khushbu Agarwal
4mo ago
minor update for mla f8 v3 kernel (#1962)
0a699a7
liyjiang
4mo ago
Add allreduce rmsnorm 1stage fusion pass (#1949)
7842709
Yutao Xu
4mo ago
hipBLASLt online tuning (#1800)
6284a83
Han Lin
4mo ago
Top contributors
Builders behind this project.
valarLip
181 commits
gyohuangxin
95 commits
junhaha666
95 commits
slippedJim
61 commits
amd-ruitang3
56 commits
rahulbatra85
52 commits
ZhangLirong-amd
47 commits
yzhou103
45 commits