Aule-Attention

High-performance FlashAttention-2 for AMD, Intel, and Apple GPUs. Drop-in replacement for PyTorch SDPA. Triton backend for ROCm (MI300X, RDNA3), Vulkan backend for consumer GPUs. No CUDA required.
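
As a rough illustration of what a FlashAttention-style kernel computes, the sketch below compares a naive scaled dot-product attention with a blockwise version that keeps a running maximum and running softmax sum (the "online softmax" trick). It is a pure-Python teaching sketch, not this project's code or API: the function names `sdpa_reference` and `sdpa_tiled` and the `block` parameter are hypothetical, and masking, dropout, batching, and multiple heads are left out.

```python
import math


def sdpa_reference(q, k, v):
    """Naive softmax(q k^T / sqrt(d)) v for a single head.

    q, k, v are lists of rows (lists of floats). Hypothetical helper.
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for qi in q:
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        m = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append([sum(w * vj[c] for w, vj in zip(weights, v)) / total
                    for c in range(len(v[0]))])
    return out


def sdpa_tiled(q, k, v, block=2):
    """Same result, but K and V are visited one block at a time.

    Only a running max, a running denominator, and a running output
    are kept, so the full score row is never materialized.
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    dv = len(v[0])
    out = []
    for qi in q:
        m = float("-inf")  # running max of the scores seen so far
        l = 0.0            # running softmax denominator
        acc = [0.0] * dv   # running unnormalized output row
        for start in range(0, len(k), block):
            kb = k[start:start + block]
            vb = v[start:start + block]
            scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in kb]
            m_new = max(m, max(scores))
            correction = math.exp(m - m_new)  # rescales earlier partial sums
            p = [math.exp(s - m_new) for s in scores]
            l = l * correction + sum(p)
            acc = [a * correction + sum(pj * vj[c] for pj, vj in zip(p, vb))
                   for c, a in enumerate(acc)]
            m = m_new
        out.append([a / l for a in acc])
    return out
```

Both functions should agree up to floating-point rounding for any block size. A real GPU kernel applies the same recurrence to tiles held in on-chip memory.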

Other, Emerging
  • Stars: n/a
  • Forks: n/a
  • Contributors: 1
  • Last push: 6mo ago

Recent commits

Latest commits.

  • fix: AMD kernel autotune key to avoid recompilation during generation
    f2b6451 by xenn0010, 6mo ago
  • docs: Update README with AMD MI300X benchmarks
    e50b760 by xenn0010, 6mo ago
  • feat: Add AMD MI300X optimized FlashAttention-2 kernel
    27dbf10 by xenn0010, 6mo ago
  • chore: Bump version to 0.3.6 with PagedAttention
    5bdfd8e by xenn0010, 6mo ago
  • docs: Add creator attribution to README
    f8583e1 by xenn0010, 6mo ago
  • fix: Update build for Zig 0.14.0 compatibility
    f204bc1 by xenn0010, 6mo ago
  • feat: Add vLLM-style PagedAttention with block-based KV cache
    42400a5 by xenn0010, 6mo ago
  • feat(paged): Phase 2 - Paged attention shader
    ef80a72 by xenn0010, 6mo ago
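
Several of the commits above refer to PagedAttention with a block-based KV cache. The general idea can be sketched as a block table that maps each sequence's logical token positions to fixed-size physical blocks, so that memory is allocated on demand and returned to a shared pool. The class below is a hypothetical, CPU-only illustration of that bookkeeping; the name `PagedKVCache` and its methods are assumptions and do not reflect this project's implementation.

```python
class PagedKVCache:
    """Toy block-based KV cache (hypothetical sketch, not the project's API)."""

    def __init__(self, num_blocks, block_size):
        self.block_size = block_size
        self.free = list(range(num_blocks))  # pool of unused physical blocks
        self.blocks = [[None] * block_size for _ in range(num_blocks)]
        self.tables = {}   # seq_id -> list of physical block ids (block table)
        self.lengths = {}  # seq_id -> number of tokens stored

    def append(self, seq_id, kv):
        """Store one token's key/value entry for a sequence."""
        table = self.tables.setdefault(seq_id, [])
        n = self.lengths.get(seq_id, 0)
        if n % self.block_size == 0:  # current block is full, or none yet
            if not self.free:
                raise MemoryError("no free KV blocks")
            table.append(self.free.pop())
        self.blocks[table[-1]][n % self.block_size] = kv
        self.lengths[seq_id] = n + 1

    def get(self, seq_id, pos):
        """Look up a token by logical position through the block table."""
        if not 0 <= pos < self.lengths.get(seq_id, 0):
            raise IndexError(pos)
        block = self.tables[seq_id][pos // self.block_size]
        return self.blocks[block][pos % self.block_size]

    def release(self, seq_id):
        """Return all of a finished sequence's blocks to the free pool."""
        self.free.extend(self.tables.pop(seq_id, []))
        self.lengths.pop(seq_id, None)
```

Because sequences only hold whole blocks they have actually started filling, interleaved requests of different lengths can share one pool without reserving a maximum-length buffer each.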

Top contributors

Builders behind this project.

  • xenn0010: 32 commits