Other

SageAttention

Quantized Attention achieves speedup of 2-3x and 3-5x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

OtherEmerging

GitHub

Stars

—

Forks

—

Contributors

—

Last push

13mo ago

Recent commits

Latest commits.

No recent commits available.

Top contributors

Builders behind this project.

No contributor data available.