BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Latest commits.
Builders behind this project.