Python
A general x bits quantization toolbox for LLMs, 2-8 bits support and quantization with GPTQ/AWQ easily.
Latest commits.
Builders behind this project.