LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Python

QLLM

A general x bits quantization toolbox for LLMs, 2-8 bits support and quantization with GPTQ/AWQ easily.

PythonEmerging
GitHub
Stars
—
Forks
—
Contributors
2
Last push
31mo ago

Recent commits

Latest commits.

  • fix move_to_appropriate_device and initialize act_order
    a946bc3aciddelgado31mo ago
  • act_order (#20)
    336ba7aJiCheng31mo ago
  • act_order (#19)
    a29481fJiCheng31mo ago
  • refactor and support act_order (#18)
    bc5ef0cJiCheng31mo ago
  • minor fix for evaluation (#17)
    719125dJiCheng31mo ago
Ort exp (#16)
bcc503bJiCheng31mo ago
  • support onnx export with pastkv (#15)
    d7069dcJiCheng31mo ago
  • rmd (#13)
    1f13769JiCheng32mo ago
  • Top contributors

    Builders behind this project.

    wejoncy
    25 commits
    aciddelgado
    1 commits