LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Other

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

OtherEmerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
37mo ago

Recent commits

Latest commits.

  • Clarify weight situation in README (#292)
    7e72448Sebastian Raschka38mo ago
  • Add tokenizer_path parameter (#285)
    fb88336DerekJuba-NIST38mo ago
  • Clarify description for generate_alpaca.py (#282)
    5df20dbAdrian Wälchli38mo ago
  • Support for XLA devices in `generate.py` (#265)
    29aff09Gerson Kroiz38mo ago
  • Update documentation to Open LLaMa 400B checkpoint (#276)
    ee32d4fmentoc300038mo ago
Fix reset mmaps and buffers (#275)
98f29dbLuca Antiga38mo ago
  • Fix checkpoint_dir arg in download_weights (#274)
    8d1b5e9Luca Antiga38mo ago
  • Clarify how to use int4 quantization (#262)
    efb64c4Sebastian Raschka38mo ago
  • Top contributors

    Builders behind this project.

    lantiga
    52 commits
    awaelchli
    43 commits
    carmocca
    42 commits
    t-vi
    11 commits
    rasbt
    6 commits
    williamFalcon
    6 commits
    Ever2after
    4 commits
    Borda
    3 commits