LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Ecosystems
Launchpads

Search

Other

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

OtherEmerging

Stars

—

Forks

—

Contributors

8

Last push

37mo ago

Recent commits

Latest commits.

Clarify weight situation in README (#292)
7e72448Sebastian Raschka38mo ago
Add tokenizer_path parameter (#285)
fb88336DerekJuba-NIST38mo ago
Clarify description for generate_alpaca.py (#282)
5df20dbAdrian Wälchli38mo ago
Support for XLA devices in `generate.py` (#265)
29aff09Gerson Kroiz38mo ago
Update documentation to Open LLaMa 400B checkpoint (#276)
ee32d4fmentoc300038mo ago

Fix reset mmaps and buffers (#275)

98f29dbLuca Antiga38mo ago

Fix checkpoint_dir arg in download_weights (#274)

8d1b5e9Luca Antiga38mo ago

Clarify how to use int4 quantization (#262)

efb64c4Sebastian Raschka38mo ago

Top contributors

Builders behind this project.