GLaMA is a small-scale autoregressive transformer model inspired by GPT-style architectures built for learning purposes only.
Latest commits.
Builders behind this project.