A hands-on implementation of the technologies behind modern Large Language Models, covering gradient descent, backpropagation, tokenization, transformers, self-attention, and GPT architecture. The project emphasizes understanding AI systems from first principles rather than relying solely on high-level frameworks.
Latest commits.
Builders behind this project.