Jupyter Notebook
Transformers from scratch implemented GQA,RoPE,RMS-Norm and trained on that code
Latest commits.
Builders behind this project.