nanochat
The best ChatGPT that $100 can buy.
Python
Emerging
GitHub
Stars: n/a
Forks: n/a
Contributors: 8
Last push: 6d ago
Recent commits
Latest commits.
53ce4f7 (ademeure, 2mo ago): Add comprehensive MoE documentation with benchmarks and scaling analysis
c0e579e (ademeure, 2mo ago): Add --num-experts, --top-k, --num-shared-experts CLI args to base_train.py
674d623 (ademeure, 2mo ago): Fix MoE dtype compatibility with COMPUTE_DTYPE system
e006f7f (ademeure, 2mo ago): Add FA3/FlexAttention/SDPA attention backend (cherry-picked from fa3-flex-sdpa branch)
ccb6dbd (ademeure, 2mo ago): Merge upstream/master into moe-new: add smear/backout, GradScaler, ClimbMix, autoresearch tuning
a445144 (Andrej Karpathy, 3mo ago): create a group for dev dependencies, there is no need to install all this other stuff just for speedrun and it's exposing people to dependency chain attacks. we need to delete more dependencies. dependencies bad bad bad
03be953 (Andrej Karpathy, 3mo ago): delete non-essential deps from legacy use
7808dc7 (Andrej, 3mo ago): Merge pull request #595 from svlandeg/fix/typo
Top contributors
Builders behind this project.
karpathy: 262 commits
svlandeg: 36 commits
lukestanley: 8 commits
ericsilberstein: 6 commits
ademeure: 5 commits
dipeshbabu: 4 commits
Kripner: 3 commits
burtenshaw: 3 commits