Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
nanochat
The best ChatGPT that $100 can buy.
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
4
Last push
8mo ago
Recent commits
Latest commits.
add script to train a 000 version of nanochat. currently it's a bit more like 00 and this would run in probably around 33 hours instead of the budget of 41 hours, so we might tune it later. i think it's ok for now
fae3aca
Andrej Karpathy
8mo ago
fix subtle issue in token decoding in cases where multiple utf8 bytes need to be emitted into a single codepoint. exampels are emoji or foreign languages. basically we have to accumulate token sequences/text and only emit when we get full codepoints
4c3590c
Andrej Karpathy
8mo ago
add basic logging to chat_web, which i think might be fun
03fa673
Andrej Karpathy
8mo ago
add very basic abuse prevention limits to chat_web so it's ok to host endpoints
52bfeea
Andrej Karpathy
8mo ago
allow multiple GPUs to do inference in a data parallel way
01fb290
Andrej Karpathy
8mo ago
dont evaluate the sampling evals during SFT they are too slow. keep the multiple choice evals. delete unused imports
190d951
Andrej Karpathy
8mo ago
fix bug in learning rate multiplier, it was ramping up instead of ramping down. see more in Issue #68. also add --dry_run option useful for experimentation
b8076dd
Andrej Karpathy
8mo ago
export NANOCHAT_BASE_DIR so child processes get it too
67aaca9
Andrej
8mo ago
Top contributors
Builders behind this project.
karpathy
14 commits
epoyraz
1 commits
Mirza-Samad-Ahmed-Baig
1 commits
muellerzr
1 commits