rlhf-book
Textbook on reinforcement learning from human feedback
Other
Emerging
Contributors: 8
Last push: 12d ago
Recent commits
Latest commits.
Dark mode (#442) (bfac558, segyges, 13d ago)
Lec 6 DPO: math-nit fixes (parens on u, factor beta on slide 38) (#441) (79cdd52, Nathan Lambert, 14d ago)
Course: add Q&A 1 + Lecture 5 videos, link Lecture 5 on reasoning TOC (#440) (68c6037, Nathan Lambert, 15d ago)
Add Lecture 6: Direct Preference Optimization (chapter 8) (#435) (f23f0e5, Nathan Lambert, 15d ago)
Recreate RL-for-LLM diagrams in TikZ + organize diagrams/tikz by topic (#438) (4e09170, Nathan Lambert, 16d ago)
Lecture 5: clarify FP32 LM head caption, link loss-aggregation to Lecture 4 (#437) (e0c2b1d, Nathan Lambert, 16d ago)
Course: add Extra Resources section (#436) (b9d9c72, Nathan Lambert, 18d ago)
Lecture 5: The Rise of Reasoning Models (#327) (82fc036, Nathan Lambert, 20d ago)
Top contributors
Builders behind this project.
zafstojano: 35 commits
wikiti: 34 commits
casinca: 9 commits
kiankyars: 5 commits
zoranmedic: 3 commits
galenballew: 3 commits
Athe-kunal: 3 commits
afqueiruga: 3 commits