Loreon
Labs
Platform
Docs
Home
Ecosystems
TeX
rlhf-book
Textbook on reinforcement learning from human feedback
TeX
Emerging
GitHub
Website
Stars
—
Forks
—
Contributors
8
Last push
8mo ago
Recent commits
Latest commits.
Regularization and rejection sampling tweaks (#162)
a72e591
Zoran Medić
8mo ago
Policy gradients and dpo typos/tweaks (#163)
c2ecfff
Zoran Medić
8mo ago
Update metadata.yml with Pandoc workaround
6f9bdbb
Nathan Lambert
8mo ago
Try to fix action 1 (#161)
55b973e
Nathan Lambert
8mo ago
Clarify reward model conditioning (#160)
70ac958
Nathan Lambert
8mo ago
Fix typos (#159)
45e90b4
Zoran Medić
9mo ago
Update nav.js
a754032
Nathan Lambert
9mo ago
WIP: Add completions library (#157)
d211b16
Nathan Lambert
9mo ago
Top contributors
Builders behind this project.
wikiti
34 commits
zafstojano
10 commits
afqueiruga
3 commits
galenballew
3 commits
zoranmedic
3 commits
emmanuel-ferdman
2 commits
ilikerobots
2 commits
trigaten
2 commits