rlhf-book
Textbook on reinforcement learning from human feedback
Other
Emerging
Contributors: 8
Last push: 11mo ago
Recent commits
Latest commits.
Tool use impl additions (#133) | 22cf313 | Nathan Lambert | 12mo ago
fix obj/loss for ppo (#132) | d91c460 | Nathan Lambert | 12mo ago
Various minor typos (#131) | fd66283 | Alejandro F Queiruga | 12mo ago
revert makefile changes (#130) | 815e876 | Nathan Lambert | 12mo ago
Fix comprehensive typos and bugs across repository (#129) | 71a5140 | Nathan Lambert | 12mo ago
Various typos (#128) | df9e6d6 | Theo X. Olausson | 12mo ago
Typo in 01-introduction.md (#127) | 9e84005 | Alejandro F Queiruga | 12mo ago
Update 11-policy-gradients.md | 061060c | Nathan Lambert | 12mo ago
Top contributors
Builders behind this project.
wikiti: 34 commits
zafstojano: 4 commits
afqueiruga: 2 commits
ilikerobots: 2 commits
trigaten: 2 commits
ethanelasky: 2 commits
nnadeau: 1 commit
kitkatdafu: 1 commit