Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.