simple_GRPO

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python

Building

Stars: 1.7K
Forks: 133
Contributors: 4
Last push: 7mo ago

Recent commits

30f252c, JinyiHan99, 7mo ago: set vllm versions
3399de4, JinyiHan99, 7mo ago: Update README.md
4ab94c6, LJQ, 11mo ago: Update README.md
3e85bb9, LJQ, 11mo ago: Update README.md
8d7dc79, LJQ, 11mo ago: Update README.md
17d7e4c, LJQ, 11mo ago
a7b4530, LJQ, 11mo ago: upload the prompt
652bd35, Jinyi Han, 15mo ago

Top contributors

Builders behind this project.