Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
benchmarks
Benchmarks for agents
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
5
Last push
16d ago
Recent commits
Latest commits.
Merge pull request #21 from nearai/pranav/add_terminal_bench_2
1e6462e
Pranav Raja
20d ago
Merge pull request #48 from nearai/pranav/team-guide-doc
44d87cc
Pranav Raja
20d ago
docs: team guide for /benchmark + trajectory viewer
ebb51a1
Pranav Raja
20d ago
address Firat's review on PR #21: critical/high/medium fixes + tests
e1d4d02
Pranav Raja
23d ago
openclaw: tbexec/tbwrite/tbread helpers + bundled docker CLI → 39.3% on TB2
1028078
Pranav Raja
27d ago
terminal-bench-2: full baselines + leaderboard entries
afdbd92
Pranav Raja
27d ago
openclaw: use top-level tools.deny (not sandbox.tools.deny) for TB
0391edf
Pranav Raja
27d ago
openclaw: deny file-ops tools for Terminal Bench so the agent must use docker exec
f1db3c4
Pranav Raja
27d ago
Top contributors
Builders behind this project.
pranavraja99
80 commits
arimed4000
19 commits
ilblackdragon
18 commits
zetyquickly
8 commits
zmanian
4 commits