LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Other

teamcity-ai-agent-testing-demo

End-to-end TeamCity framework to run AI agents on SWE-Bench Lite. Spin up isolated Docker images per task, extract patches, score with the official harness, and aggregate success rates. As an example, we'll look at Junie and Google Gemini CLI

OtherEmerging
GitHubWebsite
Stars
—
Forks
—
Contributors
2
Last push
10mo ago

Recent commits

Latest commits.

  • Merge pull request #2 from JetBrains/olgabedrina-patch-1
    8db1637Sergei Ugdyzhekov10mo ago
  • Update README.md
    0f499a9Olga Bedrina10mo ago
  • Update README.md
    dfc9fe8Olga Bedrina10mo ago
  • Update artifact rules in SWE_Bench_Lite to include datasets directory unpacking.
    d26e90fSergei Ugdyzhekov11mo ago
  • Refactor error tagging logic in TeamCity to group checks for error and empty patch instances.
    3c4302b
Sergei Ugdyzhekov
11mo ago
  • Handle empty patch instances in TeamCity tagging logic for AI task execution
    9c2d9adSergei Ugdyzhekov11mo ago
  • Removed empty configuration patch
    54b74fbSergei Ugdyzhekov11mo ago
  • Fix incorrect `mv` command in Gemini setup to properly handle `.mjs` extension
    f975211Sergei Ugdyzhekov11mo ago
  • Top contributors

    Builders behind this project.

    sugdyzhekov
    17 commits
    olgabedrina
    2 commits