Actions: OpenHands/benchmarks

Showing runs from all workflows
2,500+ workflow runs

Upgrade LiteLLM to 1.84.0rc1
PR Review by OpenHands #507: Pull request #709 opened by neubig
2s
GHCR retention - eval-agent-server
GHCR retention - eval-agent-server #27: Scheduled
10s main
Add SWE-Bench Pro benchmark support (#699)
Pre-commit checks #2229: Commit 2430744 pushed by neubig
1m 0s main
Add SWE-Bench Pro benchmark support
PR Review Evaluation #189: Pull request #699 closed by neubig
32s
eval: honor GPT-5 prompt when available
PR Review Evaluation #188: Pull request #686 closed by enyst
1m 34s
GHCR retention - eval-agent-server
GHCR retention - eval-agent-server #26: Scheduled
12s main
Add ProgramBench benchmark integration
PR Review Evaluation #187: Pull request #703 closed by neubig
1m 7s
Add ProgramBench benchmark integration
PR Review by OpenHands #506: Pull request #703 review_requested by neubig
3m 2s
Add ProgramBench benchmark integration
PR Review by OpenHands #505: Pull request #703 review_requested by neubig
2m 43s
Add ProgramBench benchmark integration
PR Review by OpenHands #504: Pull request #703 ready_for_review by neubig
3m 50s
GHCR retention - eval-agent-server
GHCR retention - eval-agent-server #25: Scheduled
11s main
GHCR retention - eval-agent-server
GHCR retention - eval-agent-server #24: Scheduled
7s main