mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-04-29 03:00:45 -04:00

Files

Xingyao Wang a6ba6c5277 Add SWEBench-docker eval (#2085 )

* add initial version of swebench-docker eval

* update the branch of git repo

* add poetry run

* download dev set too and pre-load f2p and p2p

* update eval infer script

* increase timeout

* add poetry run

* install swebench from our fork

* update script

* update loc

* support single instance debug

* replace \r\n from model patch

* replace eval docker from namespace xingyaoww

* update script to auto detect swe-bench format jsonl

* support eval infer on single instance id

* change log output dir to logs

* update summarise result script

* update README

* update readme

* tweak branch

* Update evaluation/swe_bench/scripts/eval/prep_eval.sh

Co-authored-by: Graham Neubig <neubig@gmail.com>

---------

Co-authored-by: Graham Neubig <neubig@gmail.com>

2024-06-10 19:30:40 +00:00

Dockerfile.builder

feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468 )

2024-05-15 16:15:55 +00:00

Dockerfile.builder_with_conda

feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468 )

2024-05-15 16:15:55 +00:00

Dockerfile.full_deps

feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468 )

2024-05-15 16:15:55 +00:00

Dockerfile.full.v1.1

Fix SWE-Bench evaluation due to setuptools version (#1995 )

2024-05-23 23:17:42 +08:00

Dockerfile.full.v1.2

Some SWE-Bench infer fixes and improvements (#2065 )

2024-05-26 10:02:11 +00:00

Dockerfile.full.v1.2.1

fix yet another swe_bench issue (#2069 )

2024-05-26 10:01:43 -07:00

pull_all_eval_docker.sh

Add SWEBench-docker eval (#2085 )

2024-06-10 19:30:40 +00:00

README.md

feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468 )

2024-05-15 16:15:55 +00:00

README.md

Docker Build Guide

Builder

This constructs docker container used for evaluation/swe_bench/scripts/prepare_swe_utils.sh that downloads the datasets.

pushd evaluation/swe_bench
# This builds base image with basic dependencies
docker build -t ghcr.io/opendevin/eval-swe-bench:builder -f ./scripts/docker/Dockerfile.builder .
# This builds image with SWE-Bench conda environment pre-installed
docker build -t ghcr.io/opendevin/eval-swe-bench:builder_with_conda -f ./scripts/docker/Dockerfile.builder_with_conda .