Files
OpenHands/evaluation/swe_bench/scripts/docker/push_eval_docker.sh
Xingyao Wang 6a0ffc5c61 [Evaluation] Use the latest official SWE-Bench Dockerization for evaluation (#2728)
* add newline after patch to fix patch apply

* new swebench wip

* add newline after patch to fix patch apply

* only add newline if not empty

* update swebench source and update

* update gitignore for swebench eval

* update old prep_eval

* update gitignore

* add scripts for push and pull swebench images

* update eval_infer.sh

* update eval_infer for new docker workflow

* update script to create markdown report based on report.json

* update eval infer to use update output

* update readme

* only move result to folder if running whole file

* remove set-x

* update conversion script

* Update evaluation/swe_bench/README.md

* Update evaluation/swe_bench/README.md

* Update evaluation/swe_bench/README.md

* make sure last line end with newline

* switch to an fix attempt branch of swebench

* Update evaluation/swe_bench/README.md

* Update evaluation/swe_bench/README.md

---------

Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
2024-07-01 23:58:30 +00:00

31 lines
949 B
Bash

#!/bin/bash
# This is ONLY used for pushing docker images created by https://github.com/princeton-nlp/SWE-bench/blob/main/docs/20240627_docker/README.md
DOCKER_NAMESPACE=$1
# check if DOCKER_NAMESPACE is set
if [ -z "$DOCKER_NAMESPACE" ]; then
echo "Usage: $0 <docker_namespace>"
exit 1
fi
# target namespace
image_list=$(docker image ls --format '{{.Repository}}:{{.Tag}}' | grep sweb | grep -v $DOCKER_NAMESPACE)
# There are three tiers of images
# - base
# - env
# - eval (instance level)
for image in $image_list; do
echo "=============================="
echo "Image: $image"
# rename image by replace "__" with "_s_" to comply with docker naming convention
new_image_name=${image//__/_s_}
docker tag $image $DOCKER_NAMESPACE/$new_image_name
echo "Tagged $image to $DOCKER_NAMESPACE/$new_image_name"
docker push $DOCKER_NAMESPACE/$new_image_name
echo "Pushed $DOCKER_NAMESPACE/$new_image_name"
done