Compare commits

...

24 Commits

Author SHA1 Message Date
Xingyao Wang
74f3ac792f Merge commit 'f3b2085f9b31af0b3f24ee9a3269525c37ff63b3' into xw/fix-remote-runtime 2024-09-10 20:36:51 +00:00
Xingyao Wang
c40ca7b38d gets API key and Runtime from env var 2024-09-05 01:35:04 +00:00
Xingyao Wang
de0c6d0d86 Update evaluation/swe_bench/scripts/cleanup_remote_runtime.sh
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:47 +08:00
Xingyao Wang
0c18c3fae3 Update evaluation/swe_bench/scripts/cleanup_remote_runtime.sh
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:43 +08:00
Xingyao Wang
b3b410b675 Update evaluation/swe_bench/scripts/cleanup_remote_runtime.sh
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:38 +08:00
Xingyao Wang
f8c4896b87 Update evaluation/swe_bench/README.md
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:33 +08:00
Xingyao Wang
f43d4e8dd0 Update evaluation/swe_bench/README.md
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:23 +08:00
Xingyao Wang
c3a45dfab1 rename od to oh 2024-09-03 18:01:52 +00:00
Xingyao Wang
ed763093f3 update readme for cleanup 2024-09-03 17:51:27 +00:00
Xingyao Wang
0ace515a45 Merge branch 'main' into xw/fix-remote-runtime 2024-09-03 12:50:08 -05:00
Xingyao Wang
810e222217 update README 2024-09-03 17:47:23 +00:00
Xingyao Wang
832fa77e9d fix the cases when tag is too long 2024-09-03 17:45:46 +00:00
Xingyao Wang
5efc9aa1fa add script to cleanup remote runtime 2024-09-03 17:17:37 +00:00
Xingyao Wang
83f5186d71 set SWE-Bench default to run SWE-Bench lite 2024-09-03 00:46:20 +00:00
Xingyao Wang
0ae601d90b Merge commit 'd283420ac2ca3d4f98cdb2b32e44b85f919cbf63' into xw/improve-remote-runtime 2024-09-03 00:45:13 +00:00
Xingyao Wang
e4c0b86d30 update pbar 2024-09-02 18:53:15 +00:00
Xingyao Wang
92f4b18e09 handle the case when ret push is an generator 2024-09-02 18:51:00 +00:00
Xingyao Wang
8d4ab578cd Merge commit '1b92985a6a37f43319508d95eff2cb66e896e518' into xw/improve-remote-runtime 2024-09-02 18:45:52 +00:00
Xingyao Wang
1b92985a6a add push script 2024-09-02 18:45:37 +00:00
Xingyao Wang
2622f6714d increase timeout for remote runtime 2024-09-02 18:24:17 +00:00
Xingyao Wang
27e77baf1a update eval script and documentation 2024-09-02 14:17:25 -04:00
Xingyao Wang
eb7433af85 Merge commit '57ad0583b78f0f2b1d9a11027cac6d75c2c01fac' into xw/add-swebench-fullset 2024-09-02 14:09:01 -04:00
Xingyao Wang
a8e35be0b7 fix instance image list 2024-08-19 19:50:47 +00:00
Xingyao Wang
865d626046 feat: add SWE-bench fullset support 2024-08-19 19:47:03 +00:00

Diff Content Not Available