Compare commits

...

24 Commits

Author SHA1 Message Date
Xingyao Wang 74f3ac792f Merge commit 'f3b2085f9b31af0b3f24ee9a3269525c37ff63b3' into xw/fix-remote-runtime 2024-09-10 20:36:51 +00:00
Xingyao Wang c40ca7b38d gets API key and Runtime from env var 2024-09-05 01:35:04 +00:00
Xingyao Wang de0c6d0d86 Update evaluation/swe_bench/scripts/cleanup_remote_runtime.sh
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:47 +08:00
Xingyao Wang 0c18c3fae3 Update evaluation/swe_bench/scripts/cleanup_remote_runtime.sh
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:43 +08:00
Xingyao Wang b3b410b675 Update evaluation/swe_bench/scripts/cleanup_remote_runtime.sh
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:38 +08:00
Xingyao Wang f8c4896b87 Update evaluation/swe_bench/README.md
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:33 +08:00
Xingyao Wang f43d4e8dd0 Update evaluation/swe_bench/README.md
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-05 09:33:23 +08:00
Xingyao Wang c3a45dfab1 rename od to oh 2024-09-03 18:01:52 +00:00
Xingyao Wang ed763093f3 update readme for cleanup 2024-09-03 17:51:27 +00:00
Xingyao Wang 0ace515a45 Merge branch 'main' into xw/fix-remote-runtime 2024-09-03 12:50:08 -05:00
Xingyao Wang 810e222217 update README 2024-09-03 17:47:23 +00:00
Xingyao Wang 832fa77e9d fix the cases when tag is too long 2024-09-03 17:45:46 +00:00
Xingyao Wang 5efc9aa1fa add script to cleanup remote runtime 2024-09-03 17:17:37 +00:00
Xingyao Wang 83f5186d71 set SWE-Bench default to run SWE-Bench lite 2024-09-03 00:46:20 +00:00
Xingyao Wang 0ae601d90b Merge commit 'd283420ac2ca3d4f98cdb2b32e44b85f919cbf63' into xw/improve-remote-runtime 2024-09-03 00:45:13 +00:00
Xingyao Wang e4c0b86d30 update pbar 2024-09-02 18:53:15 +00:00
Xingyao Wang 92f4b18e09 handle the case when ret push is an generator 2024-09-02 18:51:00 +00:00
Xingyao Wang 8d4ab578cd Merge commit '1b92985a6a37f43319508d95eff2cb66e896e518' into xw/improve-remote-runtime 2024-09-02 18:45:52 +00:00
Xingyao Wang 1b92985a6a add push script 2024-09-02 18:45:37 +00:00
Xingyao Wang 2622f6714d increase timeout for remote runtime 2024-09-02 18:24:17 +00:00
Xingyao Wang 27e77baf1a update eval script and documentation 2024-09-02 14:17:25 -04:00
Xingyao Wang eb7433af85 Merge commit '57ad0583b78f0f2b1d9a11027cac6d75c2c01fac' into xw/add-swebench-fullset 2024-09-02 14:09:01 -04:00
Xingyao Wang a8e35be0b7 fix instance image list 2024-08-19 19:50:47 +00:00
Xingyao Wang 865d626046 feat: add SWE-bench fullset support 2024-08-19 19:47:03 +00:00

Diff Content Not Available