finaltrip
05b84df9cb
chore: fix some comments ( #2234 )
...
Signed-off-by: finaltrip <finaltrip@qq.com >
2024-06-03 16:04:34 +00:00
Bibek Poudel
42671815a8
changed the welcome logo from 60vh to auto ( #2235 )
2024-06-03 15:50:58 +00:00
dependabot[bot]
c0e8e11cdc
Bump boto3 from 1.34.116 to 1.34.117 ( #2232 )
...
Bumps [boto3](https://github.com/boto/boto3 ) from 1.34.116 to 1.34.117.
- [Release notes](https://github.com/boto/boto3/releases )
- [Changelog](https://github.com/boto/boto3/blob/develop/CHANGELOG.rst )
- [Commits](https://github.com/boto/boto3/compare/1.34.116...1.34.117 )
---
updated-dependencies:
- dependency-name: boto3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:47:27 +08:00
dependabot[bot]
673cde31ba
Bump datasets from 2.19.1 to 2.19.2 ( #2231 )
...
Bumps [datasets](https://github.com/huggingface/datasets ) from 2.19.1 to 2.19.2.
- [Release notes](https://github.com/huggingface/datasets/releases )
- [Commits](https://github.com/huggingface/datasets/compare/2.19.1...2.19.2 )
---
updated-dependencies:
- dependency-name: datasets
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:47:10 +08:00
dependabot[bot]
5de0d5d041
Bump uvicorn from 0.30.0 to 0.30.1 ( #2230 )
...
Bumps [uvicorn](https://github.com/encode/uvicorn ) from 0.30.0 to 0.30.1.
- [Release notes](https://github.com/encode/uvicorn/releases )
- [Changelog](https://github.com/encode/uvicorn/blob/master/CHANGELOG.md )
- [Commits](https://github.com/encode/uvicorn/compare/0.30.0...0.30.1 )
---
updated-dependencies:
- dependency-name: uvicorn
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:46:45 +08:00
dependabot[bot]
adaa9c7c92
Bump e2b from 0.17.0 to 0.17.1 ( #2229 )
...
Bumps [e2b](https://github.com/e2b-dev/e2b ) from 0.17.0 to 0.17.1.
- [Release notes](https://github.com/e2b-dev/e2b/releases )
- [Commits](https://github.com/e2b-dev/e2b/compare/@e2b/python-sdk@0.17.0...@e2b/python-sdk@0.17.1 )
---
updated-dependencies:
- dependency-name: e2b
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:46:06 +08:00
tobitege
4b76f98b26
fix: keep colon part in model name for OpenRouter ( #2223 )
2024-06-03 17:11:44 +02:00
dependabot[bot]
47c12902de
Bump @types/node from 20.12.13 to 20.14.0 in /frontend ( #2226 )
...
Bumps [@types/node](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/node ) from 20.12.13 to 20.14.0.
- [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases )
- [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/node )
---
updated-dependencies:
- dependency-name: "@types/node"
dependency-type: direct:development
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Leo <ifuryst@gmail.com >
2024-06-03 15:05:11 +00:00
dependabot[bot]
bffa61adb4
Bump prettier from 3.2.5 to 3.3.0 in /frontend ( #2225 )
...
Bumps [prettier](https://github.com/prettier/prettier ) from 3.2.5 to 3.3.0.
- [Release notes](https://github.com/prettier/prettier/releases )
- [Changelog](https://github.com/prettier/prettier/blob/main/CHANGELOG.md )
- [Commits](https://github.com/prettier/prettier/compare/3.2.5...3.3.0 )
---
updated-dependencies:
- dependency-name: prettier
dependency-type: direct:development
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 22:51:51 +08:00
Graham Neubig
4476c250c5
Add consent dialog ( #2169 )
...
* Add consent dialog for sharing conversation histories
* Update
* Update to nextui modals
* Update
* More fixes to modal
* Updates
* Revert most changes to ChatInterface
* Update form
* Cleanup
* Update consent dialog
* Lint
* Fix toast
* Fix to be a select
* prettier
* Update frontend/src/components/chat/ChatInterface.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update frontend/src/components/modals/feedback/FeedbackModal.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update frontend/src/components/modals/feedback/FeedbackModal.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update frontend/src/components/chat/ChatInterface.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Fix
---------
Co-authored-by: OpenDevin <opendevin@opendevin.ai >
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
2024-06-03 14:33:53 +00:00
Boxuan Li
c9c5d71e5c
logger.py: Fix resource leak ( #2215 )
2024-06-03 05:57:54 +00:00
மனோஜ்குமார் பழனிச்சாமி
783a3545b5
Named docker app container ( #2202 )
2024-06-03 05:49:10 +00:00
Boxuan Li
538d1d85a2
evaluation: Reset configs in finally block ( #2214 )
2024-06-03 09:52:12 +08:00
Boxuan Li
1adbec6757
ssh_box: Fix Docker descriptor leak ( #2212 )
2024-06-03 01:22:30 +00:00
Boxuan Li
6fd8e8d5b8
Fix file descriptor leaks in agentskills ( #2209 )
2024-06-03 09:11:10 +08:00
tobitege
908c253897
German translations updated ( #2208 )
2024-06-02 16:49:40 -07:00
Boxuan Li
399e6fb1d1
ssh_box: Close containers before throwing exception ( #2206 )
2024-06-02 20:13:44 +00:00
tobitege
64f7749b63
Windows docs extended; some markdown lint fixes ( #2205 )
2024-06-02 14:59:54 +00:00
Graham Neubig
efd689293e
Bump docs to 0.6 ( #2193 )
...
* Bump docs to 0.6
* Update README.md
2024-06-02 06:34:40 -04:00
Ryan H. Tran
22e8fb39b1
add cost metrics to evaluation outputs for all benchmarks ( #2199 )
2024-06-02 08:28:00 +00:00
Yizhe Zhang
8d79c3edbc
modify the exiting logic and reward calculation, delete unused function ( #2198 )
2024-06-02 06:38:09 +00:00
tobitege
b0478d2880
fix: Fix husky install deprecated message (since v9 of husky) ( #2190 ) ( #2191 )
...
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com >
2024-06-02 02:46:32 +00:00
RainRat
ed6dcc8381
fix typos ( #2187 )
...
* fix typos
no functional change
* fix typos
2024-06-01 20:40:30 +00:00
Leo
2c231c57c9
Add supported benchmarks to evaluation README (AgentBench, BIRD, LogicReasoning) ( #2183 )
...
Signed-off-by: ifuryst <ifuryst@gmail.com >
2024-06-01 11:33:01 -04:00
மனோஜ்குமார் பழனிச்சாமி
4ece6fb3cc
Auto started persistent container ( #2151 )
2024-06-01 14:46:41 +00:00
மனோஜ்குமார் பழனிச்சாமி
f9c7c3a520
Refactored logging ( #2159 )
2024-06-01 14:31:35 +00:00
மனோஜ்குமார் பழனிச்சாமி
aee3d506e6
Restricted persistent sandbox to opendevin user only ( #2177 )
2024-06-01 14:18:03 +00:00
Graham Neubig
3b8a649b3d
Update slack invite link to make it valid ( #2182 )
...
* Update README.md
* Update CustomFooter.tsx
* Update about.md
* Update faq.tsx
* Update intro.mdx
2024-06-01 21:55:27 +08:00
Binyuan Hui
46dcf4bb3e
Support BIRD benchmark ( #2117 )
...
* update: change timeout from 10 to 30
* update: readme for bird evaluation
* Update evaluation/bird/run_infer.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com >
* Update evaluation/bird/README.md
Co-authored-by: Shimada666 <649940882@qq.com >
* Update evaluation/bird/README.md
Co-authored-by: Shimada666 <649940882@qq.com >
* Update evaluation/bird/run_infer.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com >
---------
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com >
Co-authored-by: Shimada666 <649940882@qq.com >
Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com >
2024-06-01 11:34:36 +00:00
Leo
78e003caf6
Fix: Avoid bash backtick eval in runtime commands. ( #2180 )
...
Signed-off-by: ifuryst <ifuryst@gmail.com >
2024-06-01 19:19:15 +08:00
Leo
be251b11de
Add AgentBench. ( #2012 )
...
* Add AgentBench.
* Load the datasets from HF.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Add helper functions.
* Add mock executor.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Add retriv agent answer cmd.
* Adjust the dataset.
* Refine test results.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Consolidate all AgentBench datasets and scripts into a single CSV dataset.
* Refactor dataset source.
* Update helper functions.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Fix the CRLF problem.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Separate the instance's workspace.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Add cleanup logic and error handling for sandbox closure.
* Normalized dataset
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update README.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update the prompt to capture the answer.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Refactor script execution paths to use absolute container workspace path.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update AgentBench README.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Delete useless functions.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update evaluation/agent_bench/README.md
* Add script to summarize test results from JSONL file in AgentBench
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Delete useless script and codes.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update evaluation/agent_bench/scripts/summarise_results.py
---------
Signed-off-by: ifuryst <ifuryst@gmail.com >
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk >
2024-06-01 07:58:14 +00:00
மனோஜ்குமார் பழனிச்சாமி
04d7354501
Detailed logs for ssh_box ( #2173 )
2024-06-01 11:40:22 +05:30
Boxuan Li
06e45afc75
Fix ssh box hung issue ( #2172 )
...
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com >
2024-06-01 05:31:32 +00:00
மனோஜ்குமார் பழனிச்சாமி
3a4dc5c68c
Initialized plugins only once for persistent sandboxes ( #2162 )
2024-06-01 10:46:09 +05:30
Boxuan Li
feaae0b7ac
Fix persist_sandbox in Makefile ( #2171 )
2024-06-01 12:50:31 +08:00
Rahul Anand
6e76f9a02f
Fix: Codebase font fixed, and other fixes for #2138 PR ( #2154 )
...
* fix #2123
* Docs enhancement
* Update docs/src/components/CustomFooter.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update docs/src/components/CustomFooter.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update docs/src/pages/faq.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* update
* fix for #2138 pr
* Update docs/src/components/CustomFooter.tsx
Co-authored-by: Graham Neubig <neubig@gmail.com >
* Update docs/src/components/HomepageHeader/HomepageHeader.tsx
Co-authored-by: Graham Neubig <neubig@gmail.com >
* Update docs/src/components/Welcome/Welcome.tsx
Co-authored-by: Graham Neubig <neubig@gmail.com >
* Update docs/src/css/custom.css
Co-authored-by: Graham Neubig <neubig@gmail.com >
---------
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
Co-authored-by: Graham Neubig <neubig@gmail.com >
2024-06-01 02:22:44 +00:00
மனோஜ்குமார் பழனிச்சாமி
bf24a0b5c0
Fixed makefile ( #2168 )
2024-06-01 03:35:43 +05:30
Aaron Xia
42c6b506b5
Lazy launching BrowseEnv / making BrowseEnv optional ( #2155 )
...
* feat: lazy launching browser; browser optional for diffrent agents.
* style: lint
* fix: integration test fail due to browser not started.
* fix: run by cli and integration test failed.
* fix: lint
* fix: lint
---------
Co-authored-by: Graham Neubig <neubig@gmail.com >
2024-05-31 16:40:42 -04:00
மனோஜ்குமார் பழனிச்சாமி
8413f147c9
Added logs ( #2153 )
...
* Logged about config file
* Logged Browser env
* Update opendevin/core/config.py
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
* Update opendevin/core/config.py
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
---------
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
2024-05-31 16:04:36 -04:00
Ryan H. Tran
01296ff79d
Add remaining subsets for MINT benchmark ( #2142 )
...
* add MMLU subset
* add theoremqa subset
* remove redundant packages from requirements.txt, adjust prompts, handle gpt3.5 propose a wrong answer after a correct answer
* add MBPP subset
* add humaneval subset
* update README
* exit actively after the agent finishes the task
2024-05-31 20:04:13 +00:00
மனோஜ்குமார் பழனிச்சாமி
f3f5768b4f
Install chromium only once ( #2100 )
...
* install chromium only once
* Update Makefile
* Update Makefile
2024-05-31 15:39:10 -04:00
dependabot[bot]
9a441ea8f7
Bump boto3 from 1.34.115 to 1.34.116 ( #2164 )
...
Bumps [boto3](https://github.com/boto/boto3 ) from 1.34.115 to 1.34.116.
- [Release notes](https://github.com/boto/boto3/releases )
- [Changelog](https://github.com/boto/boto3/blob/develop/CHANGELOG.rst )
- [Commits](https://github.com/boto/boto3/compare/1.34.115...1.34.116 )
---
updated-dependencies:
- dependency-name: boto3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-31 15:13:33 -04:00
Graham Neubig
6596d5c799
Fix: Feedback should be sent through the backend to avoid CORS issues ( #2046 )
...
* Fix: Feedback should be sent through the backend to avoid CORS issues
* Update
* Fix merge error
* Revert unnecessary change
* Lint
* Moved to services
* Fixed bugs
---------
Co-authored-by: OpenDevin <opendevin@opendevin.ai >
2024-05-31 15:00:09 -04:00
dependabot[bot]
6aec3d789e
Bump litellm from 1.39.3 to 1.39.5 ( #2163 )
...
Bumps [litellm](https://github.com/BerriAI/litellm ) from 1.39.3 to 1.39.5.
- [Release notes](https://github.com/BerriAI/litellm/releases )
- [Commits](https://github.com/BerriAI/litellm/compare/v1.39.3...v1.39.5 )
---
updated-dependencies:
- dependency-name: litellm
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-31 19:36:04 +02:00
Graham Neubig
7a2122ebc2
Default to gpt-4o ( #2158 )
...
* Default to gpt-4o
* Fix default
0.6.0
2024-05-31 14:44:07 +00:00
dependabot[bot]
a7b19a0048
Bump @nextui-org/react from 2.4.0 to 2.4.1 in /frontend ( #2161 )
...
Bumps [@nextui-org/react](https://github.com/nextui-org/nextui/tree/HEAD/packages/core/react ) from 2.4.0 to 2.4.1.
- [Release notes](https://github.com/nextui-org/nextui/releases )
- [Changelog](https://github.com/nextui-org/nextui/blob/canary/packages/core/react/CHANGELOG.md )
- [Commits](https://github.com/nextui-org/nextui/commits/@nextui-org/react@2.4.1/packages/core/react )
---
updated-dependencies:
- dependency-name: "@nextui-org/react"
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-31 14:32:21 +00:00
dependabot[bot]
e6c8e1c9d2
Bump framer-motion from 11.2.9 to 11.2.10 in /frontend ( #2160 )
...
Bumps [framer-motion](https://github.com/framer/motion ) from 11.2.9 to 11.2.10.
- [Changelog](https://github.com/framer/motion/blob/main/CHANGELOG.md )
- [Commits](https://github.com/framer/motion/compare/v11.2.9...v11.2.10 )
---
updated-dependencies:
- dependency-name: framer-motion
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-31 14:30:14 +00:00
Boxuan Li
4d14b44a9a
SWE-bench: Add summarise utility script to view passed/failed task IDs ( #2137 )
...
* SWE-bench: Add summarise utility script to view passed/failed task IDs
* Fix typos
* Move file
* Prettify
* Use merged jsonl file
2024-05-31 12:32:17 +08:00
Boxuan Li
f188abd7a3
Delete evaluation outputs files ( #2152 )
...
* Delete evaluation outputs files
* Fix README
2024-05-31 03:12:27 +00:00
மனோஜ்குமார் பழனிச்சாமி
961c96a2a1
Added ssh_password to config setup ( #2139 )
...
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
2024-05-31 07:26:16 +05:30