மனோஜ்குமார் பழனிச்சாமி
4afd85e591
Quick doc fix ( #2243 )
0.6.1
2024-06-04 07:00:44 +00:00
Leo
9ada36e30b
fix: restore python linting. ( #2228 )
...
* fix: restore python linting.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* update: extend the Python lint check to evaluation.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update evaluation/logic_reasoning/instruction.txt
---------
Signed-off-by: ifuryst <ifuryst@gmail.com >
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk >
2024-06-04 06:36:19 +00:00
Xida Ren (Cedar)
1314a09ce9
One-step launch instructions ( #2189 )
...
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com >
Co-authored-by: Robert Brennan <accounts@rbren.io >
2024-06-03 23:28:50 -07:00
Yufan Song
2374374778
Fix python environment in review-pr dogfood action ( #2237 )
...
This doesn't completely fix the bug; it fixes the python environment, and there is more to do to fix the issue.
2024-06-03 20:31:38 -07:00
Graham Neubig
44665ee235
Add docs for sharing feedback ( #2241 )
2024-06-04 07:37:33 +05:30
Graham Neubig
74e25920da
Transition to gcloud endpoint ( #2240 )
2024-06-04 01:20:04 +00:00
Leo
759f76fab5
Fix: Properly close Docker client in DockerExecBox to prevent resource leakage ( #2224 )
2024-06-04 09:05:41 +08:00
dependabot[bot]
87c679ff1a
Bump ruff from 0.4.6 to 0.4.7 ( #2233 )
2024-06-03 22:09:08 +00:00
finaltrip
05b84df9cb
chore: fix some comments ( #2234 )
...
Signed-off-by: finaltrip <finaltrip@qq.com >
2024-06-03 16:04:34 +00:00
Bibek Poudel
42671815a8
changed the welcome logo from 60vh to auto ( #2235 )
2024-06-03 15:50:58 +00:00
dependabot[bot]
c0e8e11cdc
Bump boto3 from 1.34.116 to 1.34.117 ( #2232 )
...
Bumps [boto3](https://github.com/boto/boto3 ) from 1.34.116 to 1.34.117.
- [Release notes](https://github.com/boto/boto3/releases )
- [Changelog](https://github.com/boto/boto3/blob/develop/CHANGELOG.rst )
- [Commits](https://github.com/boto/boto3/compare/1.34.116...1.34.117 )
---
updated-dependencies:
- dependency-name: boto3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:47:27 +08:00
dependabot[bot]
673cde31ba
Bump datasets from 2.19.1 to 2.19.2 ( #2231 )
...
Bumps [datasets](https://github.com/huggingface/datasets ) from 2.19.1 to 2.19.2.
- [Release notes](https://github.com/huggingface/datasets/releases )
- [Commits](https://github.com/huggingface/datasets/compare/2.19.1...2.19.2 )
---
updated-dependencies:
- dependency-name: datasets
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:47:10 +08:00
dependabot[bot]
5de0d5d041
Bump uvicorn from 0.30.0 to 0.30.1 ( #2230 )
...
Bumps [uvicorn](https://github.com/encode/uvicorn ) from 0.30.0 to 0.30.1.
- [Release notes](https://github.com/encode/uvicorn/releases )
- [Changelog](https://github.com/encode/uvicorn/blob/master/CHANGELOG.md )
- [Commits](https://github.com/encode/uvicorn/compare/0.30.0...0.30.1 )
---
updated-dependencies:
- dependency-name: uvicorn
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:46:45 +08:00
dependabot[bot]
adaa9c7c92
Bump e2b from 0.17.0 to 0.17.1 ( #2229 )
...
Bumps [e2b](https://github.com/e2b-dev/e2b ) from 0.17.0 to 0.17.1.
- [Release notes](https://github.com/e2b-dev/e2b/releases )
- [Commits](https://github.com/e2b-dev/e2b/compare/@e2b/python-sdk@0.17.0...@e2b/python-sdk@0.17.1 )
---
updated-dependencies:
- dependency-name: e2b
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 23:46:06 +08:00
tobitege
4b76f98b26
fix: keep colon part in model name for OpenRouter ( #2223 )
2024-06-03 17:11:44 +02:00
dependabot[bot]
47c12902de
Bump @types/node from 20.12.13 to 20.14.0 in /frontend ( #2226 )
...
Bumps [@types/node](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/node ) from 20.12.13 to 20.14.0.
- [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases )
- [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/node )
---
updated-dependencies:
- dependency-name: "@types/node"
dependency-type: direct:development
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Leo <ifuryst@gmail.com >
2024-06-03 15:05:11 +00:00
dependabot[bot]
bffa61adb4
Bump prettier from 3.2.5 to 3.3.0 in /frontend ( #2225 )
...
Bumps [prettier](https://github.com/prettier/prettier ) from 3.2.5 to 3.3.0.
- [Release notes](https://github.com/prettier/prettier/releases )
- [Changelog](https://github.com/prettier/prettier/blob/main/CHANGELOG.md )
- [Commits](https://github.com/prettier/prettier/compare/3.2.5...3.3.0 )
---
updated-dependencies:
- dependency-name: prettier
dependency-type: direct:development
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-03 22:51:51 +08:00
Graham Neubig
4476c250c5
Add consent dialog ( #2169 )
...
* Add consent dialog for sharing conversation histories
* Update
* Update to nextui modals
* Update
* More fixes to modal
* Updates
* Revert most changes to ChatInterface
* Update form
* Cleanup
* Update consent dialog
* Lint
* Fix toast
* Fix to be a select
* prettier
* Update frontend/src/components/chat/ChatInterface.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update frontend/src/components/modals/feedback/FeedbackModal.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update frontend/src/components/modals/feedback/FeedbackModal.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update frontend/src/components/chat/ChatInterface.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Fix
---------
Co-authored-by: OpenDevin <opendevin@opendevin.ai >
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
2024-06-03 14:33:53 +00:00
Boxuan Li
c9c5d71e5c
logger.py: Fix resource leak ( #2215 )
2024-06-03 05:57:54 +00:00
மனோஜ்குமார் பழனிச்சாமி
783a3545b5
Named docker app container ( #2202 )
2024-06-03 05:49:10 +00:00
Boxuan Li
538d1d85a2
evaluation: Reset configs in finally block ( #2214 )
2024-06-03 09:52:12 +08:00
Boxuan Li
1adbec6757
ssh_box: Fix Docker descriptor leak ( #2212 )
2024-06-03 01:22:30 +00:00
Boxuan Li
6fd8e8d5b8
Fix file descriptor leaks in agentskills ( #2209 )
2024-06-03 09:11:10 +08:00
tobitege
908c253897
German translations updated ( #2208 )
2024-06-02 16:49:40 -07:00
Boxuan Li
399e6fb1d1
ssh_box: Close containers before throwing exception ( #2206 )
2024-06-02 20:13:44 +00:00
tobitege
64f7749b63
Windows docs extended; some markdown lint fixes ( #2205 )
2024-06-02 14:59:54 +00:00
Graham Neubig
efd689293e
Bump docs to 0.6 ( #2193 )
...
* Bump docs to 0.6
* Update README.md
2024-06-02 06:34:40 -04:00
Ryan H. Tran
22e8fb39b1
add cost metrics to evaluation outputs for all benchmarks ( #2199 )
2024-06-02 08:28:00 +00:00
Yizhe Zhang
8d79c3edbc
modify the exiting logic and reward calculation, delete unused function ( #2198 )
2024-06-02 06:38:09 +00:00
tobitege
b0478d2880
fix: Fix husky install deprecated message (since v9 of husky) ( #2190 ) ( #2191 )
...
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com >
2024-06-02 02:46:32 +00:00
RainRat
ed6dcc8381
fix typos ( #2187 )
...
* fix typos
no functional change
* fix typos
2024-06-01 20:40:30 +00:00
Leo
2c231c57c9
Add supported benchmarks to evaluation README (AgentBench, BIRD, LogicReasoning) ( #2183 )
...
Signed-off-by: ifuryst <ifuryst@gmail.com >
2024-06-01 11:33:01 -04:00
மனோஜ்குமார் பழனிச்சாமி
4ece6fb3cc
Auto started persistent container ( #2151 )
2024-06-01 14:46:41 +00:00
மனோஜ்குமார் பழனிச்சாமி
f9c7c3a520
Refactored logging ( #2159 )
2024-06-01 14:31:35 +00:00
மனோஜ்குமார் பழனிச்சாமி
aee3d506e6
Restricted persistent sandbox to opendevin user only ( #2177 )
2024-06-01 14:18:03 +00:00
Graham Neubig
3b8a649b3d
Update slack invite link to make it valid ( #2182 )
...
* Update README.md
* Update CustomFooter.tsx
* Update about.md
* Update faq.tsx
* Update intro.mdx
2024-06-01 21:55:27 +08:00
Binyuan Hui
46dcf4bb3e
Support BIRD benchmark ( #2117 )
...
* update: change timeout from 10 to 30
* update: readme for bird evaluation
* Update evaluation/bird/run_infer.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com >
* Update evaluation/bird/README.md
Co-authored-by: Shimada666 <649940882@qq.com >
* Update evaluation/bird/README.md
Co-authored-by: Shimada666 <649940882@qq.com >
* Update evaluation/bird/run_infer.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com >
---------
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com >
Co-authored-by: Shimada666 <649940882@qq.com >
Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com >
2024-06-01 11:34:36 +00:00
Leo
78e003caf6
Fix: Avoid bash backtick eval in runtime commands. ( #2180 )
...
Signed-off-by: ifuryst <ifuryst@gmail.com >
2024-06-01 19:19:15 +08:00
Leo
be251b11de
Add AgentBench. ( #2012 )
...
* Add AgentBench.
* Load the datasets from HF.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Add helper functions.
* Add mock executor.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Add command to retrieve the agent's answer.
* Adjust the dataset.
* Refine test results.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Consolidate all AgentBench datasets and scripts into a single CSV dataset.
* Refactor dataset source.
* Update helper functions.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Fix the CRLF problem.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Separate the instance's workspace.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Add cleanup logic and error handling for sandbox closure.
* Normalized dataset
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update README.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update the prompt to capture the answer.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Refactor script execution paths to use absolute container workspace path.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update AgentBench README.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Delete useless functions.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update evaluation/agent_bench/README.md
* Add script to summarize test results from JSONL file in AgentBench
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Delete useless script and code.
Signed-off-by: ifuryst <ifuryst@gmail.com >
* Update evaluation/agent_bench/scripts/summarise_results.py
---------
Signed-off-by: ifuryst <ifuryst@gmail.com >
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk >
2024-06-01 07:58:14 +00:00
மனோஜ்குமார் பழனிச்சாமி
04d7354501
Detailed logs for ssh_box ( #2173 )
2024-06-01 11:40:22 +05:30
Boxuan Li
06e45afc75
Fix ssh box hung issue ( #2172 )
...
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com >
2024-06-01 05:31:32 +00:00
மனோஜ்குமார் பழனிச்சாமி
3a4dc5c68c
Initialized plugins only once for persistent sandboxes ( #2162 )
2024-06-01 10:46:09 +05:30
Boxuan Li
feaae0b7ac
Fix persist_sandbox in Makefile ( #2171 )
2024-06-01 12:50:31 +08:00
Rahul Anand
6e76f9a02f
Fix: Codebase font fixed, and other fixes for #2138 PR ( #2154 )
...
* fix #2123
* Docs enhancement
* Update docs/src/components/CustomFooter.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update docs/src/components/CustomFooter.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* Update docs/src/pages/faq.tsx
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
* update
* fix for #2138 pr
* Update docs/src/components/CustomFooter.tsx
Co-authored-by: Graham Neubig <neubig@gmail.com >
* Update docs/src/components/HomepageHeader/HomepageHeader.tsx
Co-authored-by: Graham Neubig <neubig@gmail.com >
* Update docs/src/components/Welcome/Welcome.tsx
Co-authored-by: Graham Neubig <neubig@gmail.com >
* Update docs/src/css/custom.css
Co-authored-by: Graham Neubig <neubig@gmail.com >
---------
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com >
Co-authored-by: Graham Neubig <neubig@gmail.com >
2024-06-01 02:22:44 +00:00
மனோஜ்குமார் பழனிச்சாமி
bf24a0b5c0
Fixed makefile ( #2168 )
2024-06-01 03:35:43 +05:30
Aaron Xia
42c6b506b5
Lazy launching BrowseEnv / making BrowseEnv optional ( #2155 )
...
* feat: lazy launching browser; browser optional for different agents.
* style: lint
* fix: integration test failure caused by the browser not being started.
* fix: failures when running via the CLI and in integration tests.
* fix: lint
* fix: lint
---------
Co-authored-by: Graham Neubig <neubig@gmail.com >
2024-05-31 16:40:42 -04:00
மனோஜ்குமார் பழனிச்சாமி
8413f147c9
Added logs ( #2153 )
...
* Logged about config file
* Logged Browser env
* Update opendevin/core/config.py
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
* Update opendevin/core/config.py
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
---------
Co-authored-by: Aleksandar <isavitaisa@gmail.com >
2024-05-31 16:04:36 -04:00
Ryan H. Tran
01296ff79d
Add remaining subsets for MINT benchmark ( #2142 )
...
* add MMLU subset
* add theoremqa subset
* remove redundant packages from requirements.txt, adjust prompts, handle the case where gpt3.5 proposes a wrong answer after a correct answer
* add MBPP subset
* add humaneval subset
* update README
* exit actively after the agent finishes the task
2024-05-31 20:04:13 +00:00
மனோஜ்குமார் பழனிச்சாமி
f3f5768b4f
Install chromium only once ( #2100 )
...
* install chromium only once
* Update Makefile
* Update Makefile
2024-05-31 15:39:10 -04:00
dependabot[bot]
9a441ea8f7
Bump boto3 from 1.34.115 to 1.34.116 ( #2164 )
...
Bumps [boto3](https://github.com/boto/boto3 ) from 1.34.115 to 1.34.116.
- [Release notes](https://github.com/boto/boto3/releases )
- [Changelog](https://github.com/boto/boto3/blob/develop/CHANGELOG.rst )
- [Commits](https://github.com/boto/boto3/compare/1.34.115...1.34.116 )
---
updated-dependencies:
- dependency-name: boto3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-31 15:13:33 -04:00