mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-01-08 06:23:59 -05:00

Go to file

Xingyao Wang b2fdb963b6 Add detailed tutorial for adding new evaluation benchmarks (#1827 )

* Add detailed tutorial for adding new evaluation benchmarks

* update tutorial, fix typo, and log observation to the cmdline

* fix url

* Update evaluation/TUTORIAL.md

* Update evaluation/TUTORIAL.md

* Update evaluation/TUTORIAL.md

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* Update evaluation/TUTORIAL.md

Co-authored-by: Graham Neubig <neubig@gmail.com>

* simplify readme and add comments to the actual code

* Fix typo in evaluation/TUTORIAL.md

* Fix typo in evaluation/swe_bench/run_infer.py

* Fix another typo in evaluation/swe_bench/run_infer.py

* Update TUTORIAL.md

* Set host net work to false for SWEBench

* Update evaluation/TUTORIAL.md

Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>

* Update evaluation/TUTORIAL.md

Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>

* Update evaluation/TUTORIAL.md

Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>

* Update evaluation/TUTORIAL.md

Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>

---------

Co-authored-by: OpenDevin <opendevin@opendevin.ai>
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>

2024-05-18 13:40:53 -04:00

.github

Update dependabot.yml (#1876 )

2024-05-18 16:17:39 +00:00

agenthub

make CodeAct paper link correct (#1870 )

2024-05-18 03:54:10 +00:00

containers

Update Dockerfile to assign workspace directory properly to user (#1830 )

2024-05-17 20:11:23 -04:00

dev_config/python

ci(docs): only generate autogen python docs on deploy (#1501 )

2024-05-02 18:29:41 +00:00

docs

Fix CodeAct paper link (#1784 )

2024-05-14 17:40:07 +00:00

evaluation

Add detailed tutorial for adding new evaluation benchmarks (#1827 )

2024-05-18 13:40:53 -04:00

frontend

Bump monaco-editor from 0.47.0 to 0.48.0 in /frontend (#1879 )

2024-05-18 13:38:26 -04:00

opendevin

Add detailed tutorial for adding new evaluation benchmarks (#1827 )

2024-05-18 13:40:53 -04:00

tests

Auto restarted Jupyter kernel (#1808 )

2024-05-18 08:40:31 +05:30

.dockerignore

Revamp docker build process (#1121 )

2024-04-15 19:10:38 -04:00

.gitattributes

lint: simplify hooks already covered by Ruff (#1204 )

2024-04-27 11:32:14 +00:00

.gitignore

feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468 )

2024-05-15 16:15:55 +00:00

CODE_OF_CONDUCT.md

Rename CodeOfConduct.md to CODE_OF_CONDUCT.md (#1665 )

2024-05-09 10:38:24 +00:00

CONTRIBUTING.md

Add integration test framework with mock llm (#1301 )

2024-04-25 10:56:53 -04:00

Development.md

Revert ssh box implemetation, fix multi-line command issues and add unit tests (#1460 )

2024-04-30 12:46:35 -04:00

LICENSE

Create MIT LICENSE (#8 )

2024-03-16 22:46:04 +08:00

Makefile

feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468 )

2024-05-15 16:15:55 +00:00

poetry.lock

Bump boto3 from 1.34.106 to 1.34.108 (#1887 )

2024-05-18 13:37:53 -04:00

pydoc-markdown.yml

docs(docs): start implementing docs website (#1372 )

2024-04-29 10:00:51 -07:00

pyproject.toml

Bump e2b from 0.14.14 to 0.17.0 (#1883 )

2024-05-18 13:13:45 -04:00

pytest.ini

Refactor integration test framework and relieve the pain of regeneration (#1818 )

2024-05-16 08:30:29 -07:00

README.md

use -it and pull=always for docker (#1769 )

2024-05-13 19:17:57 -04:00

README.md

OpenDevin: Code Less, Make More

Welcome to OpenDevin, a platform for autonomous software engineers, powered by AI and LLMs.

OpenDevin agents collaborate with human developers to write code, fix bugs, and ship features.

⚡ Quick Start

You can run OpenDevin with Docker. It works best with the most recent version of Docker, 26.0.0.

#The directory you want OpenDevin to modify. MUST be an absolute path!
export WORKSPACE_BASE=$(pwd)/workspace;

docker run \
    -it \
    --pull=always \
    -e SANDBOX_USER_ID=$(id -u) \
    -e WORKSPACE_MOUNT_PATH=$WORKSPACE_BASE \
    -v $WORKSPACE_BASE:/opt/workspace_base \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    ghcr.io/opendevin/opendevin:0.5

🚀 Documentation

To learn more about the project, and for tips on using OpenDevin, check out our documentation.

There you'll find resources on how to use different LLM providers (like ollama and Anthropic's Claude), troubleshooting resources, and advanced configuration options.

🤝 How to Contribute

OpenDevin is a community-driven project, and we welcome contributions from everyone. Whether you're a developer, a researcher, or simply enthusiastic about advancing the field of software engineering with AI, there are many ways to get involved:

Code Contributions: Help us develop new agents, core functionality, the frontend and other interfaces, or sandboxing solutions.
Research and Evaluation: Contribute to our understanding of LLMs in software engineering, participate in evaluating the models, or suggest improvements.
Feedback and Testing: Use the OpenDevin toolset, report bugs, suggest features, or provide feedback on usability.

For details, please check CONTRIBUTING.md.

🤖 Join Our Community

Whether you're a developer, a researcher, or simply enthusiastic about OpenDevin, we'd love to have you in our community. Let's make software engineering better together!

Slack workspace - Here we talk about research, architecture, and future development.
Discord server - This is a community-run server for general discussion, questions, and feedback.

📈 Progress

📜 License

Distributed under the MIT License. See LICENSE for more information.

Languages

Python 77.7%

TypeScript 19.7%

Shell 1.2%

Jinja 0.8%

JavaScript 0.3%

Other 0.2%