OpenHands

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-01-09 14:57:59 -05:00

Author	SHA1	Message	Date
Frank Xu	a84d19f03c	Enable CodeAct agents with browsing, and also enable arbitrary BrowserGym action support (#1807 ) * enable browsing in codeact, and arbitrary browsergym DSL support * fix * fix unit test case * update frontend for the new interactive browsing action * bump ver * Fix integration tests --------- Co-authored-by: OpenDevinBot <bot@opendevin.com>	2024-05-15 11:59:58 -04:00
Boxuan Li	6714000b2c	CodeActAgent: Fix iteration reminder (#1803 ) This PR includes three changes: 1) Iteration reminder should start with MAX_ITERATIONS from config rather than default value 100 2) In the first prompt, we should tell the LLM it has `MAX_ITERATIONS - 1` turns left, rather than `MAX_ITERATIONS - 2` 3) Remove legacy ITERATION_REMINDER config	2024-05-15 13:48:47 +08:00
Graham Neubig	3cef8ee187	Add GitHub prompt to CodeAct (#1792 ) * Added github to CodeAct * More codeact * Simplify prompt * Modify codeact prompt * fix integration test for CodeAct * yet another integration test fix for codeact * fix plugin use in jupyter * update edit tests * fix jupyter plugin potential port conflict * fix test ipython with latest ipython fix * update integration test * wait a bit for jupyter execution * add one unit tests for sandbox * fix integration test --------- Co-authored-by: OpenDevinBot <bot@opendevin.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-14 21:25:21 +00:00
Robert Brennan	beb74a19f6	Use event stream for the runtime (#1776 ) * rebuild PR from scratch * fix max_iter * regenerate tests * cut down on history * Update opendevin/controller/agent_controller.py * regenerate tests * revert swe agent * revert some codeact chagnes * regenerate tests * add source to dict * only add source if not none * try to fix coverage issue * lock * add gevent	2024-05-14 13:35:25 +00:00
Robert Brennan	82a798990c	refactor remind_iterations (#1760 ) * refactor remind_iterations * regenerate tests * concatenate iteration message * fix merge issues * update integration tests	2024-05-14 08:27:12 -04:00
Robert Brennan	b028bd46bb	Use messages to drive tasks (#1688 ) * finish is working * start reworking main_goal * remove main_goal from microagents * remove main_goal from other agents * fix issues * revert codeact line * make plan a subclass of task * fix frontend for new plan setup * lint * fix type * more lint * fix build issues * fix codeact mgs * fix edge case in regen script * fix task validation errors * regenerate integration tests * fix up tests * fix sweagent * revert codeact prompt * update integration tests * update integration tests * handle loading state * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/agent_controller.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/state/plan.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * update docs * regenerate tests * remove none from state type * revert test files * update integration tests * rename plan to root_task * revert plugin perms * regen integration tests * tweak integration script * prettier * fix test * set workspace up for regeneration * regenerate tests * Change directory of copy * Updated tests * Disable PlannerAgent test * Fix listen * Updated prompts * Disable planner again * Make codecov more lenient * Update agenthub/README.md * Update opendevin/server/README.md * re-enable planner tests * finish top level tasks * regen planner * fix root task factory --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-13 23:14:15 +00:00
Boxuan Li	316a772849	CodeAct: Emphasize open before edit (#1709 ) Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-05-11 12:20:14 -07:00
Boxuan Li	bde12f4a09	CodeActAgent: Fix hack for multiple edits in same command (#1684 ) * Fix edit hack for multiple edits in same command This PR changes ([\s\S]) to ([\s\S]?) to make the capturing group non-greedy. This change ensures that the regex captures the smallest set of characters that extends up to the first end_of_edit it encounters, rather than extending across multiple edit commands. Without the fix, a bash command consisting of multiple edits would be corrupt and lead to unexpected edit results.	2024-05-10 23:32:09 -07:00
Bart Shappee	78cd2e5b47	fix: Issue where CodeAct agent was trying to log cost on local llm and throwing Undefined Model execption out of litellm (#1666 ) * fix: Issue where CodeAct agent was trying to log cost on local llm and throwing Undefined Model execption out of litellm * Review Feedback * Missing None Check * Review feedback and improved error handling --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-05-10 13:57:37 -04:00
Ammar Ladhani	564739d1db	Removes upper-case 'List' imports from typing and replaces them with lower-case 'list' (#1676 )	2024-05-09 19:05:46 -04:00
Xingyao Wang	21fe8dc1eb	Align codeact with swebench eval (#1612 ) * align codeact agent with the slight adjustment on eval branch * update integration test for new prompt * Regenerate test artifacts for CodeActAgent --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-09 00:42:07 -07:00
Ikko Eltociear Ashimine	4cc462cc18	Update codeact_agent.py (#1653 ) splited -> splitted	2024-05-08 13:55:40 -04:00
Robert Brennan	242c4a0df6	Remove extra message actions (#1608 ) * remove extra actions * remove message observations * support null obs * handle null obs * fix frontend for changes * fix the way messages flow to the UI * change think to message * add regen script * regenerate all integration tests * change task * remove gh test * fix messages * fix tests * help agent exit after hitting max iter * Update opendevin/events/observation/success.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-07 21:13:08 +00:00
Xingyao Wang	356caf0960	Fix the issue of newly import package by including instruction for Kernel restart (#1609 ) * fix the issue of newly import package by including instruction for kernel restart * fix integration test for new prompt * fix integration yet again	2024-05-07 06:11:05 +08:00
Xingyao Wang	3886c51217	Update CodeAct README.md (#1534 ) * Update README.md * update documentation in docstring	2024-05-03 02:32:53 +00:00
Robert Brennan	fadcdc117e	Migrate to new folder structure in preparation for refactor (#1531 ) * fix up folder structure * update docs * fix imports * fix imports * fix imoprt * fix imports * fix imports * fix imports * fix test import * fix tests * fix main import	2024-05-02 17:01:54 +00:00
Robert Brennan	ce7c7eaae4	Refactor actions and observations (#1479 ) * refactor actions and events * remove type_key * remove stream * move import * move import * fix NullObs * reorder imports * fix lint * fix dataclasses * remove blank fields * fix nullobs * fix sidebar labels * fix test compilation * switch to asdict * lint * fix whitespace * fix executable * delint * fix run * remove NotImplementeds * fix path prefix * remove null files * add debug * add more debug info * fix dataclass on null * remove debug * revert sandbox * fix merge issues * fix tyeps * Update opendevin/events/action/browse.py	2024-05-02 15:44:54 +00:00
Xingyao Wang	435f47ca0e	Improve the both frontend and backend for CodeActAgent (#1494 ) * improve the both frontend and backend for CodeActAgent * fix linter * update integration test	2024-05-02 02:07:40 +08:00
Xingyao Wang	d80f025e21	Implement Jupyter Frontend (#1363 ) * initialize plugin definition * initialize plugin definition * simplify mixin * further improve plugin mixin * add cache dir for pip * support clean up cache * add script for setup jupyter and execution server * integrate JupyterRequirement to ssh_box * source bashrc at the end of plugin load * add execute_cli that accept code via stdin * make JUPYTER_EXEC_SERVER_PORT configurable via env var * increase background cmd sleep time * Update opendevin/sandbox/plugins/mixin.py Co-authored-by: Robert Brennan <accounts@rbren.io> * add mixin to base class * make jupyter requirement a dataclass * source plugins only when >0 requirements * add `sandbox_plugins` for each agent & have controller take care of it * update build.sh to make logs available in /opendevin/logs * switch to use config for lib and cache dir * Add SANDBOX_WORKSPACE_DIR into config * Add SANDBOX_WORKSPACE_DIR into config * fix occurence of /workspace * fix permission issue with /workspace * use python to implement execute_cli to avoid stdin escape issue * add IPythonRunCellAction and get it working * wait until jupyter is avaialble * support plugin via copying instead of mounting * add agent talk action * support follow-up user language feedback * add __str__ for action to be printed better * only print PLAN at the beginning * wip: update codeact agent * get rid the initial messate * update codeact agent to handle null action; add thought to bash * dispatch thought for RUN action as well * fix weird behavior of pxssh where the output would not flush correctly * make ssh box can handle exit_code properly as well * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * update dockerfile with dependency from swe-agent * make env setup a separate script for .bashrc source * add wip prompt * fix mount_dir for ssh_box * update prompt * fix mount_dir for ssh_box * default to use host network * default to use host network * move prompt to a separate file * fix swe-tool plugins; add missing _split_string * remove hostname from sshbox * update the prompt with edit functionality * fix swe-tool plugins; add missing _split_string * add awaiting into status bar * fix the bug of additional send event * remove some print action * move logic to config.py * remove debugging comments * make host network as default * make WORKSPACE_MOUNT_PATH as abspath * implement execute_cli via file cp * Revert "implement execute_cli via file cp" This reverts commit `06f0155bc1`. * add codeact dependencies to default container * add IPythonRunCellObservation * add back cache dir and default to /tmp * make USE_HOST_NETWORK a bool * revert use host network to false * add temporarily fix for IPython RUN action * preliminary implementation of CodeActAgent's jupyter * update node module * update prompt * revert USE_HOST_NETWORK to true since it is not affecting anything * attempt to fix lint * remove newline * update prompt * Refactor browser style. (#1358) * delete useless assets and css class. * add waiting for page loaded (networkidle with 3s timeout) * Add integration test framework with mock llm (#1301) * Add integration test framework with mock llm * Fix MonologueAgent and PlannerAgent tests * Remove adhoc logging * Use existing logs * Fix SWEAgent and PlannerAgent * Check-in test log files * conftest: look up under test name folder only * Add docstring to conftest * Finish dev doc * Avoid non-determinism * Remove dependency on llm embedding model * Init embedding model only for MonologueAgent * Add adhoc fix for sandbox discrepancy * Test ssh and exec sandboxes * CI: fix missing sandbox type * conftest: Remove hack * Reword comment for TODO * Revert "refactor(frontend): Terminal (#1315)" (#1360) This reverts commit `27246aca7e`. * revert USE_HOST_NETWORK to true since it is not affecting anything * attempt to fix lint * handle IsADirectory errors (#1365) * update to 0.4.0 (#1362) Co-authored-by: Jim Su <jimsu@protonmail.com> * feat(frontend): multiple design changes (#1370) * fix/improve terminal hook (#1371) * Revert "update node module" This reverts commit `459b1031e7`. * support SyntaxHighlighter and markdown for jupyter visualization * fix jupyter execution server * make jupyter active * improve the display of markdown and raw text * get base64 image display for react * add `thought` to most action class * fix unit tests for current action abstraction * support user exit * update test cases with the latest action format (added 'thought') * fix integration test for CodeActAGent by mocking stdin * only mock stdin for tests with user_responses.log * remove -exec integration test for CodeActAgent since it is not supported * remove specific stop word * fix comments * improve clarity of prompt * attempt to fix lint * attempt to fix lint yet agiain * fix py lint * fix integration tests * sandbox might failed in chown due to mounting, but it won't be fatal * update debug instruction for sshbox * fix typo * get RUN_AS_DEVIN and network=host working with app sandbox * get RUN_AS_DEVIN and network=host working with app sandbox * attempt to fix the workspace base permission * sandbox might failed in chown due to mounting, but it won't be fatal * update sshbox instruction * remove default user id since it will be passed in the instruction * revert permission fix since it should be resolved by correct SANDBOX_USER_ID * the permission issue can be fixed by simply provide correct env var * remove log * set sandbox user id to getuid by default * move logging to initializer * make the uid consistent across host, app container, and sandbox * remove hostname as it causes sudo issue * fix permission of entrypoint script * make the uvicron app run as host user uid for jupyter plugin * add warning message * fix frontend lint * update dev md for instruction of running unit tests * add back unit tests * revert back to the original sandbox implementation to fix testcases * revert use host network * get docker socket gid and usermod instead of chmod 777 * allow unit test workflow to find docker.sock * make sandbox test working via patch * fix arg parser that's broken for some reason * try to fix app build disk space issue * fix integration test * Revert "fix arg parser that's broken for some reason" This reverts commit `6cc8961133`. * update Development.md * cleanup intergration tests & add exception for CodeAct+execbox * fix config * implement user_message action * fix doc * fix event dict error * fix frontend lint * revert accidentally changes to integration tests * revert accidentally changes to integration tests --------- Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Leo <ifuryst@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Jim Su <jimsu@protonmail.com> Co-authored-by: Alex Bäuerle <alex@a13x.io> Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com> Co-authored-by: Robert Brennan <contact@rbren.io>	2024-05-01 22:37:01 +08:00
Xingyao Wang	1c7cdbefdd	feat(CodeActAgent): Support Agent-User Interaction during Task Execution and the Full Integration of CodeActAgent (#1290 ) * initialize plugin definition * initialize plugin definition * simplify mixin * further improve plugin mixin * add cache dir for pip * support clean up cache * add script for setup jupyter and execution server * integrate JupyterRequirement to ssh_box * source bashrc at the end of plugin load * add execute_cli that accept code via stdin * make JUPYTER_EXEC_SERVER_PORT configurable via env var * increase background cmd sleep time * Update opendevin/sandbox/plugins/mixin.py Co-authored-by: Robert Brennan <accounts@rbren.io> * add mixin to base class * make jupyter requirement a dataclass * source plugins only when >0 requirements * add `sandbox_plugins` for each agent & have controller take care of it * update build.sh to make logs available in /opendevin/logs * switch to use config for lib and cache dir * Add SANDBOX_WORKSPACE_DIR into config * Add SANDBOX_WORKSPACE_DIR into config * fix occurence of /workspace * fix permission issue with /workspace * use python to implement execute_cli to avoid stdin escape issue * add IPythonRunCellAction and get it working * wait until jupyter is avaialble * support plugin via copying instead of mounting * add agent talk action * support follow-up user language feedback * add __str__ for action to be printed better * only print PLAN at the beginning * wip: update codeact agent * get rid the initial messate * update codeact agent to handle null action; add thought to bash * dispatch thought for RUN action as well * fix weird behavior of pxssh where the output would not flush correctly * make ssh box can handle exit_code properly as well * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * update dockerfile with dependency from swe-agent * make env setup a separate script for .bashrc source * add wip prompt * fix mount_dir for ssh_box * update prompt * fix mount_dir for ssh_box * default to use host network * default to use host network * move prompt to a separate file * fix swe-tool plugins; add missing _split_string * remove hostname from sshbox * update the prompt with edit functionality * fix swe-tool plugins; add missing _split_string * add awaiting into status bar * fix the bug of additional send event * remove some print action * move logic to config.py * remove debugging comments * make host network as default * make WORKSPACE_MOUNT_PATH as abspath * implement execute_cli via file cp * Revert "implement execute_cli via file cp" This reverts commit `06f0155bc1`. * add codeact dependencies to default container * add IPythonRunCellObservation * add back cache dir and default to /tmp * make USE_HOST_NETWORK a bool * revert use host network to false * add temporarily fix for IPython RUN action * update prompt * revert USE_HOST_NETWORK to true since it is not affecting anything * attempt to fix lint * remove newline * fix jupyter execution server * add `thought` to most action class * fix unit tests for current action abstraction * support user exit * update test cases with the latest action format (added 'thought') * fix integration test for CodeActAGent by mocking stdin * only mock stdin for tests with user_responses.log * remove -exec integration test for CodeActAgent since it is not supported * remove specific stop word * fix comments * improve clarity of prompt * fix py lint * fix integration tests * sandbox might failed in chown due to mounting, but it won't be fatal * update debug instruction for sshbox * fix typo * get RUN_AS_DEVIN and network=host working with app sandbox * get RUN_AS_DEVIN and network=host working with app sandbox * attempt to fix the workspace base permission * sandbox might failed in chown due to mounting, but it won't be fatal * update sshbox instruction * remove default user id since it will be passed in the instruction * revert permission fix since it should be resolved by correct SANDBOX_USER_ID * the permission issue can be fixed by simply provide correct env var * remove log * set sandbox user id to getuid by default * move logging to initializer * make the uid consistent across host, app container, and sandbox * remove hostname as it causes sudo issue * fix permission of entrypoint script * make the uvicron app run as host user uid for jupyter plugin * add warning message * update dev md for instruction of running unit tests * add back unit tests * revert back to the original sandbox implementation to fix testcases * revert use host network * get docker socket gid and usermod instead of chmod 777 * allow unit test workflow to find docker.sock * make sandbox test working via patch * fix arg parser that's broken for some reason * try to fix app build disk space issue * fix integration test * Revert "fix arg parser that's broken for some reason" This reverts commit `6cc8961133`. * update Development.md * cleanup intergration tests & add exception for CodeAct+execbox * fix config * implement user_message action * fix doc * fix event dict error * fix frontend lint * revert accidentally changes to integration tests * revert accidentally changes to integration tests --------- Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Robert Brennan <contact@rbren.io>	2024-05-01 08:40:00 -04:00
Jirka Borovec	0c2ebfd6e1	Ruff: use I rule for isort (#1410 ) Ruff: use I rule for isort	2024-04-29 15:41:58 -07:00
Xingyao Wang	9b48f63544	Fix build error (#1339 ) * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * update dockerfile with dependency from swe-agent * make env setup a separate script for .bashrc source * fix swe-tool plugins; add missing _split_string * remove import for temporarily fix (will add back in another pr)	2024-04-24 12:25:18 -04:00
Xingyao Wang	fc5e075ea0	feat(sandbox): Implementation of Sandbox Plugin to Support Jupyter (#1255 ) * initialize plugin definition * initialize plugin definition * simplify mixin * further improve plugin mixin * add cache dir for pip * support clean up cache * add script for setup jupyter and execution server * integrate JupyterRequirement to ssh_box * source bashrc at the end of plugin load * add execute_cli that accept code via stdin * make JUPYTER_EXEC_SERVER_PORT configurable via env var * increase background cmd sleep time * Update opendevin/sandbox/plugins/mixin.py Co-authored-by: Robert Brennan <accounts@rbren.io> * add mixin to base class * make jupyter requirement a dataclass * source plugins only when >0 requirements * add `sandbox_plugins` for each agent & have controller take care of it * update build.sh to make logs available in /opendevin/logs * switch to use config for lib and cache dir * fix permission issue with /workspace * use python to implement execute_cli to avoid stdin escape issue * wait until jupyter is avaialble * support plugin via copying instead of mounting --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-04-23 08:45:53 +08:00
Boxuan Li	dd32fa6f4a	Unify linter behaviour across CI and pre-commit-hook (#1071 ) * CI: Add autopep8 linter Currently, we have autopep8 as part of pre-commit-hook. To ensure consistent behaviour, we should have it in CI as well. Moreover, pre-commit-hook contains a double-quote-string-fixer hook which changes all double quotes to single quotes, but I do observe some PRs with massive changes that do the opposite way. I suspect that these authors 1) disable or circumvent the pre-commit-hook, and 2) have other linters such as black in their IDE, which automatically change all single quotes to double quotes. This has caused a lot of unnecessary diff, made review really hard, and led to a lot of conflicts. * Use -diff for autopep8 * autopep8: Freeze version in CI * Ultimate fix * Remove pep8 long line disable workaround * Fix lint.yml * Fix all files under opendevin and agenthub	2024-04-14 00:19:56 -04:00
Boxuan Li	e0c7492609	Traffic Control: Add new config MAX_CHARS (#1015 ) * Add new config MAX_CHARS * Fix mypy linting issues	2024-04-12 19:01:52 +00:00
hugehope	9cd4ad3298	chore: fix some typos in comments (#1013 ) Signed-off-by: hugehope <cmm7@sina.cn>	2024-04-11 23:21:46 +02:00
Xingyao Wang	229fa988c5	remove seed=42 to fix #813 (#830 )	2024-04-06 13:04:17 -04:00
Jack Quimby	d6128941b7	Doc: Document difference between agents (#722 ) * doc: Guide for using local LLM with Ollama * forgot to delete print statement * typos * Updated guide - new working method * Move to docs folder * Fixed front end overwrite local model name * Update llm.py * Delete docs/examples/images/example.png deleted example.png * Documentation of agent differences * rename examples to documentation * Docstrings for all agents * typo fix * typo fixes * Typo fixes * more typo fixes * typo fix * typo fixes * typos fixed * Typo fixes * top 10 list * typo fix * typo fix * typos to the moon * typos fixed * typo fix * typo fix * anotha one * The rest of the typos * Corrected agent descriptions * Agents markdown updated --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-04-05 12:51:25 -05:00
Alex Bäuerle	a82e065f56	feat: add commands for swebench (#682 ) * feat: add commands for swebench * restructure	2024-04-05 12:47:32 -05:00
Xingyao Wang	fe736013cc	Print with color in controller (#176 ) * and color-coded printing action abd obs * fix langchains agent's observation * remove color printing from codeact agent * fix mypy * fix ruff * resolve conflict * fix ruff	2024-04-02 13:13:19 +08:00
Robert Brennan	a6f0c066b5	Implement Planning (#267 ) * add outline of agent * add plan class * add initial prompt * plumb plan through a bit * refactor state management * move task into state * fix errors * add prompt parsing * add task actions * better serialization * more serialization hacks * fix fn * fix recursion error * refine prompt * better description of run * update prompt * tighter planning mechanism * prompt tweaks * fix merge * fix lint issues * add error handling for tasks * add graphic for plans * remove base_path from file actions * rename subtask to task * better planning * prompt updates for verification * remove verify field * ruff * mypy * fix actions	2024-03-29 11:47:29 -04:00
Robert Brennan	94120f2b5d	refactor state management (#258 ) * refactor state management * rm import * move task into state * revert change * revert a few files	2024-03-28 15:23:47 -04:00
Anas DORBANI	82c215ed5d	Extract logic from init from langchains_agent and codeact_agent (#167 )	2024-03-28 08:51:21 -04:00

33 Commits