OpenHands

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-04-29 03:00:45 -04:00

Author	SHA1	Message	Date
மனோஜ்குமார் பழனிச்சாமி	dbf0e5b068	refactor log	2024-06-20 17:19:47 +05:30
மனோஜ்குமார் பழனிச்சாமி	2d19f33e54	log all action _step	2024-06-20 08:05:24 +05:30
மனோஜ்குமார் பழனிச்சாமி	4697c0fb4c	log user actions	2024-06-20 03:33:53 +05:30
மனோஜ்குமார் பழனிச்சாமி	5e67e74652	Skip duplicate log	2024-06-20 02:03:50 +05:30
மனோஜ்குமார் பழனிச்சாமி	d0bdae232f	Show relevant error in UI (#2516 )	2024-06-19 15:58:48 +05:30
Engel Nyst	b2307db010	Document, rename Agent* exceptions to LLM* (#2508 ) * rename "Agent" exceptions to LLM, document LLMResponseError	2024-06-18 22:30:22 +00:00
Engel Nyst	bb4ea1e6cb	Adjust is-stuck check for the same steps to 3 until it's stopped (#2437 )	2024-06-14 19:20:12 +05:30
Boxuan Li	a9a2f10170	Revamp AgentRejectAction and allow ManagerAgent to handle rejection (#1735 ) * Fix AgentRejectAction handling * Add ManagerAgent to integration tests * Fix regenerate.sh * Fix merge * Update README for micro-agents * Add test reject to regenerate.sh * regenerate.sh: Add support for running a specific test and/or agent * Refine reject schema, and allow ManagerAgent to handle reject * Add test artifacts for test_simple_task_rejection * Fix manager agent tests * Fix README * test_simple_task_rejection: check final agent state * Integration test: exit if mock prompt not found * Update test_simple_task_rejection tests * Fix test_edits test artifacts after prompt update * Fix ManagerAgent test_edits * WIP * Fix tests * update test_edits for ManagerAgent * Skip local sandbox for reject test * Fix test comparison	2024-06-08 23:12:30 -07:00
Aaron Xia	b1ec8e5dc2	style: Update agent_controller.py to clean log (#2124 )	2024-05-29 18:56:11 -07:00
Boxuan Li	9b371b1b5f	Refactor agent delegation and tweak micro agents (#1910 ) This PR fixes #1897. In addition, this PR fixes and tweaks a few micro-agents. For the first time, I am able to use ManagerAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds ManagerAgent as part of integration tests. test_write_simple_script involves delegation to CoderAgent while test_edits involves delegation to TypoFixerAgent. Also for the first time, I am able to use DelegateAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds DelegateAgent as part of integration tests. It involves delegation to StudyRepoForTaskAgent, CoderAgent and VerifierAgent. This PR is a blocker for #1735 and likely #1945.	2024-05-28 20:01:16 -07:00
Aleksandar	18d07bda89	feat: add max_budget_per_task configuration to control task cost (#2070 ) * feat: add max_budget_per_task configuration to control task cost * Fix test_arg_parser.py * Use the config.max_budget_per_task as default value * Add max_budget_per_task to core/main.py as well * Update opendevin/controller/agent_controller.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-27 02:04:31 +08:00
Engel Nyst	783fea62a0	Ignore pid for loop detection (Was: override eq...) (#2045 ) * rewrite, implement pid ignore in the controller * make the helper method private	2024-05-26 19:27:12 +02:00
Xingyao Wang	e731048ccf	Improve action and observation logging for the CLI interface (#2035 ) * properly log user messages; format browser action/obs, summarize action, messages properly for logging * add source to message * add spaces for printing	2024-05-24 08:21:25 -04:00
Robert Brennan	ea9c785075	fix session state after resuming (#1999 ) * fix state resuming * fix session reconnection * fix lint	2024-05-23 11:47:36 -04:00
Engel Nyst	0eccf31604	Refactor monologue and SWE agent to use the messages in state history (#1863 ) * Refactor monologue to use the messages in state history * add messages, clean up * fix monologue * update integration tests * move private method * update SWE agent to use the history from State * integration tests for SWE agent * rename monologue to initial_thoughts, since that is what it is	2024-05-23 07:29:12 +00:00
Robert Brennan	5bdacf738d	Refactor session management (#1810 ) * refactor session mgmt * defer file handling to runtime * add todo * refactor sessions a bit more * remove messages logic from FE * fix up socket handshake * refactor frontend auth a bit * first pass at redoing file explorer * implement directory suffix * fix up file tree * close agent on websocket close * remove session saving * move file refresh * remove getWorkspace * plumb path/code differently * fix build issues * fix the tests * fix npm build * add session rehydration * fix event serialization * logspam * fix user message rehydration * add get_event fn * agent state restoration * change history tracking for codeact * fix responsiveness of init * fix lint * lint * delint * fix prop * update tests * logspam * lint * fix test * revert codeact * change fileService to use API * fix up session loading * delint * delint * fix integration tests * revert test * fix up access to options endpoints * fix initial files load * delint * fix file initialization * fix mock server * fixl int * fix auth for html * Update frontend/src/i18n/translation.json Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor sessions and sockets * avoid reinitializing the same session * fix reconnect issue * change up intro message * more guards on reinit * rename agent_session * delint * fix a bunch of tests * delint * fix last test * remove code editor context * fix build * fix any * fix dot notation * Update frontend/src/services/api.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix up error handling * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update frontend/src/services/session.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix build errs * fix else * add closed state * delint * Update opendevin/server/session/session.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-22 18:33:16 +00:00
Yufan Song	d18e6c85a0	feat: add metrics related to cost for better observability (#1944 ) * add metrics for total_cost * make lint * refact codeact * change metrics into llm * add costs list, add into state * refactor log completion * refactor and test others * make lint * Update opendevin/core/metrics.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/llm/llm.py Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor * add code --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-22 08:53:31 +00:00
Frank Xu	1fe290adf9	[Feat] A competitive Web Browsing agent (#1856 ) * initial attempt at a browsing only agent * add browsing agent * update * implement agent * update * fix comments * remove unnecessary things from memory extras * update image processing --------- Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-05-21 19:20:33 +00:00
Engel Nyst	1e51bb9276	Fix/update controller is_stuck() (#1891 ) * Refactor monologue to use the messages in state history remove now unused method * is_stuck update * fix is_stuck * unit tests * fix tests * Revert "Refactor monologue to use the messages in state history" This reverts commit `76b4b765ef`. * Override eq for CmdOutputObservation to ignore the pid, compare the actual command only * Revert "Override eq for CmdOutputObservation to ignore the pid, compare the actual command only" This reverts commit `6418d856b5`.	2024-05-21 22:56:59 +08:00
மனோஜ்குமார் பழனிச்சாமி	4612e107c9	fix: Handle invalid exit code conversion (#1915 )	2024-05-20 11:30:52 -04:00
Xingyao Wang	7817d4c94f	fix(controller): Improve error info logging (#1864 ) * improve error info logging * Move assignment of self.state.error to report_error function * only log exception to state, but not to user --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-20 18:42:38 +08:00
Xingyao Wang	b2fdb963b6	Add detailed tutorial for adding new evaluation benchmarks (#1827 ) * Add detailed tutorial for adding new evaluation benchmarks * update tutorial, fix typo, and log observation to the cmdline * fix url * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * simplify readme and add comments to the actual code * Fix typo in evaluation/TUTORIAL.md * Fix typo in evaluation/swe_bench/run_infer.py * Fix another typo in evaluation/swe_bench/run_infer.py * Update TUTORIAL.md * Set host net work to false for SWEBench * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: OpenDevin <opendevin@opendevin.ai> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-18 13:40:53 -04:00
Boxuan Li	6714000b2c	CodeActAgent: Fix iteration reminder (#1803 ) This PR includes three changes: 1) Iteration reminder should start with MAX_ITERATIONS from config rather than default value 100 2) In the first prompt, we should tell the LLM it has `MAX_ITERATIONS - 1` turns left, rather than `MAX_ITERATIONS - 2` 3) Remove legacy ITERATION_REMINDER config	2024-05-15 13:48:47 +08:00
Xingyao Wang	d1fd277ad4	Support return final task states for evaluation (#1755 ) * support returning states at the end of controller * remove return None * fix issue of overriding final state * return the final state on close * merge AgentState with State * fix integration test * add ChangeAgentStateAction to history in attempt to fix integration * add back set agent state * update tests * update tests * directly return get state * add back the missing .close() * Update typo in opendevin/core/main.py --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-15 03:43:01 +00:00
Robert Brennan	beb74a19f6	Use event stream for the runtime (#1776 ) * rebuild PR from scratch * fix max_iter * regenerate tests * cut down on history * Update opendevin/controller/agent_controller.py * regenerate tests * revert swe agent * revert some codeact chagnes * regenerate tests * add source to dict * only add source if not none * try to fix coverage issue * lock * add gevent	2024-05-14 13:35:25 +00:00
Robert Brennan	82a798990c	refactor remind_iterations (#1760 ) * refactor remind_iterations * regenerate tests * concatenate iteration message * fix merge issues * update integration tests	2024-05-14 08:27:12 -04:00
Robert Brennan	b028bd46bb	Use messages to drive tasks (#1688 ) * finish is working * start reworking main_goal * remove main_goal from microagents * remove main_goal from other agents * fix issues * revert codeact line * make plan a subclass of task * fix frontend for new plan setup * lint * fix type * more lint * fix build issues * fix codeact mgs * fix edge case in regen script * fix task validation errors * regenerate integration tests * fix up tests * fix sweagent * revert codeact prompt * update integration tests * update integration tests * handle loading state * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/agent_controller.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/state/plan.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * update docs * regenerate tests * remove none from state type * revert test files * update integration tests * rename plan to root_task * revert plugin perms * regen integration tests * tweak integration script * prettier * fix test * set workspace up for regeneration * regenerate tests * Change directory of copy * Updated tests * Disable PlannerAgent test * Fix listen * Updated prompts * Disable planner again * Make codecov more lenient * Update agenthub/README.md * Update opendevin/server/README.md * re-enable planner tests * finish top level tasks * regen planner * fix root task factory --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-13 23:14:15 +00:00
Xingyao Wang	8bfae8413e	Support passing sandbox as argument and iteration reminder (#1730 ) * support custom sandbox; add iteration_reminder * Enable iteration reminder in CodeActAgent integration test * Don't remove numbers when comparing prompts * Update tests/integration/README.md --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-12 07:57:33 +00:00
Xia Zhenhua	10b971c612	feat: new delegate stuck check. (#1677 ) Co-authored-by: aaren.xzh <aaren.xzh@antfin.com>	2024-05-09 21:06:20 -04:00
Robert Brennan	26d82841d5	Create runtime implementation (#1626 ) * first pass at moving runtime * fix import * remove github refs * remove unnecessary import * remove unnecessary import * add e2b * refactor read and write file ops * remove github test * rm action * revert permissions * regenerate tests * re-delete file operations * regenerate integration tests * Update opendevin/runtime/runtime.py Co-authored-by: Graham Neubig <neubig@gmail.com> * fix ref * add docs * remove logspam --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-05-09 19:04:49 -04:00
Engel Nyst	446eaec1e6	Refactor config to dataclasses (#1552 ) * mypy is invaluable * fix config, add test * Add new-style toml support * add singleton, small doc fixes * fix some cases of loading toml, clean up, try to make it clearer * Add defaults_dict for UI * allow config to be mutable error handling fix toml parsing * remove debug stuff * Adapt Makefile * Add defaults for temperature and top_p * update to CodeActAgent * comments * fix unit tests * implement groups of llm settings (CLI) * fix merge issue * small fix sandboxes, small refactoring * adapt LLM init to accept overrides at runtime * reading config is enough * Encapsulate minimally embeddings initialization * agent bug fix; fix tests * fix sandboxes tests * refactor globals in sandboxes to properties	2024-05-09 22:48:29 +02:00
Frank Xu	ae7f208d51	Fix browser env leak after resetting agent (#1589 ) * add more teardown * add browser process teardown logic in agent controller * remove testing code --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-05-09 13:17:16 -04:00
Boxuan Li	af5bdf67aa	Add AgentRejectAction across multiple modules (#1615 ) * Add AgentRejectAction across multiple modules This commit introduces the AgentRejectAction class and integrates it across various modules and actions. It includes updates to READMEs, action definitions, and agent controllers to handle the new 'reject' action. This functionality will allow agents to properly signal task rejection. * Fix unit test * Remove wrong generates attributes from a few micro-agents	2024-05-08 10:03:14 -07:00
Robert Brennan	242c4a0df6	Remove extra message actions (#1608 ) * remove extra actions * remove message observations * support null obs * handle null obs * fix frontend for changes * fix the way messages flow to the UI * change think to message * add regen script * regenerate all integration tests * change task * remove gh test * fix messages * fix tests * help agent exit after hitting max iter * Update opendevin/events/observation/success.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-07 21:13:08 +00:00
Robert Brennan	d0967122f8	Minor changes to agent state management (#1597 ) * move towards event stream * refactor agent state changes * move agent state logic * fix callbacks * break on finish * closer to working * change frontend to accomodate new flow * handle start action * fix locked stream * revert message * logspam * no async on close * get rid of agent_task * fix up closing * better asyncio handling * sleep to give back control * fix key * logspam * update frontend agent state actions * fix pause and cancel * delint * fix map * delint * wait for agent to finish * fix unit test * event stream enums * fix merge issues * fix lint * fix test * fix test * add user message action * add user message action * fix up user messages * fix main.py flow * refactor message waiting * lint * fix test * fix test * simplify if/else * fix state reset * logspam * add error status * minor changes to control bar * handle user messages when not awaiting * restart agent after stopping * Update opendevin/controller/agent_controller.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * delint * refactor initialize * delint * fix dispatch --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-06 12:29:48 +00:00
Robert Brennan	f7e0c6cd06	Separate agent controller and server via EventStream (#1538 ) * move towards event stream * refactor agent state changes * move agent state logic * fix callbacks * break on finish * closer to working * change frontend to accomodate new flow * handle start action * fix locked stream * revert message * logspam * no async on close * get rid of agent_task * fix up closing * better asyncio handling * sleep to give back control * fix key * logspam * update frontend agent state actions * fix pause and cancel * delint * fix map * delint * wait for agent to finish * fix unit test * event stream enums * fix merge issues * fix lint * fix test * fix test * add user message action * add user message action * fix up user messages * fix main.py flow * refactor message waiting * lint * fix test * fix test	2024-05-05 19:20:01 +00:00
Robert Brennan	fadcdc117e	Migrate to new folder structure in preparation for refactor (#1531 ) * fix up folder structure * update docs * fix imports * fix imports * fix imoprt * fix imports * fix imports * fix imports * fix test import * fix tests * fix main import	2024-05-02 17:01:54 +00:00
Robert Brennan	ce7c7eaae4	Refactor actions and observations (#1479 ) * refactor actions and events * remove type_key * remove stream * move import * move import * fix NullObs * reorder imports * fix lint * fix dataclasses * remove blank fields * fix nullobs * fix sidebar labels * fix test compilation * switch to asdict * lint * fix whitespace * fix executable * delint * fix run * remove NotImplementeds * fix path prefix * remove null files * add debug * add more debug info * fix dataclass on null * remove debug * revert sandbox * fix merge issues * fix tyeps * Update opendevin/events/action/browse.py	2024-05-02 15:44:54 +00:00
Leo	95e4ca490f	Feat: add lint frontend and lint all to Makefile. (#1354 ) * Feat: add lint frontend and lint all to Makefile. * style codes. * Remove redundant target. --------- Co-authored-by: Jim Su <jimsu@protonmail.com> Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-05-02 11:53:57 +00:00
Frank Xu	836864fa88	[feat] Integrate BrowserGym (#1452 ) * add a single-threaded server serving browsergym * update poetry * update browser page content * add import to make sure browsergym environments are registered properly * remove flask server, use multiprocess impl and Pipe * fix * refactor BrowserEnv * update browser action and obs to include more complete info * fix screenshot * update poetry lock * add playwright install to workflow * update * add better html to text conversion * update for better text conversion to maintain parity with the current handling of browseurlaction * update * update poetry * update multiprocessing mp * fix multiprocessing * update * update github workflow --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-02 19:52:53 +08:00
Xingyao Wang	1c7cdbefdd	feat(CodeActAgent): Support Agent-User Interaction during Task Execution and the Full Integration of CodeActAgent (#1290 ) * initialize plugin definition * initialize plugin definition * simplify mixin * further improve plugin mixin * add cache dir for pip * support clean up cache * add script for setup jupyter and execution server * integrate JupyterRequirement to ssh_box * source bashrc at the end of plugin load * add execute_cli that accept code via stdin * make JUPYTER_EXEC_SERVER_PORT configurable via env var * increase background cmd sleep time * Update opendevin/sandbox/plugins/mixin.py Co-authored-by: Robert Brennan <accounts@rbren.io> * add mixin to base class * make jupyter requirement a dataclass * source plugins only when >0 requirements * add `sandbox_plugins` for each agent & have controller take care of it * update build.sh to make logs available in /opendevin/logs * switch to use config for lib and cache dir * Add SANDBOX_WORKSPACE_DIR into config * Add SANDBOX_WORKSPACE_DIR into config * fix occurence of /workspace * fix permission issue with /workspace * use python to implement execute_cli to avoid stdin escape issue * add IPythonRunCellAction and get it working * wait until jupyter is avaialble * support plugin via copying instead of mounting * add agent talk action * support follow-up user language feedback * add __str__ for action to be printed better * only print PLAN at the beginning * wip: update codeact agent * get rid the initial messate * update codeact agent to handle null action; add thought to bash * dispatch thought for RUN action as well * fix weird behavior of pxssh where the output would not flush correctly * make ssh box can handle exit_code properly as well * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * add initial version of swe-agent plugin; * rename swe cursors * split setup script into two and create two requirements * print SWE-agent command documentation * update swe-agent to default to no custom docs * update dockerfile with dependency from swe-agent * make env setup a separate script for .bashrc source * add wip prompt * fix mount_dir for ssh_box * update prompt * fix mount_dir for ssh_box * default to use host network * default to use host network * move prompt to a separate file * fix swe-tool plugins; add missing _split_string * remove hostname from sshbox * update the prompt with edit functionality * fix swe-tool plugins; add missing _split_string * add awaiting into status bar * fix the bug of additional send event * remove some print action * move logic to config.py * remove debugging comments * make host network as default * make WORKSPACE_MOUNT_PATH as abspath * implement execute_cli via file cp * Revert "implement execute_cli via file cp" This reverts commit `06f0155bc1`. * add codeact dependencies to default container * add IPythonRunCellObservation * add back cache dir and default to /tmp * make USE_HOST_NETWORK a bool * revert use host network to false * add temporarily fix for IPython RUN action * update prompt * revert USE_HOST_NETWORK to true since it is not affecting anything * attempt to fix lint * remove newline * fix jupyter execution server * add `thought` to most action class * fix unit tests for current action abstraction * support user exit * update test cases with the latest action format (added 'thought') * fix integration test for CodeActAGent by mocking stdin * only mock stdin for tests with user_responses.log * remove -exec integration test for CodeActAgent since it is not supported * remove specific stop word * fix comments * improve clarity of prompt * fix py lint * fix integration tests * sandbox might failed in chown due to mounting, but it won't be fatal * update debug instruction for sshbox * fix typo * get RUN_AS_DEVIN and network=host working with app sandbox * get RUN_AS_DEVIN and network=host working with app sandbox * attempt to fix the workspace base permission * sandbox might failed in chown due to mounting, but it won't be fatal * update sshbox instruction * remove default user id since it will be passed in the instruction * revert permission fix since it should be resolved by correct SANDBOX_USER_ID * the permission issue can be fixed by simply provide correct env var * remove log * set sandbox user id to getuid by default * move logging to initializer * make the uid consistent across host, app container, and sandbox * remove hostname as it causes sudo issue * fix permission of entrypoint script * make the uvicron app run as host user uid for jupyter plugin * add warning message * update dev md for instruction of running unit tests * add back unit tests * revert back to the original sandbox implementation to fix testcases * revert use host network * get docker socket gid and usermod instead of chmod 777 * allow unit test workflow to find docker.sock * make sandbox test working via patch * fix arg parser that's broken for some reason * try to fix app build disk space issue * fix integration test * Revert "fix arg parser that's broken for some reason" This reverts commit `6cc8961133`. * update Development.md * cleanup intergration tests & add exception for CodeAct+execbox * fix config * implement user_message action * fix doc * fix event dict error * fix frontend lint * revert accidentally changes to integration tests * revert accidentally changes to integration tests --------- Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Robert Brennan <contact@rbren.io>	2024-05-01 08:40:00 -04:00
Jirka Borovec	0c2ebfd6e1	Ruff: use I rule for isort (#1410 ) Ruff: use I rule for isort	2024-04-29 15:41:58 -07:00
Xia Zhenhua	086a2ed17f	feat: make the response of agent_controller better to process when exception in step execution (#1445 ) * feat: make the response of agent_controller better to process when exception occurred during executing step. * Update opendevin/controller/agent_controller.py --------- Co-authored-by: aaren.xzh <aaren.xzh@antfin.com> Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-04-29 21:21:26 +00:00
Boxuan Li	831e934dab	Refactor: Use enum for config keys (#1376 )	2024-04-26 10:26:01 -04:00
Engel Nyst	2318ceae35	Send JSON parsing exceptions to LLM (#1342 ) * Add malformed JSON where we don't even start finding actions * Send any exception during JSON parsing back * Use specific exceptions --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-04-24 17:51:09 -04:00
Robert Brennan	1e95fa435d	Microagents and Delegation (#1238 ) * basic microagent structure * start on jinja * add instructions parser * add action instructions * add history instructions * fix a few issues * fix a few issues * fix issues * fix agent encoding * fix up anon class * prompt to fix errors * less debug info when errors happen * add another traceback * add output to finish * fix math prompt * fix pg prompt * fix up json prompt * fix math prompt * fix math prompt * fix repo prompt * fix up repo explorer * update lock * revert changes to agent_controller * refactor microagent registration a bit * create delegate action * delegation working * add finish action to manager * fix tests * rename microagents registry * rename fn * logspam * add metadata to manager agent * fix message * move repo_explorer * add delegator agent * rename agent_definition * fix up input-output plumbing * fix tests * Update agenthub/micro/math_agent/agent.yaml Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/delegator_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/delegator_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * remove prompt.py * fix lint * Update agenthub/micro/postgres_agent/agent.yaml Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/micro/postgres_agent/agent.yaml Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix error --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-04-24 17:46:14 -04:00
Robert Brennan	236b7bf6ea	refactor error handling so not all exceptions are caught (#1296 ) * refactor error handling so not all exceptions are caught * revert * Send the failed decoding back to the LLM (#1322) * fix quotes --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-04-24 16:44:06 +00:00
Engel Nyst	d1551c3097	Add checks to stop infinite loops (#1293 ) * Add checks to stop infinite loops * Send an AgentErrorObservation for the user to see an oops loop * (NullAction, Obs) problem should be (NullAction, error Obs) * Merge the two with AgentErrorObs. * Update opendevin/controller/agent_controller.py	2024-04-23 17:52:32 -04:00
Xingyao Wang	fc5e075ea0	feat(sandbox): Implementation of Sandbox Plugin to Support Jupyter (#1255 ) * initialize plugin definition * initialize plugin definition * simplify mixin * further improve plugin mixin * add cache dir for pip * support clean up cache * add script for setup jupyter and execution server * integrate JupyterRequirement to ssh_box * source bashrc at the end of plugin load * add execute_cli that accept code via stdin * make JUPYTER_EXEC_SERVER_PORT configurable via env var * increase background cmd sleep time * Update opendevin/sandbox/plugins/mixin.py Co-authored-by: Robert Brennan <accounts@rbren.io> * add mixin to base class * make jupyter requirement a dataclass * source plugins only when >0 requirements * add `sandbox_plugins` for each agent & have controller take care of it * update build.sh to make logs available in /opendevin/logs * switch to use config for lib and cache dir * fix permission issue with /workspace * use python to implement execute_cli to avoid stdin escape issue * wait until jupyter is avaialble * support plugin via copying instead of mounting --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-04-23 08:45:53 +08:00
Engel Nyst	464bf7ee23	Tweak connect exceptions (#1120 ) * Clean up manual sleep * Add default retries and document them. * Add doctrings to llm * Add exponential backoff for rate limiting errors * Get embeddings for the action and its own content, not the user message * Add a few bad exceptions to stop loop * Stop loop when the step has no action * Add action with content, no message, to history * make retry settings customizable * fix condense to stop the loop for the same reasons as completion * Add 500-504 exception to retries * document the retry variables * Add retries and limits for embeddings. Replaces llama-index hard-coded decorator. * Rename to retry_min_wait and retry_max_wait	2024-04-22 04:00:01 +02:00

1 2

78 Commits