OpenHands

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-01-10 07:18:10 -05:00

Author	SHA1	Message	Date
Xingyao Wang	4f0a454ed6	[Arch] Support integration tests using EventStream Runtime (#3184 ) * Remove global config from memory * Remove runtime global config * Remove from storage * Remove global config * Fix event stream tests * Fix sandbox issue * Change config * Removed transferred tests * Add swe env box * Fixes on testing * Fixed some tests * Merge with stashed changes * Fix typing * Fix ipython test * Revive function * Make temp_dir fixture * Remove test to avoid circular import * fix eventstream filestore for test_runtime * fix parse arg issue that cause integration test to fail * support swebench pull from custom namespace * add back simple tests for runtime * move multi-line bash tests to test_runtime; support multi-line bash for esruntime; * add testcase to handle PS2 prompt * use bashlex for bash parsing to handle multi-line commands; add testcases for multi-line commands * revert ghcr runtime change * Apply stash * fix run as other user; make test async; * fix test runtime for run as od * add run-as-devin to all the runtime tests * handle the case when username is root * move all run-as-devin tests from sandbox; only tests a few cases on different user to save time; * move over multi-line echo related tests to test_runtime * fix user-specific jupyter by fixing the pypoetry virtualenv folder * make plugin's init async; chdir at initialization of jupyter plugin; move ipy simple testcase to test runtime; * support agentskills import in move tests for jupyter pwd tests; overload `add_env_vars` for EventStreamRuntime to update env var also in Jupyter; make agentskills read env var lazily, in case env var is updated; * fix ServerRuntime agentskills issue * move agnostic image test to test_runtime * merge runtime tests in CI * fix enable auto lint as env var * update warning message * update warning message * test for different container images * change parsing output as debug * add exception handling for update_pwd_decorator * fix unit test indentation * add plugins as default input to Runtime class; remove init_sandbox_plugins; implement add_env_var (include jupyter) in the base class; * fix server runtime auto lint * Revert "add exception handling for update_pwd_decorator" This reverts commit `2b668b1506`. * tries to print debugging info for agentskills * explictly setting uid (try fix permission issue) * Revert "tries to print debugging info for agentskills" This reverts commit `8be4c86756`. * set sandbox user id during testing to hopefully fix the permission issue * add browser tools for server runtime * try to debug for old pwd * update debug cmd * only test agnostic runtime when TEST_RUNTIME is Server * fix temp dir mkdir * load TEST_RUNTIME at the beginning * remove ipython tests * only log to file when DEBUG * default logging to project root * temporarily remove log to file * fix LLM logger dir * fix logger * make set pwd an optional aux action * fix prev pwd * fix infinity recursion * simplify * do not import the whole od library to avoid logger folder by jupyter * fix browsing * increase timeout * attempt to fix agentskills yet again * clean up in testcases, since CI maybe run as non-root * add _cause attribute for event.id * remove parent * add a bunch of debugging statement again for CI :( * fix temp_dir fixture * change all temp dir to follow pytest's tmp_path_factory * remove extra bracket * clean up error printing a bit * jupyter chdir to self.config.workspace_mount_path_in_sandbox on initialization * jupyter chdir to self.config.workspace_mount_path_in_sandbox on initialization * add typing for tmp dir fixture * clear the directory before running the test to avoid weird CI temp dir * remove agnostic test case for server runtime * Revert "remove agnostic test case for server runtime" This reverts commit `30e2181c3f`. * disable agnostic tests in CI * fix test * make sure plugin arg is not passed when no plugin is specified; remove redundant on_event function; * move mock prompt * rename runtime * remove extra logging * refactor run_controller's interface; support multiple runtime for integration test; filter out hostname for prompt * uncomment other tests * pass the right runtime to controller * log runtime when start * uncomment tests * improve symbol filters * add intergration test prompts that seemd ok * add integration test workflow * add python3 to default ubuntu image * symlink python and fix permission to jupyter pip * add retry for jupyter execute server * fix jupyter pip install; add post-process for jupyter pip install; simplify init by add agent_skills path to PYTHONPATH; add testcase to tests jupyter pip install; * fix bug * use ubuntu:22.04 for eventstream integration tests * add todo * update testcase * remove redundant code * fix unit test * reduce dependency for runtime * try making llama-index an optional dependency that's not installed by default * remove pip install since it seemd not needed * log ipython execution; await write message since it returns a future * update ipy testcase * do not install llama-index in CI * do not install llama-index in the app docker as well * set sandbox container image in the integration test script * log plugins & env var for runtime * update conftest for sha256 * add git * remove all non-alphanumeric chalracters * add working ipy module tests! * default to use host network * remove is_async from browser to make thing a little more reliable; retry loading browser when error; * add sleep to wait a bit for http server * kill http server before regenerate browsing tests * fix browsing * only set sandbox container image if undefined * skip empty config value * update evaluation to use the latest run_controller * revert logger in execute_server to be compatible with server runtime * revert logging level to fix jupyter * set logger level * revert the logging * chmod for workspace to fix permission * support getting timeout from action * update test for server runtime * try to fix file permission * fix test_cmd_run_action_serialization_deserialization test (added timeout) * poetry: pip 24.2, torch 2.2.2 * revert adding pip to pyproject.toml * add build to dependencies in pyproject.toml * forgot poetry lock --no-update * fix a DelegatorAgent prompt_002.log (timeout) * fix a DelegatorAgent prompt_003.log (timeout) * couple more timeout attribs in prompt files * some more prompt files * prompts galore * add clarification comment for timeout * default timeout to config * add assert * update integraton tests for eventstream * update integration tests * fix timeout for action<->dict * remove redundant on_event * fix action execution timeout * updatelock --------- Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: tobitege <tobitege@gmx.de>	2024-08-01 22:07:39 +00:00
tobitege	a4cb880699	(feat) LLM class: added acompletion and streaming + unit test (#3202 ) * LLM class: added acompletion and streaming, unit test test_acompletion.py * LLM: cleanup of self.config defaults and their use * added set_missing_attributes to LLMConfig * move default checker up	2024-08-01 22:41:40 +02:00
Xingyao Wang	286f10053e	[arch] Implement `copy_to` for Runtime (#3211 ) * add copy to * implement for ServerRuntime * implement copyto for runtime (required by eval); add tests for copy to * fix exist file check * unify copy_to_behavior and fix stuff	2024-08-02 02:46:11 +08:00
Xingyao Wang	2e60d25eae	[Agent, LLM] Make sure codeact agent produce message in u/a/u/a order (#3193 ) * make sure codeact agent produce message in u/a/u/a order * integration tests * sync message changes to codeact swe * fix integration tests --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-08-02 00:17:53 +08:00
Engel Nyst	21ea9953b3	don't use realpath with non-existent files (#3200 )	2024-08-01 01:11:22 +02:00
tobitege	938ed027c2	(fix) test_runtime.py parametrization for box_class (#3186 ) * fix test_runtime.py parametrization; prevent duplicate test runs * trivial file change to unblock stuck CI workflow * fix print_method_name fixture in test_runtime (yield was missing) * revert wrong param fixtures	2024-08-01 01:30:10 +08:00
Graham Neubig	a562a7ac7d	Add unit tests for LLM init function (#3188 ) * Add unit tests for LLM init function * Fix formatting --------- Co-authored-by: OpenDevin <opendevin@all-hands.dev>	2024-07-31 16:28:50 +02:00
Xingyao Wang	bd68249fba	[Arch] Test `EventStreamRuntime` to ensure its feature parity with `ServerRuntime` (#3157 ) * Remove global config from memory * Remove runtime global config * Remove from storage * Remove global config * Fix event stream tests * Fix sandbox issue * Change config * Removed transferred tests * Add swe env box * Fixes on testing * Fixed some tests * Merge with stashed changes * Fix typing * Fix ipython test * Revive function * Make temp_dir fixture * Remove test to avoid circular import * fix eventstream filestore for test_runtime * fix parse arg issue that cause integration test to fail * support swebench pull from custom namespace * add back simple tests for runtime * move multi-line bash tests to test_runtime; support multi-line bash for esruntime; * add testcase to handle PS2 prompt * use bashlex for bash parsing to handle multi-line commands; add testcases for multi-line commands * revert ghcr runtime change * Apply stash * fix run as other user; make test async; * fix test runtime for run as od * add run-as-devin to all the runtime tests * handle the case when username is root * move all run-as-devin tests from sandbox; only tests a few cases on different user to save time; * move over multi-line echo related tests to test_runtime * fix user-specific jupyter by fixing the pypoetry virtualenv folder * make plugin's init async; chdir at initialization of jupyter plugin; move ipy simple testcase to test runtime; * support agentskills import in move tests for jupyter pwd tests; overload `add_env_vars` for EventStreamRuntime to update env var also in Jupyter; make agentskills read env var lazily, in case env var is updated; * fix ServerRuntime agentskills issue * move agnostic image test to test_runtime * merge runtime tests in CI * fix enable auto lint as env var * update warning message * update warning message * test for different container images * change parsing output as debug * add exception handling for update_pwd_decorator * fix unit test indentation * add plugins as default input to Runtime class; remove init_sandbox_plugins; implement add_env_var (include jupyter) in the base class; * fix server runtime auto lint * Revert "add exception handling for update_pwd_decorator" This reverts commit `2b668b1506`. * tries to print debugging info for agentskills * explictly setting uid (try fix permission issue) * Revert "tries to print debugging info for agentskills" This reverts commit `8be4c86756`. * set sandbox user id during testing to hopefully fix the permission issue * add browser tools for server runtime * try to debug for old pwd * update debug cmd * only test agnostic runtime when TEST_RUNTIME is Server * fix temp dir mkdir * load TEST_RUNTIME at the beginning * remove ipython tests * only log to file when DEBUG * default logging to project root * temporarily remove log to file * fix LLM logger dir * fix logger * make set pwd an optional aux action * fix prev pwd * fix infinity recursion * simplify * do not import the whole od library to avoid logger folder by jupyter * fix browsing * increase timeout * attempt to fix agentskills yet again * clean up in testcases, since CI maybe run as non-root * add _cause attribute for event.id * remove parent * add a bunch of debugging statement again for CI :( * fix temp_dir fixture * change all temp dir to follow pytest's tmp_path_factory * remove extra bracket * clean up error printing a bit * jupyter chdir to self.config.workspace_mount_path_in_sandbox on initialization * jupyter chdir to self.config.workspace_mount_path_in_sandbox on initialization * add typing for tmp dir fixture * clear the directory before running the test to avoid weird CI temp dir * remove agnostic test case for server runtime * Revert "remove agnostic test case for server runtime" This reverts commit `30e2181c3f`. * disable agnostic tests in CI * fix test --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-31 04:30:59 +08:00
tobitege	2533efabbb	(fix) split_bash_commands replaced; temp_dir fixture fix in some tests (#3160 ) * split_bash_commands replaced; temp_dir fixture fix in some tests * tweak test_runtime * skip 2 tests in test_runtime that need fixing in extra PR * reverting bash parsing changes and re-enabled tests * missed to revert a changed assert in test_runtime.py	2024-07-29 17:05:58 +00:00
Xingyao Wang	b1ea204c5b	Migrate multi-line-bash-related sandbox tests into runtime tests and fix multi-line issue (#3128 ) * Remove global config from memory * Remove runtime global config * Remove from storage * Remove global config * Fix event stream tests * Fix sandbox issue * Change config * Removed transferred tests * Add swe env box * Fixes on testing * Fixed some tests * Merge with stashed changes * Fix typing * Fix ipython test * Revive function * Make temp_dir fixture * Remove test to avoid circular import * fix eventstream filestore for test_runtime * fix parse arg issue that cause integration test to fail * support swebench pull from custom namespace * add back simple tests for runtime * move multi-line bash tests to test_runtime; support multi-line bash for esruntime; * add testcase to handle PS2 prompt * use bashlex for bash parsing to handle multi-line commands; add testcases for multi-line commands * revert ghcr runtime change --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-27 20:12:57 +00:00
Graham Neubig	275ea706cf	Remove remaining global config (#3099 ) * Remove global config from memory * Remove runtime global config * Remove from storage * Remove global config * Fix event stream tests * Fix sandbox issue * Change config * Removed transferred tests * Add swe env box * Fixes on testing * Fixed some tests * Fix typing * Fix ipython test * Revive function * Make temp_dir fixture * Remove test to avoid circular import	2024-07-26 18:43:32 +00:00
Xingyao Wang	1f6e86c932	Fix(test,CI): runtime build tests (#3126 ) * fix runtime build test * get runtime_build test to run in CI * move test involving docker from `test_ipython` to `test_sandbox`	2024-07-26 22:53:01 +08:00
tobitege	d0217b84ef	test_runtime: run tests per runtime, not alternating (#3103 )	2024-07-26 03:01:50 +08:00
Xingyao Wang	405c8a0456	[Arch] Add runtime image build CI & clean up runtime build using `jinja2` template (#3055 ) * test_runtime_client.py to test _execute_bash() * runtime_build and runtime tweaks * fix in docker script * revert bash changes * use sandbox_config.update_source_code to control source code update * add od_version to the sandbox tag * add doc instruction for update source code * do not remove whole poetry folder; add mamba clean * add missing newlines * cleanup runtime dockerfile into jinja template * make prep temp file a separate function; make that function accessible through cli * modify `runtime_build.py` so it can generate directory for building docker img * add dockerfile and sdist of runtime to gitignore since it will be dynamically generated * add runtime to build * do not rebuild new image when an `od_runtime` is provided * use default container_image for testing if possible * move runtime tests to ghcr runtime workflow * update docker base dir for runtime * fix unittest * fix image name * fix image name for test case * rename to make it consistent --------- Co-authored-by: tobitege <tobitege@gmx.de>	2024-07-24 21:56:12 +08:00
Boxuan Li	445f290beb	Validate to_replace in edit_file_by_replace AgentSkill (#3073 ) * Validate to_replace in edit_file_by_replace AgentSkill * Remove redundant replace reminder prompt * Add unit tests * Fix prompt	2024-07-22 21:01:35 -07:00
Graham Neubig	4099e48122	Removed config from agent controller (#3038 ) * Removed config from agent controller * Fix tests * Increase budget * Update tests * Update prompts * Add missing prompt * Fix mistaken deletions * Fix browsing test * Fixed browse tests	2024-07-22 17:42:57 +00:00
Graham Neubig	04877f8caf	Remove global config from tests (#3052 )	2024-07-20 23:07:09 -04:00
Boxuan Li	be6e6e3add	Bug fix: Metrics not accumulated across agent delegation (#3012 ) * Add test to reproduce cost miscalculation bug * Fix metrics bug * Copy metrics upon AgentRejectAction	2024-07-20 04:05:05 +00:00
Graham Neubig	3a21198424	Remove monologue agent (#3036 ) * Remove monologue agent * Fixes	2024-07-19 19:25:05 +00:00
jigsawlabs-student	fa6c12473e	#2220 , integrated aider style linting, currently passes related o… (#2489 ) * WIP for integrate aider linter, see OpenDevin#2220 Updated aider linter to: * Always return text and line numbers * Moved extract line number more consistently * Changed pylint to stop after first linter detects errors Updated agentskills * To get back a LintResult object and then use lines and text for error message and related line number * Moved code for extracting line number to aider linter Tests: * Added additional unit tests for aider to test for * Return values from lint failures * Confirm linter works for non-configured languages like Ruby * move to agent_skills, fixes not seeing skills error * format/lint to new code, fix failing tests, remove unused code from aider linter * small changes (remove litellm, fix readme typo) * fix failing sandbox test * keep, change dumping of metadata * WIP for integrate aider linter, see OpenDevin#2220 Updated aider linter to: * Always return text and line numbers * Moved extract line number more consistently * Changed pylint to stop after first linter detects errors Updated agentskills * To get back a LintResult object and then use lines and text for error message and related line number * Moved code for extracting line number to aider linter Tests: * Added additional unit tests for aider to test for * Return values from lint failures * Confirm linter works for non-configured languages like Ruby * move to agent_skills, fixes not seeing skills error * format/lint to new code, fix failing tests, remove unused code from aider linter * remove duplication of tree-sitter, grep-ast and update poetry.lock * revert to main branch poetry.lock version * only update necessary package * fix jupyter kernel wrong interpreter issue (only for swebench) * fix failing lint tests * update syntax error checks for flake * update poetry lock file * update poetry.lock file, which update content-hash * add grep ast * remove extra stuff caused by merge * update pyproject * remove extra pytest fixture, ruff styling fixes * lint files * update poetry.lock file --------- Co-authored-by: Jeff Katzy <jeffreyerickatz@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: tobitege <tobitege@gmx.de>	2024-07-19 21:58:54 +08:00
Xingyao Wang	ac27ded81f	Fix: handle the case where env var is empty (#3016 ) * handle the case where env var is empty * fix logging * include obs content in logging * change to add_env_vars	2024-07-19 13:51:06 +00:00
Xingyao Wang	ff6ddc831f	fix: runtime test for mac (#3005 ) * move use_host_network to sandbox config * fix test runtime tests * fix kwargs to make it clearer	2024-07-19 03:03:55 +00:00
Boxuan Li	9d41314d1a	State: Add local_iteration attribute (#2990 ) * Add local_iteration state attribute * Fix typos --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-07-18 14:49:19 +00:00
tobitege	5a5713009f	INT: prevent error on repeat integration tests after failed test(s) (#2935 ) * Integration tests: prevent File not found error * forgot to remove debug calls in regenerate.sh	2024-07-18 06:29:15 +02:00
Xingyao Wang	f80ecec772	[Arch] Add tests for `EventStreamRuntime` and fix bash parsing (#2933 ) * deprecating recall action * fix integration tests * fix integration tests * refractor runtime to use async * remove search memory * rename .initialize to .ainit * draft of runtime image building (separate from img agnostic) * refractor runtime build into separate file and add unit tests for it * fix image agnostic tests * move `split_bash_commands` into a separate util file * fix bash pexcept parsing for env * refractor add_env_var from sandbox to runtime; add test runtime for env var, remove it from sandbox; * remove unclear comment * capture broader error * make `add_env_var` handle multiple export at the same time * add multi env var test * fix tests with new config * make runtime tests a separate ci to avoid full disk * Update Runtime README with architecture diagram and detailed explanations * update test * remove dependency of global config in sandbox test * fix sandbox typo * runtime tests does not need ghcr build now * remove download runtime img * remove dependency of global config in sandbox test * fix sandbox typo * try to free disk before running the tests * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * try to reduce code duplication * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Update opendevin/runtime/client/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * cleanup before setup * temporarily remove this enable lint test since env var are now handled by runtime * linter --------- Co-authored-by: OpenDevin <opendevin@all-hands.dev> Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-07-18 06:10:45 +08:00
Xingyao Wang	cf3d2298da	Refactor: remove the use of global variable in test_sandbox (#2985 ) * remove dependency of global config in sandbox test * fix sandbox typo * try to reduce code duplication	2024-07-17 20:42:40 +00:00
Graham Neubig	c897791024	Refactor LLM config (#2953 ) * Add max_message_chars to LLM * Refactor LLM config * Fix tests * Made some functions class functions * Fix regression * Fixed comments	2024-07-17 09:16:04 -04:00
Graham Neubig	257698e89b	Remove global config from sandbox (#2961 ) * Some changes * Fixed errors * Remove duplicate initialize_plugins * Fix some tests * Fix tests	2024-07-16 18:34:04 +00:00
Boxuan Li	e3e437fcc2	Rework --llm-config CLI arg (#2957 )	2024-07-16 04:17:59 +00:00
Anush Kumar V	8f76587e5c	docs: updated docstrings using ruff's autofix feature (#2923 ) * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated docstrings using ruff's autfix feature * Deleted opendevin/runtime/utils/soource.py, Keeping in sync with main --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-16 01:35:33 +00:00
Xingyao Wang	9b1f59a56e	Arch: refactor and add unit tests for `EventStreamRuntime` docker image build (#2908 ) * deprecating recall action * fix integration tests * fix integration tests * refractor runtime to use async * remove search memory * rename .initialize to .ainit * draft of runtime image building (separate from img agnostic) * refractor runtime build into separate file and add unit tests for it * fix image agnostic tests * Update opendevin/runtime/utils/runtime_build.py Co-authored-by: Mingzhang Zheng <649940882@qq.com> --------- Co-authored-by: Mingzhang Zheng <649940882@qq.com>	2024-07-15 01:27:31 +00:00
Xingyao Wang	e45ddeb2a2	arch: deprecating recall action and `search_memory` (#2900 ) * deprecating recall action * fix integration tests * fix integration tests * remove search memory	2024-07-12 19:23:21 +00:00
Boxuan Li	ebbc0e6803	Integration testing: unset irrelevant env variables (#2902 )	2024-07-12 22:12:37 +08:00
Xingyao Wang	e45d46c993	[Arch] Implement EventStream Runtime Client with Jupyter Support using Agnostic Sandbox (#2879 ) * support loading a particular runtime class via config.runtime (default to server to not break things) * move image agnostic util to shared runtime util * move dependency * include poetry.lock in sdist * accept port as arg for client * make client start server with specified port * update image agnostic utility for eventstream runtime * make client and runtime working with REST API * rename execute_server * add plugin to initialize stuff inside es-runtime; cleanup runtime methods to delegate everything to container * remove redundant ls -alh * fix jupyter * improve logging in agnostic sandbox * improve logging of test function * add read & edit * update agnostic sandbox * support setting work dir at start * fix file read/write test * fix unit test * update tescase * Fix unit test again * fix unit test again again	2024-07-12 01:52:26 +08:00
Xingyao Wang	1b54800a29	[Agent] Improve edits by adding back `edit_file_by_line` (#2722 ) * add replace-based block edit & preliminary test case fix * further fix the insert behavior * make edit only work on first occurence * bump codeact version since we now use new edit agentskills * update prompt for new agentskills * update integration tests * make run_infer.sh executable * remove code block for edit_file * update integration test for prompt changes * default to not use hint for eval * fix insert emptyfile bug * throw value error when `to_replace` is empty * make `_edit_or_insert_file` return string so we can try to fix some linter errors (best attempt) * add todo * update integration test * fix sandbox test for this PR * fix inserting with additional newline * rename to edit_file_by_replace * add back `edit_file_by_line` * update prompt for new editing tool * fix integration tests * bump codeact version since there are more changes * add back append file * fix current line for append * fix append unit tests * change the location where we show edited line no to agent and fix tests * update integration tests * fix global window size affect by open_file bug * fix global window size affect by open_file bug * increase window size to 300 * add file beginning and ending marker to avoid looping * expand the editor window to better display edit error for model * refractor to breakdown edit to internal functions * reduce window to 200 * move window to 100 * refractor to cleanup some logic into _calculate_window_bounds * fix integration tests * fix sandbox test on new prompt * update demonstration with new changes * fix integration * initialize llm inside process_instance to circumvent "AttributeError: Can't pickle local object" * update kwargs * retry for internal server error * fix max iteration * override max iter from config * fix integration tests * remove edit file by line * fix integration tests * add instruction to avoid hanging * Revert "add instruction to avoid hanging" This reverts commit `06fd2c5938`. * handle content policy violation error * fix integration tests * fix typo in prompt - the window is 100 * update all integration tests --------- Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>	2024-07-11 15:30:20 +00:00
adragos	5f61885e44	feat: Implement user confirmation mode, request confirmation when running bash/python code in this mode (#2774 ) * [feat] confirmation mode for bash actions * feat: Add modal setting for Confirmation Mode * fix: frontend tests for confirmation mode switch * fix: add missing CONFIRMATION_MODE value in SettingsModal.test.tsx * fix: update test to integrate new setting * feat: Implement user confirmation for running bash/python code * fix: don't display rejected actions * fix: linting, rename/refactor based on feedback * fix: add property only to commands, pass serialization tests * fix: package-lock.json, lint test_action_serialization.py * test: add is_confirmed to integration test outputs --------- Co-authored-by: Mislav Balunovic <mislav.balunovic@gmail.com>	2024-07-11 14:57:21 +03:00
மனோஜ்குமார் பழனிச்சாமி	1d4f422638	Doc: Mention FORCE_REGENERATE var (#2833 ) * Mention FORCE_REGENERATE var in doc * Update tests/integration/README.md --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-07-11 04:01:15 +00:00
Boxuan Li	c68478f470	Customize LLM config per agent (#2756 ) Currently, OpenDevin uses a global singleton LLM config and a global singleton agent config. This PR allows customers to configure an LLM config for each agent. A hypothetically useful scenario is to use a cheaper LLM for repo exploration / code search, and a more powerful LLM to actually do the problem solving (CodeActAgent). Partially solves #2075 (web GUI improvement is not the goal of this PR)	2024-07-09 22:05:54 -07:00
Xingyao Wang	f2e92b2db7	move image agnostic util to shared runtime util (#2859 )	2024-07-08 22:17:01 +00:00
மனோஜ்குமார் பழனிச்சாமி	c6aa50779d	Update regenerate.sh (#2832 )	2024-07-07 23:52:03 +02:00
Engel Nyst	d37b2973b2	Refactoring: event stream based agent history (#2709 ) * add to event stream sync * remove async from tests * small logging spam fix * remove swe agent * arch refactoring: use history from the event stream * refactor agents * monologue agent * ruff * planner agent * micro-agents * refactor history in evaluations * evals history refactoring * adapt evals and tests * unit testing stuck * testing micro agents, event stream * fix planner agent * fix tests * fix stuck after rename * fix test * small clean up * fix merge * fix merge issue * fix integration tests * Update agenthub/dummy_agent/agent.py * fix tests * rename more clearly; add todo; clean up	2024-07-07 21:04:23 +00:00
Shimada666	d22ff73905	Make the sandbox Python runtime completely transparent (#2796 ) * Make the sandbox Python runtime completely independent * fix source bashrc * add pip install instruction for ipython to fix intergration tests for codeact swe * update integration tests * change flake8 command to (maybe) fix sandbox tests? * make lint support both unittest & sandbox * fix agnostic image build error * refactor build script --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: tobitege <tobitege@gmx.de>	2024-07-06 15:22:44 +02:00
Xingyao Wang	a47713ecb0	[Arch] Remove supports for Background Commands (#2803 ) * depracting docker exec box * remove doc exec from workflow and docs * remove background commands * Update tests/unit/test_sandbox.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * replace for-loop with assignment * fix integration tests * fix integration tests for shell script * fix integration tests * increase max iter to fix some monologue agent issue * fix integration test again * fix integration tests (seems related to run_user issue) --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-06 03:38:05 +08:00
மனோஜ்குமார் பழனிச்சாமி	143f38d25a	Refactored sandbox config and added fast boot (#2455 ) * Refactored sandbox config and added fastboot * added tests * fixed tests * fixed tests * intimate user about breaking change * remove default config from eval * check for lowercase env * add test * Revert Migration * migrate old sandbox configs * resolve merge conflict * revert migration 2 * Revert "remove default config from eval" This reverts commit `de57c588db`. * change type to box_type * fix var name * linted * lint * lint comments * fix tests * fix tests * fix typo * fix box_type, remove fast_boot * add tests for sandbox config * fix test * update eval docs * small removal comments * adapt toml template * old fields shouldn't be in the app dataclass * fix old keys in app config * clean up exec box --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-05 03:30:21 +00:00
Xingyao Wang	0d3b3ffbf8	[Arch] Removing docker exec box (#2802 ) * depracting docker exec box * remove doc exec from workflow and docs	2024-07-04 23:15:25 +00:00
Engel Nyst	0b8d357bef	Add event synchronously (#2700 ) * add to event stream sync * remove async from tests	2024-07-05 00:15:51 +02:00
sven	1b10e2b9d5	Make CodeAct finish task (#2673 ) * Added feature to CodeAct agent to finish action instead of waiting for user input. * Minor change * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com> * updated integration tests with claude-sonnet-3.5 * Update agenthub/codeact_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * updated tests to remove typo in prompt * resolve merge conflicts II * revert unintended change of regenerate script * re-regenerating prompts to resolve merge conflicts --------- Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-07-04 11:21:46 -07:00
Leo	c2f557edde	refactor: multiple code improvements (#2771 )	2024-07-04 18:51:22 +08:00
Graham Neubig	ffd3c7144c	Remove global args (#2760 ) * Remove global args * Remove global args * Update files * Update main * Bug fixes * Fix logging	2024-07-03 20:07:52 +09:00
Xingyao Wang	41ddba84bd	[Agent] (Potentially) improve Editing using `diff` (#2685 ) * add replace-based block edit & preliminary test case fix * further fix the insert behavior * make edit only work on first occurence * bump codeact version since we now use new edit agentskills * update prompt for new agentskills * update integration tests * make run_infer.sh executable * remove code block for edit_file * update integration test for prompt changes * default to not use hint for eval * fix insert emptyfile bug * throw value error when `to_replace` is empty * make `_edit_or_insert_file` return string so we can try to fix some linter errors (best attempt) * add todo * update integration test * fix sandbox test for this PR	2024-07-02 11:50:15 +09:00

1 2 3 4

177 Commits