OpenHands

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-04-29 03:00:45 -04:00

Author	SHA1	Message	Date
adragos	5f61885e44	feat: Implement user confirmation mode, request confirmation when running bash/python code in this mode (#2774 ) * [feat] confirmation mode for bash actions * feat: Add modal setting for Confirmation Mode * fix: frontend tests for confirmation mode switch * fix: add missing CONFIRMATION_MODE value in SettingsModal.test.tsx * fix: update test to integrate new setting * feat: Implement user confirmation for running bash/python code * fix: don't display rejected actions * fix: linting, rename/refactor based on feedback * fix: add property only to commands, pass serialization tests * fix: package-lock.json, lint test_action_serialization.py * test: add is_confirmed to integration test outputs --------- Co-authored-by: Mislav Balunovic <mislav.balunovic@gmail.com>	2024-07-11 14:57:21 +03:00
Boxuan Li	c68478f470	Customize LLM config per agent (#2756 ) Currently, OpenDevin uses a global singleton LLM config and a global singleton agent config. This PR allows customers to configure an LLM config for each agent. A hypothetically useful scenario is to use a cheaper LLM for repo exploration / code search, and a more powerful LLM to actually do the problem solving (CodeActAgent). Partially solves #2075 (web GUI improvement is not the goal of this PR)	2024-07-09 22:05:54 -07:00
Engel Nyst	2df1d67007	History clean up (#2849 ) * clean up add_history * refactor last agent message	2024-07-08 05:10:21 +02:00
Engel Nyst	d37b2973b2	Refactoring: event stream based agent history (#2709 ) * add to event stream sync * remove async from tests * small logging spam fix * remove swe agent * arch refactoring: use history from the event stream * refactor agents * monologue agent * ruff * planner agent * micro-agents * refactor history in evaluations * evals history refactoring * adapt evals and tests * unit testing stuck * testing micro agents, event stream * fix planner agent * fix tests * fix stuck after rename * fix test * small clean up * fix merge * fix merge issue * fix integration tests * Update agenthub/dummy_agent/agent.py * fix tests * rename more clearly; add todo; clean up	2024-07-07 21:04:23 +00:00
மனோஜ்குமார் பழனிச்சாமி	34c765688b	Streamline Logging Events (#2532 ) * Skip duplicate log * log user actions * fix tests * log all action _step * refactor log * revert test * refactor log * visual diff * disable overriding event source * Revert "disable overriding event source" This reverts commit `b0047cc0cd`. * Refactor logic * refactored runtime on_event * fix merge conflict in Web UI, it shows as red color (seems deletion but added) * linted --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-07-08 05:34:47 +09:00
மனோஜ்குமார் பழனிச்சாமி	adf1a0d556	Bugfix: add missing f-string for logging debug message in task creation (#2836 )	2024-07-07 17:36:19 +02:00
Xingyao Wang	a47713ecb0	[Arch] Remove supports for Background Commands (#2803 ) * depracting docker exec box * remove doc exec from workflow and docs * remove background commands * Update tests/unit/test_sandbox.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * replace for-loop with assignment * fix integration tests * fix integration tests for shell script * fix integration tests * increase max iter to fix some monologue agent issue * fix integration test again * fix integration tests (seems related to run_user issue) --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-06 03:38:05 +08:00
Engel Nyst	0b8d357bef	Add event synchronously (#2700 ) * add to event stream sync * remove async from tests	2024-07-05 00:15:51 +02:00
Leo	c2f557edde	refactor: multiple code improvements (#2771 )	2024-07-04 18:51:22 +08:00
Graham Neubig	ffd3c7144c	Remove global args (#2760 ) * Remove global args * Remove global args * Update files * Update main * Bug fixes * Fix logging	2024-07-03 20:07:52 +09:00
Leo	5e6fb6131f	refactor: Renamed variables to resolve naming conflicts and eliminate warnings (#2732 ) * refactor: Renamed variables to resolve naming conflicts and eliminate warnings Signed-off-by: ifuryst <ifuryst@gmail.com> * Fix lint failed. Signed-off-by: ifuryst <ifuryst@gmail.com> * Combine set_initial_state methods, rename _filed to f, and adjust the AppConfig update codes. Signed-off-by: ifuryst <ifuryst@gmail.com> --------- Signed-off-by: ifuryst <ifuryst@gmail.com>	2024-07-02 15:00:58 +08:00
Boxuan Li	8dae1f9307	Bypass MAX_ITERATIONS and MAX_BUDGET_PER_TASK on web GUI (#2697 ) Closes #1493 Introduced TRAFFIC_CONTROL_STATE to allow OpenDevin to switch between normal traffic limiting mode and temporarily disabled mode.	2024-06-30 13:19:45 -07:00
Engel Nyst	2d9bb56763	Add ability to restore the cli session (optional) (#2699 ) * add ability to restore the main session * add quick log * rename to cli session	2024-06-30 06:56:55 +00:00
Engel Nyst	4b1cc56682	Sync history to stream (#2640 ) * add event to stream before budget check * make the budget check before the step * Update opendevin/controller/agent_controller.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-29 21:19:00 +00:00
Boxuan Li	e45b311c35	Remove MAX_CHARS traffic control (#2694 ) * Remove MAX_CHARS limiting * More cleanup	2024-06-29 12:59:41 -07:00
Boxuan Li	7766a3283e	CodeActAgent: Fix delegate history (#2672 )	2024-06-28 16:37:23 +09:00
மனோஜ்குமார் பழனிச்சாமி	af9385322b	Refactor: Simplify message formatting (#2670 ) Removed redundant `str()` conversion in f-string.	2024-06-28 07:34:26 +02:00
Engel Nyst	58b06cced7	Revert "Show relevant error in UI (#2516 )" (#2657 ) This reverts commit `d0bdae232f`.	2024-06-27 08:55:41 +00:00
Boxuan Li	ee86d8d25e	Frontend support for delegation and rejection (#2608 ) 1. Add support for rejection action on frontend 2. Show users the reason for rejection 3. Get rid of weird empty box after delegation 4. On web GUI, show customer when a delegation starts and ends	2024-06-26 00:30:10 -07:00
Boxuan Li	7e78fde48f	Bug fix: add error observation to history (#2610 ) * Bug fix: add error observation to history * Regenerate to demonstrate format error	2024-06-24 21:24:17 -07:00
Boxuan Li	39d90c0b2a	Track metrics throughout delegation & Polish UX for out of budget error (#2595 ) * Track metrics (costs) throught delegation * Metrics should be shared across agents for better UX * Update cost before starting delegate	2024-06-23 18:38:52 -07:00
மனோஜ்குமார் பழனிச்சாமி	d0bdae232f	Show relevant error in UI (#2516 )	2024-06-19 15:58:48 +05:30
Engel Nyst	b2307db010	Document, rename Agent* exceptions to LLM* (#2508 ) * rename "Agent" exceptions to LLM, document LLMResponseError	2024-06-18 22:30:22 +00:00
Engel Nyst	bb4ea1e6cb	Adjust is-stuck check for the same steps to 3 until it's stopped (#2437 )	2024-06-14 19:20:12 +05:30
Yufan Song	f7491bd2fa	Refactor response to action in agent step (#2350 ) * refactor action parser * Fix typos * fix typo --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-10 10:17:30 +00:00
Boxuan Li	a9a2f10170	Revamp AgentRejectAction and allow ManagerAgent to handle rejection (#1735 ) * Fix AgentRejectAction handling * Add ManagerAgent to integration tests * Fix regenerate.sh * Fix merge * Update README for micro-agents * Add test reject to regenerate.sh * regenerate.sh: Add support for running a specific test and/or agent * Refine reject schema, and allow ManagerAgent to handle reject * Add test artifacts for test_simple_task_rejection * Fix manager agent tests * Fix README * test_simple_task_rejection: check final agent state * Integration test: exit if mock prompt not found * Update test_simple_task_rejection tests * Fix test_edits test artifacts after prompt update * Fix ManagerAgent test_edits * WIP * Fix tests * update test_edits for ManagerAgent * Skip local sandbox for reject test * Fix test comparison	2024-06-08 23:12:30 -07:00
Aaron Xia	42c6b506b5	Lazy launching BrowseEnv / making BrowseEnv optional (#2155 ) * feat: lazy launching browser; browser optional for diffrent agents. * style: lint * fix: integration test fail due to browser not started. * fix: run by cli and integration test failed. * fix: lint * fix: lint --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-05-31 16:40:42 -04:00
Aaron Xia	b1ec8e5dc2	style: Update agent_controller.py to clean log (#2124 )	2024-05-29 18:56:11 -07:00
Boxuan Li	9b371b1b5f	Refactor agent delegation and tweak micro agents (#1910 ) This PR fixes #1897. In addition, this PR fixes and tweaks a few micro-agents. For the first time, I am able to use ManagerAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds ManagerAgent as part of integration tests. test_write_simple_script involves delegation to CoderAgent while test_edits involves delegation to TypoFixerAgent. Also for the first time, I am able to use DelegateAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds DelegateAgent as part of integration tests. It involves delegation to StudyRepoForTaskAgent, CoderAgent and VerifierAgent. This PR is a blocker for #1735 and likely #1945.	2024-05-28 20:01:16 -07:00
Aleksandar	18d07bda89	feat: add max_budget_per_task configuration to control task cost (#2070 ) * feat: add max_budget_per_task configuration to control task cost * Fix test_arg_parser.py * Use the config.max_budget_per_task as default value * Add max_budget_per_task to core/main.py as well * Update opendevin/controller/agent_controller.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-27 02:04:31 +08:00
Engel Nyst	783fea62a0	Ignore pid for loop detection (Was: override eq...) (#2045 ) * rewrite, implement pid ignore in the controller * make the helper method private	2024-05-26 19:27:12 +02:00
Xingyao Wang	e731048ccf	Improve action and observation logging for the CLI interface (#2035 ) * properly log user messages; format browser action/obs, summarize action, messages properly for logging * add source to message * add spaces for printing	2024-05-24 08:21:25 -04:00
Robert Brennan	ea9c785075	fix session state after resuming (#1999 ) * fix state resuming * fix session reconnection * fix lint	2024-05-23 11:47:36 -04:00
Engel Nyst	0eccf31604	Refactor monologue and SWE agent to use the messages in state history (#1863 ) * Refactor monologue to use the messages in state history * add messages, clean up * fix monologue * update integration tests * move private method * update SWE agent to use the history from State * integration tests for SWE agent * rename monologue to initial_thoughts, since that is what it is	2024-05-23 07:29:12 +00:00
Boxuan Li	acb430eef5	Refactor integration testing CI, add optional Mac tests, and mark a few agents as deprecated (#1888 ) * Add MacOS to integration tests * Switch back to python 3.11 * Install Docker for macos pipeline * regenerate.sh: Use environmental variable for sandbox type * Pack different agents' tests into a single check * Fix CodeAct tests * Reduce file match and extensive debug logs * Add TEST_IN_CI mode that reports codecov * Small fix: don't quit if reusing old responses failed * Merge codecov results * Fix typos * Remove coverage merge step - codecov automatically does that * Make mac integration tests as optional - too slow * Fix codecov args * Add comments in yaml * Include sandbox type in codecov report name * Fix codecov report merge * Revert renaming of test_matrix_success * Remove SWEAgent and PlannerAgent from tests * Mark planner agent and SWE agent as deprecated * CodeCov: Ignore planner and sweagent * Revert "Remove SWEAgent and PlannerAgent from tests" This reverts commit `040cb3bfb9`. * Remove all tests for SWE Agent * Only keep basic tests for MonologueAgent and PlannerAgent * Mark SWE Agent as deprecated, and ignore code coverage for it --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-22 20:38:57 -07:00
Engel Nyst	b9a5be2569	Add ruff for shared mutable defaults (B) (#1938 ) * Add ruff for shared mutable defaults (B) * Apply B006, B008 on current files, except fast API * Update agenthub/SWE_agent/prompts.py Co-authored-by: Graham Neubig <neubig@gmail.com> * fix unintended behavior change * this is correct, tell Ruff to leave it alone --------- Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-22 20:06:00 -07:00
Robert Brennan	5bdacf738d	Refactor session management (#1810 ) * refactor session mgmt * defer file handling to runtime * add todo * refactor sessions a bit more * remove messages logic from FE * fix up socket handshake * refactor frontend auth a bit * first pass at redoing file explorer * implement directory suffix * fix up file tree * close agent on websocket close * remove session saving * move file refresh * remove getWorkspace * plumb path/code differently * fix build issues * fix the tests * fix npm build * add session rehydration * fix event serialization * logspam * fix user message rehydration * add get_event fn * agent state restoration * change history tracking for codeact * fix responsiveness of init * fix lint * lint * delint * fix prop * update tests * logspam * lint * fix test * revert codeact * change fileService to use API * fix up session loading * delint * delint * fix integration tests * revert test * fix up access to options endpoints * fix initial files load * delint * fix file initialization * fix mock server * fixl int * fix auth for html * Update frontend/src/i18n/translation.json Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor sessions and sockets * avoid reinitializing the same session * fix reconnect issue * change up intro message * more guards on reinit * rename agent_session * delint * fix a bunch of tests * delint * fix last test * remove code editor context * fix build * fix any * fix dot notation * Update frontend/src/services/api.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix up error handling * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update frontend/src/services/session.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix build errs * fix else * add closed state * delint * Update opendevin/server/session/session.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-22 18:33:16 +00:00
Yufan Song	d18e6c85a0	feat: add metrics related to cost for better observability (#1944 ) * add metrics for total_cost * make lint * refact codeact * change metrics into llm * add costs list, add into state * refactor log completion * refactor and test others * make lint * Update opendevin/core/metrics.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/llm/llm.py Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor * add code --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-22 08:53:31 +00:00
Frank Xu	1fe290adf9	[Feat] A competitive Web Browsing agent (#1856 ) * initial attempt at a browsing only agent * add browsing agent * update * implement agent * update * fix comments * remove unnecessary things from memory extras * update image processing --------- Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-05-21 19:20:33 +00:00
Engel Nyst	1e51bb9276	Fix/update controller is_stuck() (#1891 ) * Refactor monologue to use the messages in state history remove now unused method * is_stuck update * fix is_stuck * unit tests * fix tests * Revert "Refactor monologue to use the messages in state history" This reverts commit `76b4b765ef`. * Override eq for CmdOutputObservation to ignore the pid, compare the actual command only * Revert "Override eq for CmdOutputObservation to ignore the pid, compare the actual command only" This reverts commit `6418d856b5`.	2024-05-21 22:56:59 +08:00
மனோஜ்குமார் பழனிச்சாமி	4612e107c9	fix: Handle invalid exit code conversion (#1915 )	2024-05-20 11:30:52 -04:00
Xingyao Wang	7817d4c94f	fix(controller): Improve error info logging (#1864 ) * improve error info logging * Move assignment of self.state.error to report_error function * only log exception to state, but not to user --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-20 18:42:38 +08:00
Xingyao Wang	b2fdb963b6	Add detailed tutorial for adding new evaluation benchmarks (#1827 ) * Add detailed tutorial for adding new evaluation benchmarks * update tutorial, fix typo, and log observation to the cmdline * fix url * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * simplify readme and add comments to the actual code * Fix typo in evaluation/TUTORIAL.md * Fix typo in evaluation/swe_bench/run_infer.py * Fix another typo in evaluation/swe_bench/run_infer.py * Update TUTORIAL.md * Set host net work to false for SWEBench * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: OpenDevin <opendevin@opendevin.ai> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-18 13:40:53 -04:00
Robert Brennan	110b878dd9	fix up serialization and deserialization of events (#1850 ) * fix up serialization and deserialization of events * fix tests * remove prints * fix test * regenerate tests * add try blocks	2024-05-17 01:09:15 +00:00
Boxuan Li	6714000b2c	CodeActAgent: Fix iteration reminder (#1803 ) This PR includes three changes: 1) Iteration reminder should start with MAX_ITERATIONS from config rather than default value 100 2) In the first prompt, we should tell the LLM it has `MAX_ITERATIONS - 1` turns left, rather than `MAX_ITERATIONS - 2` 3) Remove legacy ITERATION_REMINDER config	2024-05-15 13:48:47 +08:00
Xingyao Wang	d1fd277ad4	Support return final task states for evaluation (#1755 ) * support returning states at the end of controller * remove return None * fix issue of overriding final state * return the final state on close * merge AgentState with State * fix integration test * add ChangeAgentStateAction to history in attempt to fix integration * add back set agent state * update tests * update tests * directly return get state * add back the missing .close() * Update typo in opendevin/core/main.py --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-15 03:43:01 +00:00
Robert Brennan	beb74a19f6	Use event stream for the runtime (#1776 ) * rebuild PR from scratch * fix max_iter * regenerate tests * cut down on history * Update opendevin/controller/agent_controller.py * regenerate tests * revert swe agent * revert some codeact chagnes * regenerate tests * add source to dict * only add source if not none * try to fix coverage issue * lock * add gevent	2024-05-14 13:35:25 +00:00
Robert Brennan	82a798990c	refactor remind_iterations (#1760 ) * refactor remind_iterations * regenerate tests * concatenate iteration message * fix merge issues * update integration tests	2024-05-14 08:27:12 -04:00
Robert Brennan	b028bd46bb	Use messages to drive tasks (#1688 ) * finish is working * start reworking main_goal * remove main_goal from microagents * remove main_goal from other agents * fix issues * revert codeact line * make plan a subclass of task * fix frontend for new plan setup * lint * fix type * more lint * fix build issues * fix codeact mgs * fix edge case in regen script * fix task validation errors * regenerate integration tests * fix up tests * fix sweagent * revert codeact prompt * update integration tests * update integration tests * handle loading state * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/agent_controller.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/state/plan.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * update docs * regenerate tests * remove none from state type * revert test files * update integration tests * rename plan to root_task * revert plugin perms * regen integration tests * tweak integration script * prettier * fix test * set workspace up for regeneration * regenerate tests * Change directory of copy * Updated tests * Disable PlannerAgent test * Fix listen * Updated prompts * Disable planner again * Make codecov more lenient * Update agenthub/README.md * Update opendevin/server/README.md * re-enable planner tests * finish top level tasks * regen planner * fix root task factory --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-13 23:14:15 +00:00
Xingyao Wang	8bfae8413e	Support passing sandbox as argument and iteration reminder (#1730 ) * support custom sandbox; add iteration_reminder * Enable iteration reminder in CodeActAgent integration test * Don't remove numbers when comparing prompts * Update tests/integration/README.md --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-12 07:57:33 +00:00

1 2 3

116 Commits