OpenHands

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-04-29 03:00:45 -04:00

Author	SHA1	Message	Date
Graham Neubig	7d331acffa	Handle error observations in codeact (#3383 ) * Handle error observations in codeact * Remove comments	2024-08-14 13:47:31 +00:00
Engel Nyst	9ed95abf83	Fix max budget per task error in headless mode (#3147 ) * set agent in ERROR instead of PAUSED when in headless mode * fallback to config value for budget	2024-07-27 17:35:40 +00:00
Engel Nyst	f07280153a	restore logging of user messages when using cli (#3145 )	2024-07-27 12:58:23 +00:00
Graham Neubig	4099e48122	Removed config from agent controller (#3038 ) * Removed config from agent controller * Fix tests * Increase budget * Update tests * Update prompts * Add missing prompt * Fix mistaken deletions * Fix browsing test * Fixed browse tests	2024-07-22 17:42:57 +00:00
Boxuan Li	be6e6e3add	Bug fix: Metrics not accumulated across agent delegation (#3012 ) * Add test to reproduce cost miscalculation bug * Fix metrics bug * Copy metrics upon AgentRejectAction	2024-07-20 04:05:05 +00:00
Xingyao Wang	6b16a5da0b	[Eval,Arch] Update GPTQ eval and add `headless_mode` for Controller (#2994 ) * update and polish gptq eval * fix typo * Update evaluation/gpqa/README.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/gpqa/run_infer.py Co-authored-by: Graham Neubig <neubig@gmail.com> * add headless mode to all appropriate agent controller call * delegate set to error when in headless mode * try to deduplicate a bit * make headless_mode default to True and only change it to false for AgentSession --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-20 03:35:48 +00:00
Graham Neubig	dada004fac	Remove config from files (#3039 )	2024-07-19 23:20:44 -04:00
Boxuan Li	9d41314d1a	State: Add local_iteration attribute (#2990 ) * Add local_iteration state attribute * Fix typos --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-07-18 14:49:19 +00:00
Graham Neubig	c897791024	Refactor LLM config (#2953 ) * Add max_message_chars to LLM * Refactor LLM config * Fix tests * Made some functions class functions * Fix regression * Fixed comments	2024-07-17 09:16:04 -04:00
Anush Kumar V	8f76587e5c	docs: updated docstrings using ruff's autofix feature (#2923 ) * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated docstrings using ruff's autfix feature * Deleted opendevin/runtime/utils/soource.py, Keeping in sync with main --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-16 01:35:33 +00:00
adragos	5f61885e44	feat: Implement user confirmation mode, request confirmation when running bash/python code in this mode (#2774 ) * [feat] confirmation mode for bash actions * feat: Add modal setting for Confirmation Mode * fix: frontend tests for confirmation mode switch * fix: add missing CONFIRMATION_MODE value in SettingsModal.test.tsx * fix: update test to integrate new setting * feat: Implement user confirmation for running bash/python code * fix: don't display rejected actions * fix: linting, rename/refactor based on feedback * fix: add property only to commands, pass serialization tests * fix: package-lock.json, lint test_action_serialization.py * test: add is_confirmed to integration test outputs --------- Co-authored-by: Mislav Balunovic <mislav.balunovic@gmail.com>	2024-07-11 14:57:21 +03:00
Boxuan Li	c68478f470	Customize LLM config per agent (#2756 ) Currently, OpenDevin uses a global singleton LLM config and a global singleton agent config. This PR allows customers to configure an LLM config for each agent. A hypothetically useful scenario is to use a cheaper LLM for repo exploration / code search, and a more powerful LLM to actually do the problem solving (CodeActAgent). Partially solves #2075 (web GUI improvement is not the goal of this PR)	2024-07-09 22:05:54 -07:00
Engel Nyst	2df1d67007	History clean up (#2849 ) * clean up add_history * refactor last agent message	2024-07-08 05:10:21 +02:00
Engel Nyst	d37b2973b2	Refactoring: event stream based agent history (#2709 ) * add to event stream sync * remove async from tests * small logging spam fix * remove swe agent * arch refactoring: use history from the event stream * refactor agents * monologue agent * ruff * planner agent * micro-agents * refactor history in evaluations * evals history refactoring * adapt evals and tests * unit testing stuck * testing micro agents, event stream * fix planner agent * fix tests * fix stuck after rename * fix test * small clean up * fix merge * fix merge issue * fix integration tests * Update agenthub/dummy_agent/agent.py * fix tests * rename more clearly; add todo; clean up	2024-07-07 21:04:23 +00:00
மனோஜ்குமார் பழனிச்சாமி	34c765688b	Streamline Logging Events (#2532 ) * Skip duplicate log * log user actions * fix tests * log all action _step * refactor log * revert test * refactor log * visual diff * disable overriding event source * Revert "disable overriding event source" This reverts commit `b0047cc0cd`. * Refactor logic * refactored runtime on_event * fix merge conflict in Web UI, it shows as red color (seems deletion but added) * linted --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-07-08 05:34:47 +09:00
Xingyao Wang	a47713ecb0	[Arch] Remove supports for Background Commands (#2803 ) * depracting docker exec box * remove doc exec from workflow and docs * remove background commands * Update tests/unit/test_sandbox.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * replace for-loop with assignment * fix integration tests * fix integration tests for shell script * fix integration tests * increase max iter to fix some monologue agent issue * fix integration test again * fix integration tests (seems related to run_user issue) --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-06 03:38:05 +08:00
Engel Nyst	0b8d357bef	Add event synchronously (#2700 ) * add to event stream sync * remove async from tests	2024-07-05 00:15:51 +02:00
Leo	c2f557edde	refactor: multiple code improvements (#2771 )	2024-07-04 18:51:22 +08:00
Graham Neubig	ffd3c7144c	Remove global args (#2760 ) * Remove global args * Remove global args * Update files * Update main * Bug fixes * Fix logging	2024-07-03 20:07:52 +09:00
Leo	5e6fb6131f	refactor: Renamed variables to resolve naming conflicts and eliminate warnings (#2732 ) * refactor: Renamed variables to resolve naming conflicts and eliminate warnings Signed-off-by: ifuryst <ifuryst@gmail.com> * Fix lint failed. Signed-off-by: ifuryst <ifuryst@gmail.com> * Combine set_initial_state methods, rename _filed to f, and adjust the AppConfig update codes. Signed-off-by: ifuryst <ifuryst@gmail.com> --------- Signed-off-by: ifuryst <ifuryst@gmail.com>	2024-07-02 15:00:58 +08:00
Boxuan Li	8dae1f9307	Bypass MAX_ITERATIONS and MAX_BUDGET_PER_TASK on web GUI (#2697 ) Closes #1493 Introduced TRAFFIC_CONTROL_STATE to allow OpenDevin to switch between normal traffic limiting mode and temporarily disabled mode.	2024-06-30 13:19:45 -07:00
Engel Nyst	2d9bb56763	Add ability to restore the cli session (optional) (#2699 ) * add ability to restore the main session * add quick log * rename to cli session	2024-06-30 06:56:55 +00:00
Engel Nyst	4b1cc56682	Sync history to stream (#2640 ) * add event to stream before budget check * make the budget check before the step * Update opendevin/controller/agent_controller.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-29 21:19:00 +00:00
Boxuan Li	e45b311c35	Remove MAX_CHARS traffic control (#2694 ) * Remove MAX_CHARS limiting * More cleanup	2024-06-29 12:59:41 -07:00
Boxuan Li	7766a3283e	CodeActAgent: Fix delegate history (#2672 )	2024-06-28 16:37:23 +09:00
மனோஜ்குமார் பழனிச்சாமி	af9385322b	Refactor: Simplify message formatting (#2670 ) Removed redundant `str()` conversion in f-string.	2024-06-28 07:34:26 +02:00
Engel Nyst	58b06cced7	Revert "Show relevant error in UI (#2516 )" (#2657 ) This reverts commit `d0bdae232f`.	2024-06-27 08:55:41 +00:00
Boxuan Li	ee86d8d25e	Frontend support for delegation and rejection (#2608 ) 1. Add support for rejection action on frontend 2. Show users the reason for rejection 3. Get rid of weird empty box after delegation 4. On web GUI, show customer when a delegation starts and ends	2024-06-26 00:30:10 -07:00
Boxuan Li	7e78fde48f	Bug fix: add error observation to history (#2610 ) * Bug fix: add error observation to history * Regenerate to demonstrate format error	2024-06-24 21:24:17 -07:00
Boxuan Li	39d90c0b2a	Track metrics throughout delegation & Polish UX for out of budget error (#2595 ) * Track metrics (costs) throught delegation * Metrics should be shared across agents for better UX * Update cost before starting delegate	2024-06-23 18:38:52 -07:00
மனோஜ்குமார் பழனிச்சாமி	d0bdae232f	Show relevant error in UI (#2516 )	2024-06-19 15:58:48 +05:30
Engel Nyst	b2307db010	Document, rename Agent* exceptions to LLM* (#2508 ) * rename "Agent" exceptions to LLM, document LLMResponseError	2024-06-18 22:30:22 +00:00
Engel Nyst	bb4ea1e6cb	Adjust is-stuck check for the same steps to 3 until it's stopped (#2437 )	2024-06-14 19:20:12 +05:30
Boxuan Li	a9a2f10170	Revamp AgentRejectAction and allow ManagerAgent to handle rejection (#1735 ) * Fix AgentRejectAction handling * Add ManagerAgent to integration tests * Fix regenerate.sh * Fix merge * Update README for micro-agents * Add test reject to regenerate.sh * regenerate.sh: Add support for running a specific test and/or agent * Refine reject schema, and allow ManagerAgent to handle reject * Add test artifacts for test_simple_task_rejection * Fix manager agent tests * Fix README * test_simple_task_rejection: check final agent state * Integration test: exit if mock prompt not found * Update test_simple_task_rejection tests * Fix test_edits test artifacts after prompt update * Fix ManagerAgent test_edits * WIP * Fix tests * update test_edits for ManagerAgent * Skip local sandbox for reject test * Fix test comparison	2024-06-08 23:12:30 -07:00
Aaron Xia	b1ec8e5dc2	style: Update agent_controller.py to clean log (#2124 )	2024-05-29 18:56:11 -07:00
Boxuan Li	9b371b1b5f	Refactor agent delegation and tweak micro agents (#1910 ) This PR fixes #1897. In addition, this PR fixes and tweaks a few micro-agents. For the first time, I am able to use ManagerAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds ManagerAgent as part of integration tests. test_write_simple_script involves delegation to CoderAgent while test_edits involves delegation to TypoFixerAgent. Also for the first time, I am able to use DelegateAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds DelegateAgent as part of integration tests. It involves delegation to StudyRepoForTaskAgent, CoderAgent and VerifierAgent. This PR is a blocker for #1735 and likely #1945.	2024-05-28 20:01:16 -07:00
Aleksandar	18d07bda89	feat: add max_budget_per_task configuration to control task cost (#2070 ) * feat: add max_budget_per_task configuration to control task cost * Fix test_arg_parser.py * Use the config.max_budget_per_task as default value * Add max_budget_per_task to core/main.py as well * Update opendevin/controller/agent_controller.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-27 02:04:31 +08:00
Engel Nyst	783fea62a0	Ignore pid for loop detection (Was: override eq...) (#2045 ) * rewrite, implement pid ignore in the controller * make the helper method private	2024-05-26 19:27:12 +02:00
Xingyao Wang	e731048ccf	Improve action and observation logging for the CLI interface (#2035 ) * properly log user messages; format browser action/obs, summarize action, messages properly for logging * add source to message * add spaces for printing	2024-05-24 08:21:25 -04:00
Robert Brennan	ea9c785075	fix session state after resuming (#1999 ) * fix state resuming * fix session reconnection * fix lint	2024-05-23 11:47:36 -04:00
Engel Nyst	0eccf31604	Refactor monologue and SWE agent to use the messages in state history (#1863 ) * Refactor monologue to use the messages in state history * add messages, clean up * fix monologue * update integration tests * move private method * update SWE agent to use the history from State * integration tests for SWE agent * rename monologue to initial_thoughts, since that is what it is	2024-05-23 07:29:12 +00:00
Robert Brennan	5bdacf738d	Refactor session management (#1810 ) * refactor session mgmt * defer file handling to runtime * add todo * refactor sessions a bit more * remove messages logic from FE * fix up socket handshake * refactor frontend auth a bit * first pass at redoing file explorer * implement directory suffix * fix up file tree * close agent on websocket close * remove session saving * move file refresh * remove getWorkspace * plumb path/code differently * fix build issues * fix the tests * fix npm build * add session rehydration * fix event serialization * logspam * fix user message rehydration * add get_event fn * agent state restoration * change history tracking for codeact * fix responsiveness of init * fix lint * lint * delint * fix prop * update tests * logspam * lint * fix test * revert codeact * change fileService to use API * fix up session loading * delint * delint * fix integration tests * revert test * fix up access to options endpoints * fix initial files load * delint * fix file initialization * fix mock server * fixl int * fix auth for html * Update frontend/src/i18n/translation.json Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor sessions and sockets * avoid reinitializing the same session * fix reconnect issue * change up intro message * more guards on reinit * rename agent_session * delint * fix a bunch of tests * delint * fix last test * remove code editor context * fix build * fix any * fix dot notation * Update frontend/src/services/api.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix up error handling * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update frontend/src/services/session.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix build errs * fix else * add closed state * delint * Update opendevin/server/session/session.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-22 18:33:16 +00:00
Yufan Song	d18e6c85a0	feat: add metrics related to cost for better observability (#1944 ) * add metrics for total_cost * make lint * refact codeact * change metrics into llm * add costs list, add into state * refactor log completion * refactor and test others * make lint * Update opendevin/core/metrics.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/llm/llm.py Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor * add code --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-22 08:53:31 +00:00
Frank Xu	1fe290adf9	[Feat] A competitive Web Browsing agent (#1856 ) * initial attempt at a browsing only agent * add browsing agent * update * implement agent * update * fix comments * remove unnecessary things from memory extras * update image processing --------- Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-05-21 19:20:33 +00:00
Engel Nyst	1e51bb9276	Fix/update controller is_stuck() (#1891 ) * Refactor monologue to use the messages in state history remove now unused method * is_stuck update * fix is_stuck * unit tests * fix tests * Revert "Refactor monologue to use the messages in state history" This reverts commit `76b4b765ef`. * Override eq for CmdOutputObservation to ignore the pid, compare the actual command only * Revert "Override eq for CmdOutputObservation to ignore the pid, compare the actual command only" This reverts commit `6418d856b5`.	2024-05-21 22:56:59 +08:00
மனோஜ்குமார் பழனிச்சாமி	4612e107c9	fix: Handle invalid exit code conversion (#1915 )	2024-05-20 11:30:52 -04:00
Xingyao Wang	7817d4c94f	fix(controller): Improve error info logging (#1864 ) * improve error info logging * Move assignment of self.state.error to report_error function * only log exception to state, but not to user --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-20 18:42:38 +08:00
Xingyao Wang	b2fdb963b6	Add detailed tutorial for adding new evaluation benchmarks (#1827 ) * Add detailed tutorial for adding new evaluation benchmarks * update tutorial, fix typo, and log observation to the cmdline * fix url * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/TUTORIAL.md Co-authored-by: Graham Neubig <neubig@gmail.com> * simplify readme and add comments to the actual code * Fix typo in evaluation/TUTORIAL.md * Fix typo in evaluation/swe_bench/run_infer.py * Fix another typo in evaluation/swe_bench/run_infer.py * Update TUTORIAL.md * Set host net work to false for SWEBench * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update evaluation/TUTORIAL.md Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> --------- Co-authored-by: OpenDevin <opendevin@opendevin.ai> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-18 13:40:53 -04:00
Boxuan Li	6714000b2c	CodeActAgent: Fix iteration reminder (#1803 ) This PR includes three changes: 1) Iteration reminder should start with MAX_ITERATIONS from config rather than default value 100 2) In the first prompt, we should tell the LLM it has `MAX_ITERATIONS - 1` turns left, rather than `MAX_ITERATIONS - 2` 3) Remove legacy ITERATION_REMINDER config	2024-05-15 13:48:47 +08:00
Xingyao Wang	d1fd277ad4	Support return final task states for evaluation (#1755 ) * support returning states at the end of controller * remove return None * fix issue of overriding final state * return the final state on close * merge AgentState with State * fix integration test * add ChangeAgentStateAction to history in attempt to fix integration * add back set agent state * update tests * update tests * directly return get state * add back the missing .close() * Update typo in opendevin/core/main.py --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-15 03:43:01 +00:00

1 2 3

104 Commits