OpenHands

mirror of https://github.com/All-Hands-AI/OpenHands.git synced 2026-04-29 03:00:45 -04:00

Author	SHA1	Message	Date
Yufan Song	1bdf8752e6	remote useless (#2332 )	2024-06-08 19:04:43 +00:00
Yufan Song	06a6ffcb09	feat: revert hiden special paths change in file action (#2328 ) * revert change in file action * remove useless code * make lint	2024-06-08 12:12:52 +00:00
Xingyao Wang	903381f16e	Add back jupyter PWD env var for agentskills (#2327 ) * add back jupyter pwd env var for agentskills * add unit test for pwd change in execute_cli	2024-06-08 08:51:42 +00:00
tobitege	5e42f140cb	fix: hide special paths; sort models (#2325 )	2024-06-08 02:13:11 +00:00
tobitege	b431fce938	tests: more Agentskills tests; updated .gitignore (#2307 ) * added tests related to backticks * updated .gitignore * added extra linter test for #2210 * hotfix for integration test --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-06-07 16:29:03 +00:00
Frank Xu	48151bdbb0	[feat] WebArena benchmark, MiniWoB++ benchmark and related arch changes (#2170 ) * add webarena, and revamp messaging for webarena eval * add changes for browsergym * update infer script * fix unit tests * update * add multiple run for miniwob * update instruction, remove personal path * update * add code for getting final reward, fix integration, add results * add avg cost calculation	2024-06-06 09:01:20 +08:00
மனோஜ்குமார் பழனிச்சாமி	2ffd54d258	fixed output logging (#2244 ) Co-authored-by: Leo <ifuryst@gmail.com>	2024-06-04 16:05:23 +00:00
மனோஜ்குமார் பழனிச்சாமி	4e479038f9	Bugfix by added config to disable plugin initialization for Persistent sandbox (#2179 ) * refactored source bashrc logic * added initialize_plugins config --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-06-04 10:59:30 -04:00
Leo	759f76fab5	Fix: Properly close Docker client in DockerExecBox to prevent resource leakage (#2224 )	2024-06-04 09:05:41 +08:00
Boxuan Li	1adbec6757	ssh_box: Fix Docker descriptor leak (#2212 )	2024-06-03 01:22:30 +00:00
Boxuan Li	6fd8e8d5b8	Fix file descriptor leaks in agentskills (#2209 )	2024-06-03 09:11:10 +08:00
Boxuan Li	399e6fb1d1	ssh_box: Close containers before throwing exception (#2206 )	2024-06-02 20:13:44 +00:00
RainRat	ed6dcc8381	fix typos (#2187 ) * fix typos no functional change * fix typos	2024-06-01 20:40:30 +00:00
மனோஜ்குமார் பழனிச்சாமி	4ece6fb3cc	Auto started persistent container (#2151 )	2024-06-01 14:46:41 +00:00
மனோஜ்குமார் பழனிச்சாமி	f9c7c3a520	Refactored logging (#2159 )	2024-06-01 14:31:35 +00:00
மனோஜ்குமார் பழனிச்சாமி	aee3d506e6	Restricted persistent sandbox to opendevin user only (#2177 )	2024-06-01 14:18:03 +00:00
Binyuan Hui	46dcf4bb3e	Support BIRD benchmark (#2117 ) * update: change timeout from 10 to 30 * update: readme for bird evaluation * Update evaluation/bird/run_infer.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update evaluation/bird/README.md Co-authored-by: Shimada666 <649940882@qq.com> * Update evaluation/bird/README.md Co-authored-by: Shimada666 <649940882@qq.com> * Update evaluation/bird/run_infer.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Shimada666 <649940882@qq.com> Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-06-01 11:34:36 +00:00
Leo	78e003caf6	Fix: Avoid bash backtick eval in runtime commands. (#2180 ) Signed-off-by: ifuryst <ifuryst@gmail.com>	2024-06-01 19:19:15 +08:00
மனோஜ்குமார் பழனிச்சாமி	04d7354501	Detailed logs for ssh_box (#2173 )	2024-06-01 11:40:22 +05:30
Boxuan Li	06e45afc75	Fix ssh box hung issue (#2172 ) Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>	2024-06-01 05:31:32 +00:00
மனோஜ்குமார் பழனிச்சாமி	3a4dc5c68c	Initialized plugins only once for persistent sandboxes (#2162 )	2024-06-01 10:46:09 +05:30
Aaron Xia	42c6b506b5	Lazy launching BrowseEnv / making BrowseEnv optional (#2155 ) * feat: lazy launching browser; browser optional for diffrent agents. * style: lint * fix: integration test fail due to browser not started. * fix: run by cli and integration test failed. * fix: lint * fix: lint --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-05-31 16:40:42 -04:00
மனோஜ்குமார் பழனிச்சாமி	8413f147c9	Added logs (#2153 ) * Logged about config file * Logged Browser env * Update opendevin/core/config.py Co-authored-by: Aleksandar <isavitaisa@gmail.com> * Update opendevin/core/config.py Co-authored-by: Aleksandar <isavitaisa@gmail.com> --------- Co-authored-by: Aleksandar <isavitaisa@gmail.com>	2024-05-31 16:04:36 -04:00
மனோஜ்குமார் பழனிச்சாமி	961c96a2a1	Added ssh_password to config setup (#2139 ) Co-authored-by: Aleksandar <isavitaisa@gmail.com>	2024-05-31 07:26:16 +05:30
Xingyao Wang	01ef90205d	Add CodeActSWEAgent to remove browsing & github + improvements on agentskills (#2105 ) * update swe_bench prompt; use minimal prompt for codeact; * upgrade agentskills and update testcases * update infer prompt * fix cwd * add icl for swebench * also log in_context_example to run infer * remove extra print * change prompt to abs path * update error message to include current file info * change cwd for jupyter if needed * update edit error message * update prompt * improve git get patch * update hint string * default to 50 turns * revert changes from codeact agent and create new CodeActSWEAgent * revert changes to codeact * revert instructions for run infer * revert instructions for run infer * update README * update max iter * add codeact swe agent * fix issue for CodeActSWEAgent * allow specifying max iter in cmdline script * stop printing * Update agenthub/codeact_swe_agent/README.md Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> * Fix prompt regression in jupyter plugin --------- Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-29 21:19:00 -07:00
மனோஜ்குமார் பழனிச்சாமி	d4ccd48af8	Persistent docker session (#1998 ) Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-05-29 13:22:34 +00:00
Xingyao Wang	5114230e53	Some SWE-Bench infer fixes and improvements (#2065 ) * reset workspace base properly * support running without hint * support running without hint * bump swe-bench eval docker to v1.2 for latest agentskills * only give hint when use hint text is trie * add swe-agent instructions for validation * update dockerfile * pin the python interpreter for execute_cli * avoid initialize plugins twice * default to use hint * save results to swe_bench_lite * unset gh token and increase max iter to 50 * remove printing of use hint status * refractor ssh login into one function * ok drop to 30 turns bc it is so expensive :( * remove reproduce comments to avoid stuck	2024-05-26 10:02:11 +00:00
Xingyao Wang	a6b3ce866d	refractor ssh login into one function (#2066 )	2024-05-26 08:56:13 +00:00
Shimada666	be1c2ad60d	feat: use retry decorator instead of retrying in a loop (#2058 ) * feat: use retry decorator instead of retrying in a loop * update code logic * update poetry lock	2024-05-25 16:04:40 +00:00
Xingyao Wang	ec68af5b83	fix the openai_api_key detected by agentskills (#2052 )	2024-05-25 22:09:07 +08:00
Xingyao Wang	221035d39a	Add retry logic to ssh login (#2053 ) * add retry logic to ssh login * Update opendevin/runtime/docker/ssh_box.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-25 12:16:24 +00:00
Shimada666	b31f7701eb	Integrate Multimodal tools to `agentskills`. (#2016 ) * suport reading multimodal files * move file * update dependency * remove useless pip install * add comments * update the comment * Apply suggestions from code review * Add unit test for TXTReader * pre-commit hook corrupted utf16 test txt * Revert unnecessary dependency upgrades * feat: import some readers for agentskill * add dependencies * Integrate some multimodal tools * add shell pip dependency * update dependencies * update dependencies * update print window * remove __main__ * locally import cv2 * add c library for opencv * update lock file * update prompt * remove unuseful file * add some unittest * add unittest & remove excel-related parser * rollback poetry lock * remove markdown * remove requests * optimize parse_video output * Fix integration tests for CodeActAgent * remove test_parse_image unittest * Add a TODO to containers/sandbox/Dockerfile * update dependencies * remove pyproject.toml useless package * change document via openai key * Fix prompts after removing some actions --------- Co-authored-by: Mingchen Zhuge <mczhuge@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Mingchen Zhuge <64179323+mczhuge@users.noreply.github.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-25 18:58:49 +08:00
Boxuan Li	91f313c914	BrowserEnv: init exception handling (#2050 ) * BrowserEnv: init exception handling * Revert irrelevant changes * Remove type ignore	2024-05-25 00:17:25 -07:00
மனோஜ்குமார் பழனிச்சாமி	cfae6821fa	refactored timeout (#2044 )	2024-05-24 18:19:14 +02:00
Boxuan Li	c59bcbbffd	Minor docstring & prompt fixes for AgentSkills (#2028 ) * A few minor fixes to agentskills * Regenerate prompts * Remove redundant comment	2024-05-24 13:30:48 +08:00
Xingyao Wang	cbf4c4b4c4	fix ExceptionPxssh (#2023 ) Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-23 21:24:21 -07:00
Xingyao Wang	602ffcdffb	Implement `agentskills` for OpenDevin to helpfully improve edit AND including more useful tools/skills (#1941 ) * add draft for skills * Implement and test agentskills functions: open_file, goto_line, scroll_down, scroll_up, create_file, search_dir, search_file, find_file * Remove new_sample.txt file * add some work from opendevin w/ fixes * Add unit tests for agentskills module * fix some issues and updated tests * add more tests for open * tweak and handle goto_line * add tests for some edge cases * add tests for scrolling * add tests for edit * add tests for search_dir * update tests to use pytest * use pytest --forked to avoid file op unit tests to interfere with each other via global var * update doc based on swe agent tool * update and add tests for find_file and search_file * move agent_skills to plugins * add agentskills as plugin and docs * add agentskill to ssh box and fix sandbox integration * remove extra returns in doc * add agentskills to initial tool for jupyter * support re-init jupyter kernel (for agentskills) after restart * fix print window's issue with indentation and add testcases * add prompt for codeact with the newest edit primitives * modify the way line number is presented (remove leading space) * change prompt to the newest display format * support tracking of costs via metrics * Update opendevin/runtime/plugins/agent_skills/README.md * Update opendevin/runtime/plugins/agent_skills/README.md * implement and add tests for py linting * remove extra text arg for incompatible subprocess ver * remove sample.txt * update test_edits integration tests * fix all integration * Update opendevin/runtime/plugins/agent_skills/README.md * Update opendevin/runtime/plugins/agent_skills/README.md * Update opendevin/runtime/plugins/agent_skills/README.md * Update agenthub/codeact_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/codeact_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/codeact_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/runtime/plugins/agent_skills/agentskills.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * correctly setup plugins for swebench eval * bump swe-bench version and add logging * correctly setup plugins for swebench eval * bump swe-bench version and add logging * Revert "correctly setup plugins for swebench eval" This reverts commit `2bd1055673`. * bump version * remove _AGENT_SKILLS_DOCS * move flake8 to test dep * update poetry.lock * remove extra arg * reduce max iter for eval * update poetry * fix integration tests --------- Co-authored-by: OpenDevin <opendevin@opendevin.ai> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-23 16:04:09 +00:00
Engel Nyst	0eccf31604	Refactor monologue and SWE agent to use the messages in state history (#1863 ) * Refactor monologue to use the messages in state history * add messages, clean up * fix monologue * update integration tests * move private method * update SWE agent to use the history from State * integration tests for SWE agent * rename monologue to initial_thoughts, since that is what it is	2024-05-23 07:29:12 +00:00
Robert Brennan	5bdacf738d	Refactor session management (#1810 ) * refactor session mgmt * defer file handling to runtime * add todo * refactor sessions a bit more * remove messages logic from FE * fix up socket handshake * refactor frontend auth a bit * first pass at redoing file explorer * implement directory suffix * fix up file tree * close agent on websocket close * remove session saving * move file refresh * remove getWorkspace * plumb path/code differently * fix build issues * fix the tests * fix npm build * add session rehydration * fix event serialization * logspam * fix user message rehydration * add get_event fn * agent state restoration * change history tracking for codeact * fix responsiveness of init * fix lint * lint * delint * fix prop * update tests * logspam * lint * fix test * revert codeact * change fileService to use API * fix up session loading * delint * delint * fix integration tests * revert test * fix up access to options endpoints * fix initial files load * delint * fix file initialization * fix mock server * fixl int * fix auth for html * Update frontend/src/i18n/translation.json Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor sessions and sockets * avoid reinitializing the same session * fix reconnect issue * change up intro message * more guards on reinit * rename agent_session * delint * fix a bunch of tests * delint * fix last test * remove code editor context * fix build * fix any * fix dot notation * Update frontend/src/services/api.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix up error handling * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/server/session/agent.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update frontend/src/services/session.ts Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix build errs * fix else * add closed state * delint * Update opendevin/server/session/session.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-05-22 18:33:16 +00:00
Frank Xu	1fe290adf9	[Feat] A competitive Web Browsing agent (#1856 ) * initial attempt at a browsing only agent * add browsing agent * update * implement agent * update * fix comments * remove unnecessary things from memory extras * update image processing --------- Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com>	2024-05-21 19:20:33 +00:00
மனோஜ்குமார் பழனிச்சாமி	d76c425b76	Refactored Logs (#1939 )	2024-05-22 03:06:13 +08:00
RainRat	43c187b949	fix typos (#1956 ) no functional change	2024-05-21 19:00:48 +00:00
Boxuan Li	99651e3249	Fix browser_env hung bug (#1933 ) * Fix browser_env hang bug * Send SIGKILL if SIGTERM doesn't work	2024-05-20 19:31:36 -07:00
மனோஜ்குமார் பழனிச்சாமி	4612e107c9	fix: Handle invalid exit code conversion (#1915 )	2024-05-20 11:30:52 -04:00
Robert Brennan	0ecba83e53	Move message history out of CodeAct (#1847 ) * stop keeping history state in codeact * regenerate tests * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * revert tests * regen tests * refactor codeact a bit * regenerate without using LLM * simplify logic * change to heredoc * fix heredoc * fix end_of_edit docs * regen tests * regenerate --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-18 18:39:27 +00:00
Boxuan Li	0abc35cf57	ssh_box: Shutdown container when fail to start ssh session (#1872 )	2024-05-18 17:04:38 +08:00
Boxuan Li	a57a213c7c	Turn off auto linting by default, and on for swe_bench (#1861 ) Disable Python linting by default, and turn it on for SWE Bench. It is turned off by default since this behavior is weird and somewhat annoying to end users. It is turned on for SWE Bench because linting python files gives LLM a chance to fix the indentations.	2024-05-18 04:04:38 +00:00
Aleksandar	94a9ec76b0	Disable Python linting by default (fixes #1789 ) (#1794 ) * Disable Python linting by default (fixes #1789) * Try to simplify * Return do nothing comment * Disable linting for the javascript as well * Apply suggestions from code review --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-17 20:55:12 -07:00
மனோஜ்குமார் பழனிச்சாமி	b0b44ed467	Auto restarted Jupyter kernel (#1808 ) Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-18 08:40:31 +05:30
மனோஜ்குமார் பழனிச்சாமி	5b6f622dad	Update browser_env.py (#1779 ) Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-05-17 06:11:32 +05:30

1 2

85 Commits