蘑菇先生
5ef8bdfa35
EvoCUA Update (2025.01.05) ( #412 )
...
* evocua init
* setup max_token
* evocua update
---------
Co-authored-by: xuetaofeng <xuetaofeng@meituan.com>
Co-authored-by: Tianbao Xie <47296835+Timothyxxx@users.noreply.github.com>
2026-01-05 16:14:53 +08:00
Bowen Yang
439e178a2e
fix(os_symphony_evaluation) ( #410 )
...
* fix(os_symphony)
* Update desktop_env_os_symphony.py
* fix(os_symphony_desktop)
* fix(os_symphony_start)
* Add docstring to run_multienv_os_symphony.py
Added documentation header for the evaluation script.
2026-01-04 15:56:51 +08:00
Bowen Yang
951e1928c8
fix(desktop_os_symphony):support aws ( #406 )
...
* fix(os_symphony)
* Update desktop_env_os_symphony.py
2026-01-01 11:27:34 +08:00
Bowen Yang
02a35be067
fix(os_symphony) ( #405 )
2025-12-30 22:43:47 +08:00
Bowen Yang
662826f57e
fix(os_symphony):prompt ( #402 )
...
* add_os_symphony
* fix(os_symphony)
* fix(os_symphony):prompt
---------
Co-authored-by: Tianbao Xie <47296835+Timothyxxx@users.noreply.github.com>
2025-12-29 20:45:36 +08:00
xuetf
410ec63a89
Add EvoCUA Support ( #401 )
...
* evocua init
* setup max_token
---------
Co-authored-by: xuetaofeng <xuetaofeng@meituan.com>
Co-authored-by: Tianbao Xie <47296835+Timothyxxx@users.noreply.github.com>
2025-12-23 20:46:23 +08:00
Bowen Yang
031696e83c
fix os_symphony ( #400 )
...
* add_os_symphony
* fix(os_symphony)
---------
Co-authored-by: Tianbao Xie <47296835+Timothyxxx@users.noreply.github.com>
2025-12-23 20:45:30 +08:00
Bowen Yang
f593f35b1c
add_os_symphony ( #399 )
2025-12-23 14:30:44 +08:00
Ubuntu
ac31778ee3
Update: requirements.txt for seed agent
2025-12-15 11:47:56 +00:00
Ubuntu
60caa52fc4
Update: requirements.txt for seed agent
2025-12-15 11:47:40 +00:00
Ubuntu
41477a9c40
Update: seed agent
2025-12-15 11:45:57 +00:00
Ubuntu
78433ecfcf
Add agent: seed agent
2025-12-12 05:35:20 +00:00
Meshal Nayim
9540454b0a
Fix demo agent (PromptAgent) reset(): add vm_ip and kwargs for compatibility with lib_run_single.py ( #388 )
2025-12-09 15:59:25 +08:00
MillanK
cbc3b590ff
Task fix batch ( #383 )
...
* update 873cafdd-a581-47f6-8b33-b9696ddb7b05 task eval
* c1fa57f3-c3db-4596-8f09-020701085416 fix, add tolerance to url matching
* 8df7e444-8e06-4f93-8a1a-c5c974269d82 add a clearer instruction about the filename to use for compression
* add address string normalization for 6f4073b8-d8ea-4ade-8a18-c5d1d5d5aa9a
---------
Co-authored-by: Jiaqi <dengjiaqi@moonshot.cn>
2025-11-19 17:24:25 +08:00
Qichen Fu
903ed36715
Add Claude Sonnet 4.5 support and improve action handling ( #362 )
...
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-authored-by: Claude <noreply@anthropic.com>
2025-11-14 13:54:32 +08:00
Subash Shibu
3167339e45
Add hosted GBOX agent for OSWorld evaluation ( #376 )
2025-11-13 13:13:31 +08:00
Pengxiang-Li
00b6468eb7
feat/dart_gui ( #371 )
2025-11-07 21:50:01 +08:00
yiqilin
6d43dbc532
Update GIMP evaluation examples to replace local file paths with cloud file URLs for consistency and accessibility. ( #372 )
2025-11-07 21:49:49 +08:00
Timothyxxx
8365edc975
Add new section in README for OSWorld-MCP project
2025-10-30 06:06:48 +00:00
Daphne Barretto
21c2b7629b
Add consistent scores validation ( #368 )
...
* Add consistent scores validation
* revert osworld_run_maestro.py changes
2025-10-29 01:44:48 +08:00
Timothyxxx
3bf54c92a9
Merge branch 'main' of github.com:xlang-ai/OSWorld
2025-10-23 14:28:14 +08:00
Timothyxxx
a484f2e484
Update setup.py for version bump and dependency adjustments
...
- Bump version from 1.0.0 to 1.0.1
- Update numpy dependency to allow versions >=1.26 and <3
- Adjust pandas dependency to allow versions >=2.2 and <2.3
- Add new __init__.py file in the docker provider directory
2025-10-23 14:27:52 +08:00
Atharva Gundawar
9f97535ef9
osworld agent wrapper for trained v7 ( #360 )
2025-10-18 02:29:15 +08:00
ludunjie.ldj
afd29115da
support aliyun eval of qwen3vl
2025-10-16 16:20:54 +08:00
Dunjie Lu
55372c4432
Fix API base URLs for OpenAI and DashScope
...
Updated the base URLs for OpenAI and DashScope API calls.
2025-10-14 12:57:00 +08:00
Dunjie Lu
d25464c203
Djlu/qwen3vl dash ( #356 )
...
* support DashScope SDK to call qwen3-vl-plus
* support DashScope SDK to call qwen3-vl-plus
---------
Co-authored-by: Timothyxxx <Timothyxxx@users.noreply.github.com>
2025-10-13 16:31:06 +08:00
Xinyuan Wang
f9e9273b3b
OpenCUA-72B ( #354 )
...
* use aws pub ip
* os task fix: set the default dim screen time to be 300s
* OpenCUA-72B
* update password
* update
* update
* update opencua72b agent
* change provider ip
---------
Co-authored-by: Jiaqi <dengjiaqi@moonshot.cn>
2025-10-13 10:39:33 +08:00
Yan98
ddb8372a6c
init public release ( #350 )
2025-10-06 22:16:31 +08:00
eun2ce
5eff00a9e3
Fix #347 : Fix NameError in open_file timeout message ( #351 )
...
- Fix undefined 'timeout' variable in error message
- Use defined TIMEOUT constant instead of undefined timeout variable
- Prevents NameError when LibreOffice crashes during file opening
2025-10-06 22:14:15 +08:00
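The failure mode described in 5eff00a9e3 can be illustrated with a minimal sketch. This is not the repository's code: the function names `buggy_message` and `fixed_message` and the value assigned to `TIMEOUT` are hypothetical, chosen only to show how an error message that references an undefined local name raises `NameError`, and how pointing it at a defined constant avoids that.

```python
# Hypothetical value; the commit only states that a TIMEOUT constant is defined.
TIMEOUT = 30


def buggy_message(path: str) -> str:
    # `timeout` is not defined anywhere, so evaluating this f-string
    # raises NameError instead of producing the intended error text.
    return f"Failed to open {path} within {timeout} seconds"


def fixed_message(path: str) -> str:
    # Refers to the module-level constant that actually exists.
    return f"Failed to open {path} within {TIMEOUT} seconds"
```

The bug only surfaces on the error path (for example, when the application crashes while opening a file), which is presumably why it went unnoticed until that path was exercised.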
Timothyxxx
ff6285cfbb
Add safe browsing feature to Chrome evaluator
...
- Implemented `get_enable_safe_browsing` function to retrieve safe browsing settings based on the operating system.
- Updated the `__init__.py` to include the new function.
- Modified JSON examples to reflect the change from enabling enhanced safety browsing to enabling safe browsing.
- Added necessary commands in the JSON examples for setting up preferences for safe browsing.
2025-10-05 04:56:08 +00:00
Danyang Zhang
afd5952e44
ver Oct3rd ( #349 )
...
updated a series of instructions to ask the agent not to do any
unnecessary actions.
2025-10-04 00:13:29 +08:00
Timothyxxx
1572068035
Refactor evaluator functions in JSON examples to use URL pattern matching. Update expected URL formats to regex patterns for better validation in chrome evaluation examples.
2025-10-01 19:20:06 +00:00
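As a rough illustration of the URL pattern matching mentioned in 1572068035, the sketch below compares a visited URL against a regular expression rather than an exact string. The helper name `url_matches` and the example pattern are assumptions made for illustration; the actual evaluator functions and the JSON field names are not shown in this log.

```python
import re


def url_matches(actual_url: str, pattern: str) -> bool:
    """Return True if the whole URL matches the regex pattern (hypothetical helper)."""
    return re.fullmatch(pattern, actual_url) is not None


# Hypothetical pattern: require a search query, tolerate extra query parameters.
EXAMPLE_PATTERN = r"https://www\.example\.com/search\?q=[^&]+(&.*)?"
```

With a pattern like this, a URL carrying additional tracking or pagination parameters still passes, whereas an exact string comparison would reject it.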
Timothyxxx
9be518435c
Update GIMP evaluation examples to replace local file paths with cloud file URLs for consistency and accessibility.
2025-10-01 09:54:52 +00:00
Timothyxxx
bfb467da18
Merge branch 'main' of github.com:xlang-ai/OSWorld
2025-10-01 06:56:43 +00:00
Timothyxxx
4c685bed99
Update run_maestro.py to run in headless mode with a single environment and specify result directory. Adjust default TTL for AWS instances from 60 to 180 minutes in config.py. Enhance AWSProvider to handle missing security groups, subnet IDs, and instance types with fallbacks, and improve termination logic to skip already terminated instances while logging relevant information.
2025-10-01 06:56:33 +00:00
eun2ce
5eb5417188
fix #210 : add a11y_tree support to UITARSAgent ( #346 )
2025-09-26 18:25:28 +08:00
Yanxiao Zhao
6827949418
fix _update_browse_history_setup ( #345 )
2025-09-25 13:22:40 +08:00
Yanxiao Zhao
a4f8fe2f00
Add autoglm-os-9b-v ( #344 )
...
* update for autoglm-v
* Update run_autoglm.py
---------
Co-authored-by: hanyullai <hanyullai@outlook.com>
2025-09-24 19:43:28 +08:00
alexandruilie7
f59cf00cae
Add ui agent ( #343 )
...
* add uipath agent
* readme update
2025-09-24 19:42:46 +08:00
Long Chen
088e68798c
update aworldguiAgent code ( #342 )
2025-09-23 16:50:29 +08:00
Timothyxxx
584c7a9875
Enhance AWSProvider instance handling with fallback mechanisms for security groups, subnet IDs, and instance types. Implement checks to skip termination of instances already in 'shutting-down' or 'terminated' states, and handle potential termination errors gracefully.
2025-09-18 07:16:10 +00:00
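The termination check described in 584c7a9875 can be sketched as a plain filter over instance states. This is an assumption-laden illustration, not the AWSProvider implementation: the function name `instances_to_terminate` and the dictionary input are hypothetical, and no AWS calls are made. Only the two state names, 'shutting-down' and 'terminated', come from the commit message.

```python
from typing import Dict, List

# States named in the commit message: termination is already underway or done.
SKIP_STATES = {"shutting-down", "terminated"}


def instances_to_terminate(states: Dict[str, str]) -> List[str]:
    """Return the IDs of instances that still need a terminate request.

    `states` maps instance ID to its current state string (hypothetical input).
    """
    return [iid for iid, state in states.items() if state not in SKIP_STATES]
```

For example, given `{"i-1": "running", "i-2": "terminated"}`, only `i-1` would be passed on for termination, which avoids issuing redundant requests for instances that are already gone.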
molanhand
7213eca069
support mano agent ( #338 )
...
Co-authored-by: Fei Hu <molanhand@users.noreply.github.com>
2025-09-16 18:10:29 +08:00
ZhangZuhao
dc7e46e7aa
Refactor platform detection for VM image download ( #337 )
...
Sometimes the platform detection for VM image download is wrong
2025-09-15 21:00:15 +08:00
Dunjie Lu
b012301609
support qwen3vl agent ( #336 )
...
Co-authored-by: root <ludunjie1219@github.com>
2025-09-15 16:04:29 +08:00
Hiroid
a668670349
fix(maestro): Fixed the debug logging level ( #334 )
...
Co-authored-by: Liangxuan Guo <guoliangxuan@deepmatrix.com.cn>
2025-09-11 01:03:59 +08:00
Hiroid
3a4b67304f
Add multiple new modules and tools to enhance the functionality and extensibility of the Maestro project ( #333 )
...
* Added a **pyproject.toml** file to define project metadata and dependencies.
* Added **run\_maestro.py** and **osworld\_run\_maestro.py** to provide the main execution logic.
* Introduced multiple new modules, including **Evaluator**, **Controller**, **Manager**, and **Sub-Worker**, supporting task planning, state management, and data analysis.
* Added a **tools module** containing utility functions and tool configurations to improve code reusability.
* Updated the **README** and documentation with usage examples and module descriptions.
These changes lay the foundation for expanding the Maestro project’s functionality and improving the user experience.
Co-authored-by: Hiroid <guoliangxuan@deepmatrix.com>
2025-09-08 16:07:21 +09:00
Timothyxxx
029885e78c
Merge branch 'main' of github.com:xlang-ai/OSWorld
2025-09-05 15:36:39 +00:00
Timothyxxx
640f3fcd96
Update default path_to_vm argument to None in quickstart.py for improved flexibility
2025-09-05 15:36:31 +00:00
Timothyxxx
756923beea
Update instruction wording in LibreOffice Impress example to clarify text color change requirements. Address https://github.com/xlang-ai/OSWorld/issues/324
2025-09-01 23:29:47 +08:00
Timothyxxx
0c681b91e0
Fix README update
2025-09-01 15:15:50 +00:00