OSWorld/evaluation_examples/examples
MillanK cbc3b590ff
Task fix batch (#383)
* update 873cafdd-a581-47f6-8b33-b9696ddb7b05 task eval

* c1fa57f3-c3db-4596-8f09-020701085416 fix, add tolerance to url matching

* 8df7e444-8e06-4f93-8a1a-c5c974269d82 add more clear instruction to the filename for compress

* add address string normalization for 6f4073b8-d8ea-4ade-8a18-c5d1d5d5aa9a

---------

Co-authored-by: Jiaqi <dengjiaqi@moonshot.cn>
2025-11-19 17:24:25 +08:00
..
chrome Task fix batch (#383) 2025-11-19 17:24:25 +08:00
gimp Update GIMP evaluation examples to replace local file paths with cloud file URLs for consistency and accessibility. (#372) 2025-11-07 21:49:49 +08:00
libreoffice_calc ver Oct3rd (#349) 2025-10-04 00:13:29 +08:00
libreoffice_impress Update instruction wording in LibreOffice Impress example to clarify text color change requirements. Address https://github.com/xlang-ai/OSWorld/issues/324 2025-09-01 23:29:47 +08:00
libreoffice_writer
multi_apps Task fix batch (#383) 2025-11-19 17:24:25 +08:00
os refactor: update command in JSON example to use placeholder for client password 2025-07-31 05:20:04 +00:00
thunderbird
vlc
vs_code