Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
-
Updated
Aug 7, 2025 - Python
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
[NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
Edge-optimized OpenCUA-7B computer-use agent evaluated on OSWorld, exploring systematic vLLM inference optimizations across CPU and GPU, including precision tuning, image history management, speculative decoding, and prefix caching.
Evaluation of GPT-4o-mini on OSWorld desktop automation benchmark. Compares screenshot-only vs accessibility tree-enhanced approaches across 10 tasks (Chrome, LibreOffice, file ops, etc). Documents critical coordinate extraction failures and provides architectural recommendations for GUI agents.
Add a description, image, and links to the osworld topic page so that developers can more easily learn about it.
To associate your repository with the osworld topic, visit your repo's landing page and select "manage topics."