Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
-
Updated
Aug 7, 2025 - Python
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
[NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
Mano-P: Open-source GUI-VLA agent for edge devices. #1 on OSWorld (specialized, 58.2%). Runs locally on Apple M4 Mac mini/MacBook — no data leaves your device.Mano-P 是一个开源 GUI-VLA 项目,支持在 Mac mini/MacBook 上或通过算力棒本地运行推理,实现纯视觉驱动的跨平台 GUI 自动化操作。数据完全本地处理,支持复杂多步骤任务规划与执行。
Edge-optimized OpenCUA-7B computer-use agent evaluated on OSWorld, exploring systematic vLLM inference optimizations across CPU and GPU, including precision tuning, image history management, speculative decoding, and prefix caching.
Evaluation of GPT-4o-mini on OSWorld desktop automation benchmark. Compares screenshot-only vs accessibility tree-enhanced approaches across 10 tasks (Chrome, LibreOffice, file ops, etc). Documents critical coordinate extraction failures and provides architectural recommendations for GUI agents.
Add a description, image, and links to the osworld topic page so that developers can more easily learn about it.
To associate your repository with the osworld topic, visit your repo's landing page and select "manage topics."