OSWorld-Verified
Benchmark ComputerAgent on OSWorld tasks using HUD
OSWorld-Verified is a curated subset of OSWorld tasks that can be run using the HUD framework.
Use ComputerAgent with HUD to benchmark on these tasks.