mobilerun framework (formerly droidrun) achieves state of the art performance on AndroidWorld with 91.4% success rate across 116 diverse tasks. Despite setup challenges and evaluation limitations, our open source approach using GPT-5 and direct Accessibility API access outperforms all competing agents.
Success rates of leading AI agents on the 116-task AndroidWorld benchmark (03.10.2025)
106 out of 115 tasks completed successfully
read the methodology.
Learn how the Manager-Executor architecture with dynamic feedback loops achieves state-of-the-art results.