by xinnan-tech
Automates Android phone interactions through vision-language models that capture screenshots, analyze UI elements, and execute touch actions.