Skip to content

Google Launches Gemini 2.5 for AI-Powered Computer Control

Google's new Gemini 2.5 model brings AI to computer control. Explore its robust safety features and developer-friendly controls in the public preview.

In this image we can see a few people around the table using laptops, there we can see a person...
In this image we can see a few people around the table using laptops, there we can see a person using a mobile phone and few objects on the table.

Google Launches Gemini 2.5 for AI-Powered Computer Control

Google has launched the Gemini 2.5 Computer Use model, a specialized AI tool designed to enhance user interface control in web applications and mobile apps. The model, now in public preview, is available via the Gemini API on Google's AI Studio and Vertex AI.

The Gemini 2.5 Computer Use model stands out with its robust safety features and developer-friendly controls, mitigating potential risks associated with AI agents controlling computers. While primarily optimized for web browsers and mobile UI control tasks, it's not yet fully optimized for desktop OS-level control.

The core capabilities of the model are accessible through the new tool in the Gemini API. Operators should run these capabilities within a loop to ensure smooth interaction. Google has not yet revealed any specific firms or development teams that have integrated this model into production, but it's available for developers to explore and automate tasks in web applications and mobile UIs.

The Gemini 2.5 Computer Use model, built on the visual understanding and reasoning capabilities of Gemini 2.5 Pro, outperforms leading alternatives in web and mobile control benchmarks with lower latency. Google's release of this model, along with its safety features and developer controls, signals a step forward in AI's role in automating computer tasks.

Read also:

Latest