Today, chatbots can answer questions, write poems and generate images. In the future, they could also autonomously perform tasks like online shopping and work with tools like spreadsheets.
Google on Wednesday unveiled a prototype of this technology, which artificial intelligence researchers call an A.I. agent.
Google is among the many tech companies building A.I. agents. Various A.I. start-ups, including OpenAI and Anthropic, have unveiled similar prototypes that can use software apps, websites and other online tools.
Google’s new prototype, called Mariner, is based on Gemini 2.0, which the company also unveiled on Wednesday. Gemini is the core technology that underpins many of the company’s A.I. products and research experiments. Versions of the system will power the company’s chatbot of the same name and A.I. Overviews, a Google search tool that directly answers user questions.
“We’re basically allowing users to type requests into their web browser and have Mariner take actions on their behalf,” Jaclyn Konzelmann, a Google project manager, said in an interview with The New York Times.
Gemini is what A.I researchers call a neural network — a mathematical system that can learn skills by analyzing enormous amounts of data. By recognizing patterns in articles and books culled from across the internet, for instance, a neural network can learn to generate text on its own.
The latest version of Gemini learns from a wide range of data, from text to images to sounds. That might include images showing how people use spreadsheets, shopping sites and other online services. Drawing on what Gemini has learned, Mariner can use similar services on behalf of computer users.
“It can understand that it needs to press a button to make something happen,” Demis Hassabis, who oversees Google’s core A.I. lab, said in an interview with The Times. “It can take action in the world.”
Mariner is designed to be used “with a human in the loop,” Ms. Konzelmann said. For instance, it can fill a virtual shopping cart with groceries if a user is in an active browser tab, but it will not actually buy the groceries. The user must make the purchase.
Sundar Pichai, Google’s chief executive, said in a blog post that the developments “bring us closer to our vision of a universal assistant.”
The project was developed as an extension for Google’s popular web browser, Chrome, making it an important platform for the company’s future A.I. ambitions. But those plans could face a setback. The Justice Department has asked a federal judge to force Google to sell or spin off Chrome after a landmark ruling that the company’s search engine is an illegal monopoly.
There are other challenges as well. Ms. Konzelmann acknowledged that, like other chatbots, Mariner makes mistakes. Because such systems operate according to patterns found in vast amounts of data, they sometimes go awry. The mistakes that chatbots make when generating text sometimes go unnoticed, but errors are more problematic when systems are trying to use websites and take other actions.
Source : https://dnyuz.com/2024/12/11/google-unveils-a-i-agent-that-can-use-websites-on-its-own/