Subscribe to Our Newsletter

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn't arrive within 3 minutes, check your spam folder.

Ok, Thanks
Google’s 'Project Jarvis' AI may soon handle your flight bookings and shopping

Google’s 'Project Jarvis' AI may soon handle your flight bookings and shopping

It can automate common web-based tasks directly within Google Chrome.

Kelechi Edeh profile image
by Kelechi Edeh

The race to build fully autonomous AI agents seems to be heating up. Just last week, Anthropic introduced an updated Claude 3.5, allowing it to interact with computers in ways similar to human actions.

Now, Google is reportedly working on its own “computer-using agent," known as Project Jarvis, which aims to automate common web-based tasks like booking flights, doing online research, and finding shopping deals—all within Google Chrome.

Jarvis’s functionality reportedly relies on capturing regular screenshots to interpret the content on a screen before taking actions, such as clicking buttons or filling out text fields.

Due to its cloud-based setup, some sources note that Jarvis takes a few seconds between actions—likely allowing for the precision needed in handling complex, multi-step tasks.

Anthropic’s Claude can now interact with computers like a human in new upgrade to its model
Claude 3.5 makes automating complex tasks easier for developers, while the upcoming Claude 3.5 Haiku promises faster, more efficient AI performance.

Reports from The Information further suggest that Google’s Gemini 2.0 model will drive Jarvis, enhancing its capabilities to interpret and respond to complex commands more effectively.

To test and fine-tune these features, Google may initially release Jarvis to Gemini Advanced subscribers in a phased rollout, potentially beginning in December. This measured approach could allow Google to gather early feedback and address any issues before a wider release. However, these timelines remain flexible as development continues.

Project Jarvis also enters a competitive field where similar tools are emerging. Microsoft’s Copilot Vision and OpenAI's ChatGPT offer related functionalities, and Apple is rumored to be developing its own screen-aware AI assistant.

Yet, by integrating Jarvis directly within Chrome, Google could provide unique value to users relying heavily on browser-based applications. This approach also resembles Rabbit’s "large action model," which also seeks to streamline web interactions through automation but is yet to be fully available.

Should Jarvis progress as planned, it may redefine how Google Gemini's 330.9 million users approach repetitive online tasks, potentially enhancing productivity by automating everyday activities.

With the anticipated December preview approaching, further updates on Project Jarvis could indicate a shift in how AI integrates into daily digital workflows, shaping the future of web-based automation.

INFOGRAPHIC: Google Gemini vs Microsoft Copilot - Which AI Assistant Is the Best?
Gemini and Copilot are two of the biggest workspace AI assistants out there, but which is better?
Kelechi Edeh profile image
by Kelechi Edeh

Subscribe to Techloy.com

Get the latest information about companies, products, careers, and funding in the technology industry across emerging markets globally.

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn’t arrive within 3 minutes, check your spam folder.

Ok, Thanks

Read More