- By Vikas Yadav
- Wed, 15 May 2024 12:59 AM (IST)
- Source:JND
Google I/O 2024: Google introduced a host of AI-based updates for Android at its I/O Developer Conference. These new announcements included an upgraded Circle to Search feature, better contextual understanding for Gemini, multimodal capabilities for Gemini Nano and more. While some of the features are live, others will ship in the coming days.
Moreover, the company is also testing a feature that alerts users when a phone conversation shows signs of a possible scam. Here are the details of all the features.
Improved Circle to Search: Google shared that the much-hyped AI feature can now help students solve physics and math word problems. Further, Circle to Search will be upgraded to solve "even more complex problems" that may involve symbolic formulas, graphs, diagrams, and more, thanks to LearnLM, a new set of Gemini-based AI models optimised for learning-related tasks. Showcased at Samsung Galaxy Unpacked 2024, the AI feature is available on over 100 million devices.
Circle to Search can now help with homework—directly from your Pixel phone or tablet.
— Made by Google (@madebygoogle) May 14, 2024
When you circle the exact part of a prompt you're stuck on, you'll get step-by-step guidance to solve physics word problems without leaving your digital info sheet or syllabus. #GoogleIO pic.twitter.com/Fsmtcu7emn
Gemini AI Upgrades: Google's popular AI model will get better at understanding context when responding to content or apps running on the screen. "Soon, you'll be able to bring up Gemini's overlay on top of the app you're in to easily use Gemini in more ways," Google said. In the upcoming months, it will let users drag and drop generated images into select Google apps and more, and ask questions about videos and PDFs (in Gemini Advanced).
Multimodal Gemini Nano: Moreover, Pixel smartphones that run Gemini Nano on-device can expect multimodality later this year. With the updated capabilities, the model will process "context from sights, sounds and spoken language" in addition to text. Further, the tech giant said Gemini Nano's multimodal capabilities will be available in TalkBack for "richer and clearer" image descriptions. The feature will work on-device and offline.
Coming to Pixel later this year, we’ll be introducing our latest model, Gemini Nano with Multimodality.
— Made by Google (@madebygoogle) May 14, 2024
This means your phone will not just be able to process text input but also understand more information in context like sights, sounds and spoken language. #GoogleIO pic.twitter.com/1yTujAl1W7
AI-Powered Scam Call Detection: The search king is also testing a Gemini Nano-backed feature that will send real-time alerts when the conversation pattern on a call hints at a possible scam. Meanwhile, the company also introduced new AI features for its apps and Search.