It’s somewhat unnerving to hear an AI talk in an eerily friendly tone and tell me to clean up the clutter on my workstation. I am rather proud of it, but I guess it’s time to stack the randomly scattered gadgets and tidy up the wire mess.

My sister would agree, too. But jumping into action after an AI “sees” my table, recognizes the mess, and doles out housekeeping advice is the bigger picture. Google’s Gemini AI chatbot can now do that. And a lot more.

The secret sauce here is a recent feature update called Project Astra. It has been in development for years, and finally started rolling out earlier this month. The overarching idea is to put an all-seeing, all-hearing, and overtly intelligent AI on your phone.

Home screen of Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

Google hawks these superpowers under a rather uninspiring name: Gemini Live with camera and screen sharing. The project was developed at the company’s DeepMind unit, where it began life as a “universal AI assistant.” It’s a shame the final name isn’t as aspirational.

Let’s start with the access situation. The capability is now available for Pixel 9 and Galaxy S25 users. But if you have an Android phone with a Gemini Advanced subscription to go with it, you can access the new toolkit.

That would be $20 per month, by the way. I tried it on the two aforementioned phones and now have it ready to roll on my OnePlus 13, as well. The nicest part? You don’t have to jump through any technical hoops to access it.

Identify painting using Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

A power/volume button combo, or a screen corner swipe to summon Gemini, is all you need. It doesn’t matter what app you are running: you can access the new camera and screen-sharing chops as an overlay in every corner of the OS.

Making sense of the world around you

I started by pointing the camera at a painting and asked about it. Gemini Live was able to accurately identify it as a Madhubani-style painting, decoding the bold use of colors and depiction of fauna.

It then proceeded to give me a brief history lesson and describe the variations that have emerged over the years. The information was accurate, down to the most granular level. Thankfully, you can also choose to have a text-based back-and-forth with Gemini, if you’re in a place where voice conversations could be awkward.

What I like the most about Gemini Live’s new camera and screen-sharing incarnation is that it’s not overly talkative. You can interrupt it at any given moment, which only adds to the “natural” appeal of the conversation.

Talking via text using Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

I tried Gemini in a variety of scenarios. I was not prepared for it.

The answers it provides are usually succinct, as if it wants to give you a chance (or even a nudge) to ask a follow-up question instead of delivering an overwhelmingly long answer. It excels in a whole range of topics and visual scenarios, but there are a few pitfalls.

It can’t use Google Lens yet, which means Gemini can’t compare the image it sees on your phone’s screen against matching results on the internet. Moreover, it can’t access information in real time if you ask Gemini to look up the latest developments around a topic or personality.

Understanding Hindi with Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

I asked it about plant species and restaurant listings, had it pick up data from notice boards, and got it to make sense of my medical prescription for a recent bout of flu. Gemini fared pretty well, better than I’ve ever seen the AI chatbot perform so far.

Unlocking a knowledge bank

Next, I pushed Gemini to make sense of complex academic material. I put a book on machine learning in the camera frame. Gemini Live not only recognized it, but also proceeded to give me an overview of the book’s contents and its core subjects.

Curious, I started flipping through the pages and landed on the chapter list. The AI recognized the progress, stopped talking, and asked me whether I was interested in any particular chapter now that I was checking out the contents list.

I was taken aback at this moment.

Reading Urdu using Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

I asked it to break down a few complex topics, and the AI did a respectable job, even going beyond the scope of the on-page material and pulling information from its expansive knowledge bank.

For example, when I asked it about the contents of the opening page of Bhisham Sahni’s seminal novel, Tamas, the AI correctly picked up the mention of the Sahitya Akademi Award. It then went on to mention details that were not even listed on the page, such as the year it won the prestigious literary honor and what the book is all about.

On the flip side, the Hindi language readout by Gemini Live was horrible. It was not just the poor accent, but the fact that Gemini repeatedly spoke pure gibberish and non-words. While trying to read Urdu, Persian, and Arabic, it did a fairly good job, but often mixed up words from random lines.

Scanning a book using Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

On my first attempt with Urdu poetry, it not only recognized the Urdu text, but also gave an accurate summary of the poem. The biggest challenge, once again, was narration. Hearing an anglicized version of Urdu really hurt my ears.

Excels in surprising spots

AI is a fantastic problem-solving tool, and there are numerous benchmarks to prove it. I tested it against physics problems dealing with thermodynamics, electrochemical equations, and statistical problems written in a handwritten notebook. Gemini Live did a fantastic job at such tasks.

It excelled at creative chores, too. My sister, who is a fashion designer, presented one of her sketches in the camera view and asked for feedback as well as improvements. Gemini Live started by praising the design, drew parallels with a few fashion brands’ design ideologies, and made a handful of recommendations.

When prodded further, the AI also advised my sister on the best tools for converting hand-drawn sketches into digital concepts. It followed those words of guidance by providing helpful information on the software stack and where one could find learning material.

Scanning a sticker using Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

When I put a couple of Duracell batteries in the camera view, it not only recognized them accurately, but also told me which hyperlocal e-commerce platforms can deliver them to me within minutes.

The services, named Blinkit and Swiggy Instamart, are only available in India and mostly reserved for urban locales. Even in a dimly lit room, it was able to identify a pair of wired earphones on the first attempt.

Situational awareness is its strong suit.

Alert for Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

Compared to your usual Gemini chat or what you find in the AI Overview section of Google Search, Gemini Live conversations take a more cautious approach to doling out knowledge, especially if it’s sensitive in nature. I noticed that topics such as food recommendations and medical treatment are handled with an increasingly cautious approach, and users are often nudged to find the right expert resource.

A few familiar pitfalls

My overwhelming takeaway is that Gemini’s “Project Astra” makeover is mighty impressive. It’s a glimpse into the future of what smartphones can achieve. With a few improvements, integrations, and cross-app workflows, it can make Google Search feel like an outdated relic. But for now, there are a few glaring flaws.

On a few occasions, I did notice that the memory system went haywire. When I asked the AI to identify a fitness band in the camera view, it correctly recognized it as the Samsung Galaxy Fit 3. But when I pushed a follow-up question, it mistakenly perceived the device as a fitness band from Huawei.

It can also blatantly lie. And quite confidently, I might say. For example, when I told it to summarize my review of the wearable device, the AI answered that Digital Trends hasn’t reviewed it yet. In reality, the article was published a week ago.

Reading a passage with Gemini Live with camera and screen sharing.

Nadeem Sarwar / Digital Trends.

Next, I asked it to go through a few articles on my author page after I enabled screen sharing. Gemini did a decent job of explaining the stories, but occasionally stumbled at contextual understanding. For example, it incorrectly mentioned that only Intel and AMD can make NPUs that qualify for the Copilot+ badge.

The article, on the other hand, clearly mentions that Qualcomm was the first to meet that criterion, ahead of the competition, and that it was only late last year that AMD and Intel could finally level up and meet that AI chip baseline with a new portfolio of processors.

Midway through the conversation about an article, it again ran into a memory issue. Instead of summarizing the story that was being discussed, it went back to talking about the first article that it saw via screen sharing. When I interrupted it midway through the narration, Gemini fixed its mistake.

Another issue I noticed with narration of non-English languages is that Gemini Live randomly changed the voice and pace midway through the narration. It was quite jarring, and the pronunciation was utterly mechanical, far different from its human-like English conversational skills.

The machine vision struggles are also apparent with stylized fonts. On a few occasions, it confidently blurted out wrong information, and when asked to correct itself, the AI said it was unable to find the latest information on that topic. Those scenarios are rare, but the Gemini errors are here to stay.

To sum it all up, I think Gemini Live with camera and screen sharing is one of the biggest leaps AI has made so far. It is also one of the most practically rewarding implementations of generative AI to date. All it needs is a dash of diversity and a fix for its “confident liar” syndrome.

Things are definitely on the right track now, and overwhelmingly so, but still a few crucial milestones away from being the perfect AI companion of techno-futuristic dreams.