What will you use the AI in the phone to do for you? I can understand tablets and smart glasses being able to leverage smol AI much better than a phone, which relies on apps for most of the work.
I desperately want to be able to dictate actions to take on my phone in real time.
Stuff like:
"Open Chrome, new tab, search for xyz, scroll down, third result, copy the second paragraph, open whatsapp, hit back button, open group chat with friends, paste what we copied and send, send a follow-up laughing tears emoji, go back to chrome and close out that tab"
All while being able to just quickly glance at my phone. There is already a tool like this, but I want the parsing/understanding of an LLM and super fast response times.
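
Roughly what I'd sketch for that loop, assuming Android: the LLM turns the transcript into a flat list of UI actions, and an AccessibilityService executes them. Everything here is hypothetical (the UiAction schema and the parseActions stub are mine, not any shipping API), just to show the shape:

    import android.accessibilityservice.AccessibilityService
    import android.content.Intent
    import android.os.Bundle
    import android.view.accessibility.AccessibilityEvent
    import android.view.accessibility.AccessibilityNodeInfo

    // Hypothetical flat action schema an LLM could emit as JSON.
    sealed class UiAction {
        data class OpenApp(val pkg: String) : UiAction()
        data class TypeText(val text: String) : UiAction()
        object Back : UiAction()
    }

    class VoiceDriverService : AccessibilityService() {
        override fun onAccessibilityEvent(event: AccessibilityEvent) {}
        override fun onInterrupt() {}

        // Stub: an on-device or remote LLM would map the transcript
        // to the schema above.
        private fun parseActions(transcript: String): List<UiAction> =
            TODO("LLM call goes here")

        fun execute(transcript: String) {
            for (action in parseActions(transcript)) when (action) {
                is UiAction.OpenApp ->
                    packageManager.getLaunchIntentForPackage(action.pkg)
                        ?.apply { addFlags(Intent.FLAG_ACTIVITY_NEW_TASK) }
                        ?.let { startActivity(it) }
                is UiAction.TypeText -> {
                    // Set text on whatever currently holds input focus.
                    val args = Bundle().apply {
                        putCharSequence(
                            AccessibilityNodeInfo.ACTION_ARGUMENT_SET_TEXT_CHARSEQUENCE,
                            action.text
                        )
                    }
                    findFocus(AccessibilityNodeInfo.FOCUS_INPUT)
                        ?.performAction(AccessibilityNodeInfo.ACTION_SET_TEXT, args)
                }
                UiAction.Back -> performGlobalAction(GLOBAL_ACTION_BACK)
            }
        }
    }

The hard parts are all in parseActions and in grounding things like "third result" against the live view hierarchy; the dispatch itself is the easy bit.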
This new model is absurdly quick on my phone, and that's on launch day; I wonder if it's additional capacity/lower demand or if this is what we can expect going forward.
On a related note, why would you want to break your tasks down to that level? Surely it should be smart enough to do some of that without you asking, so you can just state your end goal.
This has been my dream for voice control of PCs for ages now. No wake word, no button press, no beeping or nagging: just fluently describe what you want to happen and it does.
Without a wake word, it would have to listen to and process all captured audio. Do you really want everything picked up near the device/mic to be sent to external servers?
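
The transcription half doesn't have to leave the device, for what it's worth. A minimal sketch, assuming a reasonably recent Android with offline language packs installed (EXTRA_PREFER_OFFLINE is only a hint; API 31+ adds createOnDeviceSpeechRecognizer for a hard guarantee):

    import android.content.Context
    import android.content.Intent
    import android.os.Bundle
    import android.speech.RecognitionListener
    import android.speech.RecognizerIntent
    import android.speech.SpeechRecognizer

    // Sketch: prefer on-device transcription so raw audio stays local;
    // only the resulting text would go anywhere else.
    fun startLocalDictation(context: Context, onText: (String) -> Unit) {
        val recognizer = SpeechRecognizer.createSpeechRecognizer(context)
        recognizer.setRecognitionListener(object : RecognitionListener {
            override fun onResults(results: Bundle?) {
                results?.getStringArrayList(SpeechRecognizer.RESULTS_RECOGNITION)
                    ?.firstOrNull()?.let(onText)
            }
            // Remaining callbacks left as no-ops for brevity.
            override fun onReadyForSpeech(params: Bundle?) {}
            override fun onBeginningOfSpeech() {}
            override fun onRmsChanged(rmsdB: Float) {}
            override fun onBufferReceived(buffer: ByteArray?) {}
            override fun onEndOfSpeech() {}
            override fun onError(error: Int) {}
            override fun onPartialResults(partialResults: Bundle?) {}
            override fun onEvent(eventType: Int, params: Bundle?) {}
        })
        recognizer.startListening(
            Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH).apply {
                putExtra(RecognizerIntent.EXTRA_PREFER_OFFLINE, true) // API 23+
                putExtra(RecognizerIntent.EXTRA_PARTIAL_RESULTS, true)
            }
        )
    }

Always-on listening is still a battery and trust problem either way, but "no wake word" doesn't have to mean "audio on someone else's server".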
Because typing on mobile is slow, app switching is slow, and text selection and copy-paste are torture. Pretty much the only one of the interactions OP listed that's already quick is scrolling.
Plus, if the above worked, the higher-level interactions could trivially work too. "Go to event details", "add that to my calendar".
FWIW, I'm starting to embrace using Gemini as a general-purpose UI for some scenarios just because it's faster. The most common one: "<paste whatever> add to my calendar please."
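
And once the model has pulled out a title and a time, the last hop can be a plain calendar insert intent; nothing model-specific about it. A minimal sketch (the extracted field values are assumed to come from whatever parsing you use):

    import android.content.Context
    import android.content.Intent
    import android.provider.CalendarContract

    // Sketch: open the stock calendar's prefilled "new event" screen
    // from fields extracted out of pasted text; the user just confirms.
    fun addToCalendar(context: Context, title: String, startMillis: Long, endMillis: Long) {
        val intent = Intent(Intent.ACTION_INSERT).apply {
            data = CalendarContract.Events.CONTENT_URI
            putExtra(CalendarContract.Events.TITLE, title)
            putExtra(CalendarContract.EXTRA_EVENT_BEGIN_TIME, startMillis)
            putExtra(CalendarContract.EXTRA_EVENT_END_TIME, endMillis)
        }
        context.startActivity(intent)
    }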