Conceivably, they don't have precise answers for that yet, and won't until after... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

swatcoder on June 10, 2024 | parent | context | favorite | on: Apple's On-Device and Server Foundation Models

Conceivably, they don't have precise answers for that yet, and won't until after they see what real-world usage looks like.

They built out a system that's ready to scale to deliver features that may not work on available hardware, but they're also incentivized to minimize actual reliance on that cloud stuff as it incurs per-use costs that local runs don't.

GaggiX on June 10, 2024 [–]

Yeah this is probably right. If it works well enough during real-world usage it will be using the on-device model, if not then there is the bigger one on the servers. There is also GPT-4o, so they have 3 different models to use depending on the task.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact