{"id":258032,"date":"2026-05-28T14:30:00","date_gmt":"2026-05-28T18:30:00","guid":{"rendered":"https:\/\/news-you-need.com\/index.php\/2026\/05\/28\/apple-working-to-cram-massive-gemini-model-into-iphone-to-power-new-siri\/"},"modified":"2026-05-28T15:00:13","modified_gmt":"2026-05-28T19:00:13","slug":"apple-working-to-cram-massive-gemini-model-into-iphone-to-power-new-siri","status":"publish","type":"post","link":"https:\/\/news-you-need.com\/index.php\/2026\/05\/28\/apple-working-to-cram-massive-gemini-model-into-iphone-to-power-new-siri\/","title":{"rendered":"Apple working to cram massive Gemini model into iPhone to power new Siri"},"content":{"rendered":"<p><a href=\"https:\/\/arstechnica.com\/ai\/2026\/05\/apple-reportedly-trying-to-distill-googles-multi-trillion-parameter-gemini-ai-to-run-on-iphone\/\">Apple working to cram massive Gemini model into iPhone to power new Siri<\/a><\/p>\n<p><a href=\"https:\/\/arstechnica.com\/ai\/2026\/05\/apple-reportedly-trying-to-distill-googles-multi-trillion-parameter-gemini-ai-to-run-on-iphone\/\">https:\/\/arstechnica.com\/ai\/2026\/05\/apple-reportedly-trying-to-distill-googles-multi-trillion-parameter-gemini-ai-to-run-on-iphone\/<\/a><\/p>\n<p>Publish Date: 2026-05-28 14:30:00<\/p>\n<p>Source Domain: <a href=\"arstechnica.com\">arstechnica.com<\/a><\/p>\n<p>It\u2019s impossible to totally avoid generative AI when interacting with technology anymore, but Apple has a bit less of it. That\u2019s not entirely by choice, though. The iPhone maker has delayed the AI-enhanced Siri multiple times since first promising it in 2024, but a deal with Google will merge the iconic assistant with Gemini later this year. As we approach the <span id=\"_OIsYar66MbSDm9cP-si7qQ4_65\" class=\"K6pdKd wtBS9\">Worldwide Developers Conference<\/span>, Apple has been working to bring big AI smarts to the modest processing environment of a smartphone. 
Apple fans may not like the outcome, though.<\/p>\n<p>Apple has long crowed about the privacy value of running AI locally, but a new report suggests that despite Apple\u2019s best efforts, the iPhone\u2019s Gemini makeover will lean heavily on Google and Nvidia in the cloud. The Information reports that Apple\u2019s Gemini-infused Siri will run both on-device and in the cloud, an apparent reversal of its privacy-focused preference for local AI.<\/p>\n<p>With every new chip announcement, we hear about how the silicon has been optimized for AI\u2014even Apple does this with its focus on Neural Engine upgrades. You may think from the grandiose language that smartphones are equipped to handle beefy AI models, but that\u2019s not necessarily the case. In fact, the GPUs in most phones can process more AI tokens than the AI-focused NPUs. Components like Apple\u2019s Neural Engine are designed for contextual, efficient AI processing. Even if phones had faster AI processing, they lack the RAM to keep enormous models in memory.<\/p>\n<p>Even the largest AI models are still middling assistants, and that makes local AI very challenging. The AI models that run on phones are physically smaller, featuring at most a few billion parameters. Compare that to Google\u2019s latest Gemini models, which have trillions of parameters, The Information reports. 
On-device AI models are also \u201cquantized\u201d to run at lower precision, making them faster but affecting the accuracy of token&#8230;<\/p>\n<p><a href=\"https:\/\/arstechnica.com\/ai\/2026\/05\/apple-reportedly-trying-to-distill-googles-multi-trillion-parameter-gemini-ai-to-run-on-iphone\/\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Apple working to cram massive Gemini model into iPhone to power new Siri https:\/\/arstechnica.com\/ai\/2026\/05\/apple-reportedly-trying-to-distill-googles-multi-trillion-parameter-gemini-ai-to-run-on-iphone\/ Publish&#8230;<\/p>\n","protected":false},"author":1,"featured_media":258034,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/cdn.arstechnica.net\/wp-content\/uploads\/2024\/04\/apple-google-logo-1152x648.jpg","fifu_image_alt":"","footnotes":""},"categories":[120],"tags":[],"class_list":["post-258032","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-iphone"],"_links":{"self":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/258032"}],"collection":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/comments?post=258032"}],"version-history":[{"count":1,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/258032\/revisions"}],"predecessor-version":[{"id":258035,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/258032\/revisions\/258035"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/media\/258034"}],"wp:attachment":[{"href":"https:\/\/news-you-need.com\
/index.php\/wp-json\/wp\/v2\/media?parent=258032"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/categories?post=258032"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/tags?post=258032"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}