2026-06-08

Apple reveals new AI architecture built around Google Gemini models

Architecture & Model Choices

Apple described five “Apple Foundation Models” (AFMs): two on‑device (Core, Core Advanced) and three cloud (Cloud, Cloud Image, Cloud Pro).
Cloud Pro is said to be “Gemini frontier‑level” and runs on NVIDIA GPUs in Google Cloud under Apple’s Private Cloud Compute (PCC); others run on Apple Silicon.
Everything except Cloud Pro is described as custom Apple models “refined” using Gemini; commenters speculate this means some form of distillation or fine‑tuning, but details are unclear.
Some links claim earlier on‑device AFM was ~3B parameters; people note the new stack is “more complicated” than that.

Gemini Quality & Hallucinations

Several posters complain that public Gemini (especially search “AI mode”) is inaccurate and hallucinatory; some prefer Claude or ChatGPT.
Others distinguish Google Search’s AI mode from the Gemini app/API, saying the latter is much better, especially paid tiers (e.g., Ultra).
Local Gemma‑based models are praised on phones, but some report current iOS local Gemini is slow, hot, and still hallucinates.

Privacy, Private Cloud Compute, and Trust

Apple claims: on‑device first, PCC for offload, data only used per request, and not accessible to Apple or third parties; security researchers can verify via published PCC design.
PCC uses confidential computing (Apple Silicon, and now also Intel/NVIDIA on Google Cloud) plus OHTTP‑style relays; keys are held so operators allegedly can’t inspect data.
Supporters call this the best available privacy architecture for off‑device inference.
Skeptics point out: users must still trust Apple (and underlying hardware vendors, and jurisdiction); nation‑state backdoors, zero‑days, or legal compulsion remain possible; some call it “security theater” unless independently and continuously audited.

EU DMA, Regulation, and Feature Delay

Siri AI / Apple Intelligence is delayed in the EU. Apple blames the DMA’s requirement for parity of access to device data and actions for third‑party assistants.
One side: giving “any AI app” Siri‑level permissions (read all personal data, control apps) is too risky; Apple is right to resist.
Other side: this is about preserving lock‑in and avoiding competition; users should be allowed to choose other assistants with explicit permissions, as with contacts/photos today.
Debate over paternalism vs user autonomy; some argue safeguards and system prompts would suffice, others think average users will be tricked by Meta‑style dark patterns.

Why Google (and Not Others)?

Reasons discussed:
- Google’s strength in small/edge models (Gemma, on‑device Gemini) and prior edge‑AI work.
- Massive compute capacity (TPUs, GPUs, data centers) and willingness to host Apple’s own AFMs under PCC.
- Existing multibillion‑dollar search partnership and perceived corporate stability vs newer labs.
Many think models are becoming commodities; the real differentiation will be Apple’s integration, orchestration, and UX.

User Choice, Lock‑in, and Third‑Party Assistants

Some want a system‑level way to plug in alternative models (Claude, Mistral, DeepSeek, self‑hosted) behind the same Apple Intelligence APIs.
Others argue Apple won’t (and isn’t required to) operate everyone’s models in PCC, and that open routing to arbitrary clouds would weaken privacy guarantees.
DMA debate recurs here: whether “same access as Siri” should apply only to on‑device APIs, or also to cloud backends.

Siri, UX, and App Integration

Many say legacy Siri was “terrible” and are skeptical that reusing the Siri brand will change perception.
Technical talks about App Intents and Shortcuts suggest a deep agentic layer: AI can coordinate across apps, change passwords, book travel, etc.
Some are excited about OS‑level integration that third‑party chatbots can’t match; others fear brittle automation (e.g., AI mis‑changing passwords) and complex failure modes.

Views on Apple’s Overall AI Strategy

Critique: Apple is “weirdly behind,” relying on others for core AI, showing loss of innovation leadership and over‑focus on operations.
Counter‑view: Apple wisely avoided burning tens of billions on training frontier models; now it can rent or co‑develop models once the landscape is clearer.
Several note Apple has long positioned itself as a hardware+integration company, not a search/AI lab; treating the model as a swappable implementation detail fits that philosophy.

Related topics