foundation-models

Foundation Models

Apple's on-device AI framework, providing access to a roughly 3-billion-parameter language model for summarization, extraction, classification, and content generation. Inference runs entirely on device, with no network connection required.

Overview

The Foundation Models framework enables intelligent text processing directly on device, without server round-trips, user data sharing, or network dependencies. The core principle: use the on-device model for specific, contained tasks, not for general knowledge.

Reference Loading Guide

ALWAYS load reference files if there is even a small chance the content may be required. It's better to have the context than to miss a pattern or make a mistake.

| Reference | Load When |
| --- | --- |
| Getting Started | Setting up LanguageModelSession, checking availability, basic prompts |
| Structured Output | Using @Generable for type-safe responses, @Guide constraints |
| Tool Calling | Integrating external data (weather, contacts, MapKit) via Tool protocol |
| Streaming | AsyncSequence for progressive UI updates, PartiallyGenerated types |
| Troubleshooting | Context overflow, guardrails, errors, anti-patterns |
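
As a quick taste of the Tool Calling entry above, here is a minimal sketch of a type conforming to the Tool protocol. It assumes the shipping iOS 26 SDK, where `call(arguments:)` can return a `String`; earlier SDK seeds used a different return type. `WeatherTool`, its stubbed reply, and `askAboutWeather()` are hypothetical names for illustration only.

```swift
import FoundationModels

// Hypothetical tool with a stubbed lookup; a real one might query WeatherKit.
struct WeatherTool: Tool {
    let name = "getWeather"
    let description = "Returns the current temperature for a city."

    // The model fills in these arguments when it decides to call the tool.
    @Generable
    struct Arguments {
        @Guide(description: "The city to look up")
        var city: String
    }

    // Assumption: String is accepted as the tool output type in the shipping SDK.
    func call(arguments: Arguments) async throws -> String {
        // Stub value for illustration only.
        "It is 21 degrees Celsius in \(arguments.city)."
    }
}

// Hypothetical caller: registers the tool on a session and asks a question.
func askAboutWeather() async throws -> String {
    let session = LanguageModelSession(
        tools: [WeatherTool()],
        instructions: "Use the getWeather tool when asked about weather."
    )
    return try await session.respond(to: "What is the weather in Cupertino?").content
}
```

The model decides whether and when to call the tool, so the final response text is not guaranteed to include the stubbed value verbatim.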

Core Workflow

1. Check availability with SystemLanguageModel.default.availability
2. Create LanguageModelSession with optional instructions
3. Choose output type: plain String or @Generable struct
4. Use streaming for long generations (longer than about 1 second)
5. Handle errors: context overflow, guardrails, unsupported language
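
The workflow above can be sketched end to end roughly as follows. This is a minimal sketch assuming the iOS 26 SDK; `summarize(_:)` and the instruction text are hypothetical and chosen only for illustration. Streaming is omitted here to keep the example short.

```swift
import FoundationModels

/// Hypothetical helper: summarizes the user's own text on device.
/// Returns nil when the model is unavailable or generation fails,
/// so the caller can fall back to another path.
func summarize(_ text: String) async -> String? {
    // Check availability before creating a session.
    guard case .available = SystemLanguageModel.default.availability else {
        return nil
    }

    // Create a session; the instructions are optional.
    let session = LanguageModelSession(
        instructions: "Summarize the user's text in two sentences."
    )

    do {
        // Plain String output. Passing `generating:` with a @Generable
        // type would produce structured output instead.
        let response = try await session.respond(to: text)
        return response.content
    } catch {
        // Context overflow, guardrail violations, and unsupported
        // languages all surface here as thrown errors.
        print("Generation failed: \(error)")
        return nil
    }
}
```

Returning an optional keeps the call site simple; an app that needs to tell failure modes apart would catch specific `LanguageModelSession.GenerationError` cases instead.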

Model Capabilities

| Use Case | Foundation Models? | Alternative |
| --- | --- | --- |
| Summarization | Yes | |
| Extraction (key info) | Yes | |
| Classification | Yes | |
| Content tagging | Yes (built-in adapter) | |
| World knowledge | No | ChatGPT, Claude, Gemini |
| Complex reasoning | No | Server LLMs |
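
To illustrate the content tagging entry, the sketch below pairs the built-in content tagging use case with a @Generable output type. It assumes the iOS 26 SDK; `TagResult`, its guide text, and `tags(for:)` are hypothetical names.

```swift
import FoundationModels

// Hypothetical output type; the property name and guide text are illustrative.
@Generable
struct TagResult {
    @Guide(description: "Most important topics in the text", .maximumCount(3))
    let topics: [String]
}

/// Hypothetical helper: asks the content tagging model for up to three topics.
func tags(for text: String) async throws -> [String] {
    // Select the built-in content tagging adapter instead of the default model.
    let model = SystemLanguageModel(useCase: .contentTagging)
    let session = LanguageModelSession(model: model)

    // `generating:` requests a typed TagResult rather than a plain String.
    let response = try await session.respond(to: text, generating: TagResult.self)
    return response.content.topics
}
```

The same availability check that applies to the default model applies here too, via `model.availability`.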

Platform Requirements

iOS 26+, macOS 26+, iPadOS 26+, visionOS 26+
Apple Intelligence-enabled device (iPhone 15 Pro+, M1+ iPad/Mac)
User opted into Apple Intelligence

Common Mistakes

Using Foundation Models for world knowledge: The roughly 3B-parameter model is tuned for on-device text tasks, not as a knowledge base. It cannot be relied on for current events, specific facts, or "who is X" questions; use a server model such as ChatGPT or Claude for those. Keep prompts to summarizing the user's own content, extracting information, and classifying text.

Blocking the main thread: LanguageModelSession methods such as respond(to:) are async. Await them from a Task { } or another async context, and never block the main thread while waiting for a result (for example with a semaphore), because that freezes the UI.
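
One way to keep the UI responsive is to start generation in a Task and stream partial text into observable state. The sketch below assumes the shipping iOS 26 SDK, where each streamed element is a snapshot exposing `content`; earlier SDK seeds yielded the partial value directly. `SummaryViewModel` is a hypothetical name.

```swift
import FoundationModels
import Observation

// Hypothetical view model for a SwiftUI screen that shows generated text.
@MainActor
@Observable
final class SummaryViewModel {
    var text = ""
    private let session = LanguageModelSession()

    func start(prompt: String) {
        // The Task suspends at each `await` instead of blocking,
        // so the main thread stays free to render UI updates.
        Task {
            do {
                for try await snapshot in session.streamResponse(to: prompt) {
                    // Assumption: `content` holds the text generated so far.
                    text = snapshot.content
                }
            } catch {
                text = "Generation failed: \(error.localizedDescription)"
            }
        }
    }
}
```

A view can read `text` directly and will refresh as each snapshot arrives.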

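Ignoring context overflow: The model has a finite context window (4,096 tokens per session, shared by instructions, prompts, and responses). If the user pastes a 50 KB document, the request does not fail silently; it throws LanguageModelSession.GenerationError.exceededContextWindowSize. Check input length, trim or chunk long input proactively, and handle that error.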
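
A minimal sketch of proactive trimming plus a single retry is shown below. The character budget is an assumption standing in for a token budget, and `clipped(_:to:)` and `summarizeLongText(_:)` are hypothetical helpers.

```swift
import FoundationModels

/// Hypothetical helper: keeps at most `limit` characters of the input.
/// A character count is only a rough stand-in for a token count.
func clipped(_ text: String, to limit: Int) -> String {
    String(text.prefix(limit))
}

/// Hypothetical helper: summarizes long input, retrying once with less text.
func summarizeLongText(_ text: String) async throws -> String {
    // Assumed budget; tune it for your content and language.
    let budget = 8_000
    do {
        let session = LanguageModelSession()
        let prompt = "Summarize: \(clipped(text, to: budget))"
        return try await session.respond(to: prompt).content
    } catch LanguageModelSession.GenerationError.exceededContextWindowSize {
        // Retry once in a fresh session with a much smaller slice.
        let session = LanguageModelSession()
        let prompt = "Summarize: \(clipped(text, to: budget / 4))"
        return try await session.respond(to: prompt).content
    }
}
```

For documents that must be covered in full, summarizing chunk by chunk and then summarizing the summaries is a common alternative to clipping.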

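Forgetting to check availability: Not every device supports Foundation Models, and on supported devices Apple Intelligence may be turned off or the model may still be downloading. Check SystemLanguageModel.default.availability before use and degrade gracefully when the model is unavailable.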
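
Graceful degradation can start with mapping each unavailability reason to a message or fallback. The sketch below assumes the iOS 26 SDK; `unavailableMessage(for:)` and the message strings are hypothetical.

```swift
import FoundationModels

/// Hypothetical helper: returns a user-facing message when the model
/// cannot be used, or nil when it is ready.
func unavailableMessage(for availability: SystemLanguageModel.Availability) -> String? {
    switch availability {
    case .available:
        return nil
    case .unavailable(.deviceNotEligible):
        return "This device does not support Apple Intelligence."
    case .unavailable(.appleIntelligenceNotEnabled):
        return "Turn on Apple Intelligence in Settings to use this feature."
    case .unavailable(.modelNotReady):
        return "The model is still downloading. Try again later."
    case .unavailable:
        // Any reason added in a future SDK lands here.
        return "This feature is currently unavailable."
    }
}
```

A caller might pass `SystemLanguageModel.default.availability` and hide or disable the feature whenever the result is non-nil.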

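Ignoring guardrails: The model declines prompts or output that trip its safety guardrails, which surfaces as a guardrailViolation error. Instead of fighting it, design prompts that respect the safety guidelines and handle the error gracefully. Rephrasing the request often works.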
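
Handling a guardrail refusal as an ordinary outcome, rather than a crash or a blank screen, might look like the sketch below. It assumes the iOS 26 SDK; `RewriteOutcome` and `rewrite(_:)` are hypothetical names.

```swift
import FoundationModels

// Hypothetical result type that the UI can switch over.
enum RewriteOutcome {
    case text(String)
    case declined             // guardrails blocked the prompt or the response
    case unsupportedLanguage  // the input language or locale is not supported
}

/// Hypothetical helper: rewrites a draft and reports refusals explicitly.
func rewrite(_ draft: String) async throws -> RewriteOutcome {
    let session = LanguageModelSession(
        instructions: "Rewrite the user's draft in a friendly tone."
    )
    do {
        let response = try await session.respond(to: draft)
        return .text(response.content)
    } catch LanguageModelSession.GenerationError.guardrailViolation {
        return .declined
    } catch LanguageModelSession.GenerationError.unsupportedLanguageOrLocale {
        return .unsupportedLanguage
    }
    // Any other error propagates to the caller.
}
```

On `.declined`, the app could invite the user to rephrase instead of retrying the same prompt automatically.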