2026-06-21

Apertus – Open Foundation Model for Sovereign AI

Overall focus

Discussion centers on Apertus as a “fully open” model (weights, data, training recipes) aimed at European/Swiss “sovereign AI” and whether that matters given its current capability.

Openness, pipelines, and “SOTA”

Many value Apertus for being genuinely open: open weights, open data, full training pipeline.
Some argue true “state of the art” should mean models that can be inspected and replicated, not closed “cutting-edge” systems from frontier labs.
Others maintain that frontier lab models remain the real performance SOTA, regardless of openness.

Model quality and practical use

Earlier Apertus versions were described as “pretty bad”; some testing suggests the new ones are still not competitive with top models.
Users report it’s workable as a backbone for RAG and some agents (e.g., legal consulting, translation), but not yet “agentic” or frontier-level.
Weaknesses include hallucinations in multilingual tasks and basic language questions (e.g., conjugations, word spellings).

Training data, copyright, and ethics

Apertus uses FineWeb/Common Crawl; some criticize this as unlicensed scraping that contradicts “copyright-compliant” marketing.
Others argue scraping public web data for training is legal and that expanding copyright here would be harmful.
There’s demand for a “vegan” model trained only on licensed or public-domain data for ethical reasons.

Sovereign AI, geopolitics, and data locality

Strong theme: countries (especially in Europe) need their own AI capabilities to avoid dependence on US or Chinese tech, given concerns about US rule of law, surveillance, export controls, and political instability.
Some see Apertus and similar projects as capability-building more than immediate model competitiveness.
Debate over which jurisdictions are safest for data (US vs EU vs Switzerland vs Nordics) and whether any country is truly “safe.”

Comparison with other open models

Other fully open or near-open pipelines mentioned: OLMo 3.1, K2 Think V2, Nvidia Nemotron, plus strong Chinese models (GLM, DeepSeek, Qwen).
Consensus that Nemotron and several Chinese models currently outperform Apertus; some users prefer them in production.

Local vs service models and UX

Several argue the real near-term battleground is local vs hosted LLMs, not just open vs closed.
Local models are already “good enough” for many tasks, but tooling and UX are confusing and fragmented.
Concern that poor local UX is pushing users toward centralized, closed services, reducing digital autonomy.

Compute, licensing, and compliance

Claim that “the Swiss have no GPUs” is refuted by references to the Alps supercomputer with thousands of Grace-Hopper chips.
License includes a novel mechanism: periodically downloading a hash-based filter to remove personal data from outputs based on deletion requests; unclear how sustainable this is.
Some see Apertus mainly serving European compliance/sovereignty requirements rather than chasing peak benchmark scores.

Related topics