2026-05-15

UK sovereign LLM inference

Product and Positioning

UK-based LLM inference platform with OpenAI-compatible API, targeting cost-sensitive users and those needing UK data residency.
Runs open-source / non-US models (e.g., DeepSeek V4 Pro, Kimi, Nemotron, GPT-OSS 120B) on NVIDIA Blackwell GPUs in a UK datacentre.
Presented as suitable for regulated sectors (finance, legal, health, defence) and as a Civo spin-out focused on “sovereign inference.”

Pricing and Cost-Savings Claims

Claims “up to 80% cheaper per token” vs OpenAI; some commenters challenge this as potentially misleading if not model-for-model comparable.
Token prices are listed in GBP; some UK-based users say they still expect USD pricing.
Confusion over plans: per-seat “unlimited chat” vs per-token API usage, especially for coding tools.
Cache pricing and economics are unclear; one commenter’s back-of-envelope comparison suggests it might be more expensive than a rival when heavy caching is used.

Data Sovereignty and CLOUD Act Concerns

Strong interest in avoiding US hyperscalers and US jurisdiction (CLOUD Act).
Platform claims fully UK-incorporated ownership, UK-resident leadership, and UK datacentre; no US parent, so CLOUD Act should not apply.
Some argue “sovereign” is oversold because chips, models, and energy are imported; others say “sovereign inference” reasonably means no foreign government can directly “pull the plug.”

Privacy, Terms, and Data Usage

Privacy policy allows sharing personal data outside the UK; retention periods require emailing for “additional information.”
Terms mention a separate Data Processing Agreement that isn’t easily found, raising concerns for serious commercial use.
Desire for explicit zero-retention options and clear opt-out of training on paid plans.

Comparisons to Alternatives

Compared to OpenAI, Anthropic, OpenRouter, Novita, Doubleword, and LocAI.
Value proposition: UK jurisdiction + cost vs US-centric providers and generic model routers.
Some see this as compelling for UK public sector / regulated workloads; others say OpenRouter and local deployment already cover their needs.

Model Quality and Technical Capabilities

Acknowledgment that GPT-5 / Claude Opus still lead on hardest reasoning tasks.
Advocates say open models now match frontier models on 80–90% of real use cases at much lower cost; critics call this optimistic and note heavy optimization and prompt-engineering overhead for OSS in enterprises.
Questions raised about prompt/prefix caching support; not clearly documented.
Tooling: supports OpenAI-compatible clients and coding assistants (e.g., OpenCode, Claude Code integrations mentioned).

Market Demand for “Sovereign” AI

Several commenters report strong demand in the UK for data-resident, non-US infrastructure, especially in government-adjacent sectors.
Others are skeptical of “sovereign X” as a buzzword, or note that true sovereignty would require control over chips, models, and energy.
Some argue jurisdictions like Switzerland are preferred for privacy, while UK residency is mainly about regulatory compliance.

Branding, UX, and Communication Feedback

Mixed reactions to the name “relaxAI” and associated marketing copy; some find it generic or confusing, others think it’s fine.
Criticism that the link went straight to docs with weak “About / who we are” context and few navigation links back to the main site.
Suggestions to make the UK ownership, datacentre location, and corporate structure more prominent and linkable (e.g., to official registries).

Related topics