UK sovereign LLM inference

Product and Positioning

  • UK-based LLM inference platform with OpenAI-compatible API, targeting cost-sensitive users and those needing UK data residency.
  • Runs open-source / non-US models (e.g., DeepSeek V4 Pro, Kimi, Nemotron, GPT-OSS 120B) on NVIDIA Blackwell GPUs in a UK datacentre.
  • Presented as suitable for regulated sectors (finance, legal, health, defence) and as a Civo spin-out focused on “sovereign inference.”

Pricing and Cost-Savings Claims

  • Claims “up to 80% cheaper per token” vs OpenAI; some commenters challenge this as potentially misleading if not model-for-model comparable.
  • Token prices are listed in GBP; some UK-based users say they still expect USD pricing.
  • Confusion over plans: per-seat “unlimited chat” vs per-token API usage, especially for coding tools.
  • Cache pricing and economics are unclear; one commenter’s back-of-envelope comparison suggests it might be more expensive than a rival when heavy caching is used.

Data Sovereignty and CLOUD Act Concerns

  • Strong interest in avoiding US hyperscalers and US jurisdiction (CLOUD Act).
  • Platform claims fully UK-incorporated ownership, UK-resident leadership, and UK datacentre; no US parent, so CLOUD Act should not apply.
  • Some argue “sovereign” is oversold because chips, models, and energy are imported; others say “sovereign inference” reasonably means no foreign government can directly “pull the plug.”

Privacy, Terms, and Data Usage

  • Privacy policy allows sharing personal data outside the UK; retention periods require emailing for “additional information.”
  • Terms mention a separate Data Processing Agreement that isn’t easily found, raising concerns for serious commercial use.
  • Desire for explicit zero-retention options and clear opt-out of training on paid plans.

Comparisons to Alternatives

  • Compared to OpenAI, Anthropic, OpenRouter, Novita, Doubleword, and LocAI.
  • Value proposition: UK jurisdiction + cost vs US-centric providers and generic model routers.
  • Some see this as compelling for UK public sector / regulated workloads; others say OpenRouter and local deployment already cover their needs.

Model Quality and Technical Capabilities

  • Acknowledgment that GPT-5 / Claude Opus still lead on hardest reasoning tasks.
  • Advocates say open models now match frontier models on 80–90% of real use cases at much lower cost; critics call this optimistic and note heavy optimization and prompt-engineering overhead for OSS in enterprises.
  • Questions raised about prompt/prefix caching support; not clearly documented.
  • Tooling: supports OpenAI-compatible clients and coding assistants (e.g., OpenCode, Claude Code integrations mentioned).

Market Demand for “Sovereign” AI

  • Several commenters report strong demand in the UK for data-resident, non-US infrastructure, especially in government-adjacent sectors.
  • Others are skeptical of “sovereign X” as a buzzword, or note that true sovereignty would require control over chips, models, and energy.
  • Some argue jurisdictions like Switzerland are preferred for privacy, while UK residency is mainly about regulatory compliance.

Branding, UX, and Communication Feedback

  • Mixed reactions to the name “relaxAI” and associated marketing copy; some find it generic or confusing, others think it’s fine.
  • Criticism that the link went straight to docs with weak “About / who we are” context and few navigation links back to the main site.
  • Suggestions to make the UK ownership, datacentre location, and corporate structure more prominent and linkable (e.g., to official registries).