AI subscriptions are a ticking time bomb for enterprise
Inference economics & subsidies
- Many argue token-based inference is now sold above marginal cost; big losses come from training/R&D and data centers, not serving queries.
- Others doubt this: open and Chinese providers may also be subsidizing, mispricing depreciation, or state-backed, so their prices don’t prove profitability.
- Consensus: even if inference is profitable in isolation, it doesn’t yet cover massive capex and ongoing training needed to stay competitive.
Subscriptions vs usage-based billing
- Core article claim: flat-fee AI subscriptions massively underprice heavy agentic workloads vs API/token rates, creating a “time bomb.”
- Several commenters say large enterprises mostly already pay per token (or per seat plus metered usage) via Azure/Bedrock/enterprise contracts; the big subsidies are on consumer and small-team plans.
- GitHub Copilot’s shift from “requests” to token-based billing is cited as an example of subsidies being reeled in.
Enterprise risk and lock-in
- Some see classic “land-grab then enshittification”: cheap subs to build dependence, then price hikes once workflows are “load‑bearing.”
- Others argue switching costs between model providers are relatively low compared to cloud migration, especially as models commoditize, limiting how far prices can be pushed.
- A minority view: the real time bomb is for investors and macroeconomy (overbuilt AI infra, weak profits), not for enterprise buyers who can cut usage or switch.
Local/open models and competition
- Many report strong results from newer open-weight models (Qwen, Gemma, DeepSeek, Kimi, etc.), claiming they’re close to proprietary “frontier” quality for many tasks.
- Optimists expect enterprises to hedge by self‑hosting or mixing local models with paid APIs, capping exposure to future price hikes.
- Skeptics note that frontier-scale models still demand enormous VRAM and power; most firms and users won’t run serious workloads locally.
Hardware, scaling, and future prices
- One camp expects continued efficiency gains and more supply (GPUs, memory) to keep per-capability token prices falling, even as top models get pricier.
- Another camp points to GPU scarcity, rising energy and RAM costs, and ever-larger models and “reasoning loops” as reasons AI could become more expensive.
Quality & AI-generated prose
- Many readers react negatively to the article’s style, calling it “AI slop” full of clichéd contrastive phrases and corporate-speak.
- This fuels broader fatigue with AI-written marketing content and skepticism toward arguments that look LLM-generated, even when the underlying concerns are valid.