2026-05-25

Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing

What “Tokenmaxxing” Actually Is

Companies incentivize or require engineers to maximize AI token usage, sometimes with leaderboards and perceived ties to performance reviews.
Reported Uber numbers (via linked coverage): average AI spend of ~$150–$250 per engineer/month, with “power users” at $500–$2,000.
Some devs say they now feed models pointless tasks just to avoid looking like “low users.”

Costs, Subsidies, and Economics

Many argue current token prices are heavily subsidized by VC or model vendors; once subsidies end, heavy agentic use may rival or exceed junior salaries.
Others counter that inference is highly optimizable, open-weight models are improving fast, and labor has a hard cost floor, so AI should eventually be cheaper than humans for many tasks.
Skepticism that major model providers are truly profitable; comparisons to prior bubbles and expectations that a recession will purge weak AI offerings.

Goodhart’s Law and Management Critique

Strong consensus that “tokens used” is a bad productivity metric, likened to lines-of-code or hours-at-desk.
Example behaviors: agents chatting with each other, redundant tool calls, and feature work driven by metric-chasing rather than user value.
Some see executives as herd-followers reacting to investor/board FOMO; others defend “burn money now” as an intentional exploration phase to find high-value workflows.

Productivity, Quality, and Engineering Culture

Reported gains range from “not helpful” to ~20% boost, with many warning that massive claimed multipliers are hype.
Concerns that junior engineers using AI for everything don’t build real understanding; rise of fragile, “vibe-coded” PoCs that become production-critical.
Teams struggle to measure real ROI: many tokens can produce many low-impact features; fewer tokens can yield one high-impact change.

Local Models, Security, and Architecture

Debate over centralized data centers vs. GPUs per dev; some believe local/open models (e.g., DeepSeek, Mistral) plus internal hosting will undercut cloud APIs.
Security worries about sending sensitive data to third-party APIs, especially foreign ones; local hosting seen as a mitigation.

Uber-Specific and Broader Strategy Questions

Some question why a 16-year-old “taxi + food delivery” firm still needs thousands of engineers; others point to enormous regulatory, tax, payment, and locale-specific complexity.
Many expect a shift from “use as much AI as possible” to “use the right model, minimally, for clear business value.”

Related topics