Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing
What “Tokenmaxxing” Actually Is
- Companies incentivize or require engineers to maximize AI token usage, sometimes with leaderboards and perceived ties to performance reviews.
- Reported Uber numbers (via linked coverage): average AI spend of ~$150–$250 per engineer/month, with “power users” at $500–$2,000.
- Some devs say they now feed models pointless tasks just to avoid looking like “low users.”
Costs, Subsidies, and Economics
- Many argue current token prices are heavily subsidized by VC or model vendors; once subsidies end, heavy agentic use may rival or exceed junior salaries.
- Others counter that inference is highly optimizable, open-weight models are improving fast, and labor has a hard cost floor, so AI should eventually be cheaper than humans for many tasks.
- Skepticism that major model providers are truly profitable; comparisons to prior bubbles and expectations that a recession will purge weak AI offerings.
Goodhart’s Law and Management Critique
- Strong consensus that “tokens used” is a bad productivity metric, likened to lines-of-code or hours-at-desk.
- Example behaviors: agents chatting with each other, redundant tool calls, and feature work driven by metric-chasing rather than user value.
- Some see executives as herd-followers reacting to investor/board FOMO; others defend “burn money now” as an intentional exploration phase to find high-value workflows.
Productivity, Quality, and Engineering Culture
- Reported gains range from “not helpful” to ~20% boost, with many warning that massive claimed multipliers are hype.
- Concerns that junior engineers using AI for everything don’t build real understanding; rise of fragile, “vibe-coded” PoCs that become production-critical.
- Teams struggle to measure real ROI: many tokens can produce many low-impact features; fewer tokens can yield one high-impact change.
Local Models, Security, and Architecture
- Debate over centralized data centers vs. GPUs per dev; some believe local/open models (e.g., DeepSeek, Mistral) plus internal hosting will undercut cloud APIs.
- Security worries about sending sensitive data to third-party APIs, especially foreign ones; local hosting seen as a mitigation.
Uber-Specific and Broader Strategy Questions
- Some question why a 16-year-old “taxi + food delivery” firm still needs thousands of engineers; others point to enormous regulatory, tax, payment, and locale-specific complexity.
- Many expect a shift from “use as much AI as possible” to “use the right model, minimally, for clear business value.”