Back to Podcast Digest
The Artificial Intelligence Show Podcast19m

Uber Capped AI Spending. Most Companies Haven't Even Started Thinking About This.

TL;DR

  • Uber's cap makes AI spend feel very real: After burning through its 2026 AI budget in about four months, Uber set a $1,500 monthly cap per tool and says 10% of its code is now written by AI.

  • Even the biggest buyers think frontier models cost too much: Microsoft's Mustafa Suleyman said Anthropic is "extremely expensive," and Microsoft responded by announcing seven in-house models, including one it claims rivals Claude Opus 4.6 on a coding benchmark for less money.

  • Token usage is exploding faster than companies expected: Sam Altman said AI cost concerns appeared "quite suddenly" in early 2026, while examples cited in the segment included 603 billion tokens used in 30 days and 210 billion in a single week by OpenAI employees.

  • Model routing is becoming the practical fix: Factory says its new router can pick the cheapest capable model per task and cut costs by 25%, and Coinbase says it has kept costs roughly flat even as token usage grows exponentially by routing prompts to cheaper models when possible.

  • Outcome-based pricing sounds appealing, but only works in narrow cases: Sierra argues AI should be priced on outcomes rather than usage, but the hosts stress that autonomy and attribution determine whether outcome, usage, seat, or hybrid pricing actually makes sense.

  • AI literacy may matter as much as model quality: The hosts argue most employees do not understand when to use Opus versus Sonnet versus Haiku, which means companies are burning premium tokens on simple tasks like emails that cheaper models could handle just fine.

The Breakdown

Uber blew through its entire 2026 AI budget in four months, then capped employee spend on coding tools at $1,500 per month per tool, a sign that AI costs have suddenly become a board-level problem. The bigger reveal is that companies are now scrambling for model routing, token governance, and AI literacy before expensive frontier models get replaced by cheaper or open-source options for most work.

Was This Useful?

Share