Google is betting that the real money in AI is not in building the models. It is in running them. At Google Cloud Next in Las Vegas this week, the company announced its first chip designed specifically for serving AI models, splitting its traditional TPU (Tensor Processing Unit, the custom silicon at the center of its AI infrastructure) into two specialized lines: one for training AI systems, one for running them at scale.
Training a large AI model is expensive but happens once. Serving it, answering billions of queries from millions of users, happens every day. That ongoing workload is called inference, and Google just bet that custom silicon purpose-built for inference can undercut the general-purpose chips that currently handle most of it. The question is whether that bet is real or expensive PR.
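A back-of-the-envelope model makes the asymmetry concrete. Every number in the sketch below is a hypothetical placeholder, not a figure from the announcement:

```python
# Hypothetical cost model: one-time training vs. ongoing inference.
# All numbers are illustrative assumptions, not reported figures.
TRAINING_COST = 100e6    # assume $100M to train a frontier model, once
COST_PER_QUERY = 0.002   # assume $0.002 of compute per served query
QUERIES_PER_DAY = 500e6  # assume 500M queries per day across all users

daily_serving_cost = COST_PER_QUERY * QUERIES_PER_DAY  # $1M/day here
breakeven_days = TRAINING_COST / daily_serving_cost

print(f"Daily serving cost: ${daily_serving_cost / 1e6:.1f}M")
print(f"Serving spend passes the training bill after {breakeven_days:.0f} days")
# With these placeholders, inference overtakes the entire training cost
# in 100 days, then keeps accruing indefinitely.
```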
Google expects to ship 4.3 million TPU units in 2026, scaling to more than 35 million by 2028, The Next Web reported, citing research firm TrendForce. That is not a hardware refresh cycle. That is a signal of where Google thinks the money flows. Anthropic has committed to up to one million of Google's next-generation TPUs, with access to approximately 3.5 gigawatts of compute starting in 2027, The Next Web reported. At Anthropic's scale, a 20 percent reduction in cost per query compounds into billions of dollars annually. That is the prize.
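The arithmetic behind "billions annually" is simple. The 20 percent figure comes from the scenario above; the query volume and unit cost below are hypothetical assumptions for illustration:

```python
# What a 20% cut in cost per query means at hyperscale.
# Query volume and unit cost are illustrative assumptions; only the
# 20% savings rate comes from the scenario in the text.
QUERIES_PER_YEAR = 2e12   # assume 2 trillion served queries per year
COST_PER_QUERY = 0.005    # assume $0.005 blended compute cost per query
SAVINGS_RATE = 0.20

baseline = QUERIES_PER_YEAR * COST_PER_QUERY  # $10B/year baseline spend
savings = baseline * SAVINGS_RATE             # $2B/year saved

print(f"Baseline serving spend: ${baseline / 1e9:.0f}B per year")
print(f"Saved at 20% lower cost per query: ${savings / 1e9:.0f}B per year")
```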
The inference chip, codenamed Zebrafish and officially called TPUv8i, is built by MediaTek, which has been Google's partner on the input/output side of its custom silicon strategy since the current-generation Ironwood chip, Wccftech reported. MediaTek's designs on Ironwood ran 20 to 30 percent cheaper than comparable alternatives, The Next Web reported. The training chip, codenamed Sunfish and officially called TPUv8t, is designed by Broadcom, which commands more than 70 percent of the custom AI accelerator market and projects $100 billion in AI chip revenue by 2027. Google is now running the most diversified custom AI supply chain in the industry, an explicit bet that splitting the problem is the way to win it.
Nvidia currently dominates both training and inference. Its H100 and H200 GPUs handle the vast majority of AI workloads in hyperscale data centers. If custom inference silicon can meaningfully undercut GPU economics, it erodes Nvidia's position in the higher-volume half of the AI compute market. Broadcom, which also designs custom AI chips for Google and other clients, benefits either way. Industry analysts project custom chip sales will grow 45 percent in 2026, compared with 16 percent growth in GPU shipments, a divergence that suggests the inference economics bet is not just Google's.
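If those growth rates held beyond 2026, the gap would compound quickly. A quick extrapolation, purely illustrative since the cited projections cover a single year:

```python
# Illustrative extrapolation only: the source projects growth for 2026
# alone; holding the rates constant for several years is an assumption.
CUSTOM_GROWTH = 0.45  # projected custom AI chip sales growth, 2026
GPU_GROWTH = 0.16     # projected GPU shipment growth, 2026

ratio = 1.0  # custom-chip sales relative to GPU sales, indexed to 1.0
for year in range(1, 4):
    ratio *= (1 + CUSTOM_GROWTH) / (1 + GPU_GROWTH)
    print(f"Year {year}: custom/GPU ratio at {ratio:.2f}x the starting level")
# The ratio grows about 1.25x per year, roughly doubling in three years
# if the divergence persisted.
```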
Both TPUv8 chips target TSMC's 2-nanometer manufacturing process, with tape-out expected in late 2027, Wccftech reported. The current-generation Ironwood chip delivers ten times the peak performance of TPU v5p and more than four times the per-chip performance of TPU v6e, Google said in its Cloud Next announcement. Each Ironwood chip carries 192 gigabytes of HBM3E memory with 7.2 terabytes per second of bandwidth, and the chips scale to 9,216 per superpod, for 42.5 exaflops of FP8 compute. Google Cloud Next runs through April 24 in Las Vegas.
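The per-chip numbers fall out of those pod totals, and the ratio of compute to memory bandwidth hints at why bandwidth matters for serving. The calculation below uses only figures from the announcement; the bandwidth-bound framing is an interpretation, not Google's claim:

```python
# Sanity-check Ironwood's announced pod figures.
POD_FP8_EXAFLOPS = 42.5  # FP8 compute per superpod, from Google
CHIPS_PER_POD = 9216     # chips per superpod, from Google
HBM_BANDWIDTH_TBS = 7.2  # HBM3E bandwidth per chip, from Google

per_chip_pflops = POD_FP8_EXAFLOPS * 1e3 / CHIPS_PER_POD
print(f"Implied per-chip FP8 compute: {per_chip_pflops:.2f} PFLOPS")  # ~4.61

# Peak FLOPs available per byte moved from HBM. Serving large models is
# often limited by memory bandwidth rather than raw compute, which is
# why the 7.2 TB/s figure matters as much as the exaflops headline.
flops_per_byte = (per_chip_pflops * 1e15) / (HBM_BANDWIDTH_TBS * 1e12)
print(f"Peak FLOPs per HBM byte: {flops_per_byte:.0f}")  # ~640
```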
No independent benchmarks for the TPUv8i exist yet. The cost and latency claims that will determine whether this is a real challenge to Nvidia or a press release cannot be tested until the chips come back from their late-2027 tape-out and customers run their own workloads. The full announcement is on Google's AI Blog.
What to watch: whether the MediaTek-designed inference chip actually delivers lower per-query cost at scale, and whether any major customer besides Anthropic commits to meaningful TPUv8i volumes. If the 2027 benchmarks hold, Nvidia's inference moat is at risk. If they do not, this was an expensive sideshow, and Google will have bet the agentic era on a chip that did not deliver.