The Meter Is Running: How AI Learned to Think Cheaply

Site Owner

Published 2026-05-12

When OpenAI launched o1, chain-of-thought reasoning felt like a premium product — slow, expensive, reserved for hard problems. Twelve months later, o3-mini is cheaper than GPT-4o on hard tasks, small reasoning models are entering the market at 1/10th the cost, and thinking budgets are becoming a first-class product feature. This is how AI's most expensive feature became a commodity — and what happens next to the companies and engineers who built around it.

In 2024, a senior engineer at a mid-sized tech company ran an experiment. She gave her team two options for code review: GPT-4o for instant, stateless responses, or o1-preview for slower, chain-of-thought reasoning that actually understood context. The team chose 4o — because o1-preview cost twelve times more per conversation.

Twelve months later, that same engineer faces the same choice again. This time she hesitates, because o3-mini is cheaper than GPT-4o, and on hard problems it wins. The math has flipped.

This is the story of how the most expensive feature in AI — thinking — became a product. And what happens next is going to reshape every AI company, every AI product, and maybe every job that involves judgment.

The Original Sin of Reasoning Models

When OpenAI released o1 in September 2024, it felt like a new species. Chain-of-thought reasoning. Test-time compute. The model that "thinks before it answers." For coding competitions, math olympiads, and multi-step logic problems, it was genuinely better in ways that felt qualitatively different.

But it came with a price tag that made it a luxury item. o1-preview cost roughly $60 per million output tokens. For context: a typical AI conversation might use 2,000 output tokens. A hard reasoning session, the kind where you're asking the model to think through a genuinely complex architecture problem, could burn 10,000 to 30,000 tokens of thinking. At $60 per million tokens, that's $0.60 to $1.80 per session, versus $0.02 for a GPT-4o session of equivalent length.
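The per-session arithmetic is easy to check by hand. Here is a minimal sketch, assuming the roughly $60 per million output tokens quoted above; the function and constant names are illustrative and not part of any OpenAI SDK.

```python
def session_cost_usd(output_tokens: int, usd_per_million_tokens: float) -> float:
    """Cost of one session, counting output (including thinking) tokens only."""
    return output_tokens * usd_per_million_tokens / 1_000_000


# Assumption: the approximate o1-preview output price quoted in this article.
O1_PREVIEW_USD_PER_MILLION = 60.0

# A typical conversation, then the low and high end of a hard reasoning session.
for tokens in (2_000, 10_000, 30_000):
    cost = session_cost_usd(tokens, O1_PREVIEW_USD_PER_MILLION)
    print(f"{tokens:>6} output tokens: ${cost:.2f}")
```

At that price, a 10,000-token session comes to $0.60 and a 30,000-token session to $1.80. Input tokens are ignored here, so real bills would run somewhat higher.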
