Why did Sonnet 5's cost per task nearly double if token prices stayed the same?

Sonnet 5 consumes roughly 40 percent more output tokens per task at maximum performance than Sonnet 4.6, and runs about three times as many agent loops on knowledge work benchmarks. The model's more agentic behavior—combined with potential tokenizer changes—inflated the real bill per task from about $1.20 to $2.29, even though official rates remained at $3 per million input tokens and $15 per million output tokens.

How does Sonnet 5's performance compare to other Anthropic models?

Sonnet 5 scored 53 points in Artificial Analysis's Intelligence Index, a six-point jump over Sonnet 4.6 (47 points) and beating Opus 4.8 on some agent-based knowledge work tasks. However, it ranked fifth overall and falls short of larger models on reasoning-heavy benchmarks like CritPt, where it scored 17 percent, below GLM-5.2, Claude Opus, Fable, and GPT-5.5 in their higher configurations.

Is Anthropic offering any discount on Sonnet 5?

Anthropic is running a promotional rate of $2 or $10 per million tokens through September 1, but analyses cited in the article are based on regular prices of $3 per million input tokens and $15 per million output tokens.

Back to articlesLarge Language Models

Large Language Models

Anthropic hides price hikes in Sonnet 5 through higher token burn, not rate increases

THE DECODER3h ago6 min read

Key takeaway

Anthropic released Claude Sonnet 5, which ranks fifth in independent benchmarks but consumes roughly 40 percent more tokens per task than its predecessor, nearly doubling the real cost to $2.29 per task despite keeping official token prices unchanged. This marks the second time Anthropic has raised effective prices without changing published rates; the earlier Opus 4.7 launch saw a tokenizer change inflate token counts by approximately 30%, and developers measured a 1.325x to 1.47x cost increase. For businesses choosing between AI models, the hidden cost creep makes transparent pricing harder to compare, particularly against cheaper competitors in the mid-range segment.

Summaries like this, in your inbox every morning.

3 Key Points

What happened
Anthropic released Claude Sonnet 5, which ranked fifth in Artificial Analysis's Intelligence Index with 53 points, but the model consumes roughly 40 percent more output tokens per task at maximum performance than its predecessor Sonnet 4.6, and runs about three times as many agent loops on knowledge work benchmarks. Despite keeping official token prices flat at $3 per million input tokens and $15 per million output tokens, the real cost per task nearly doubled from about $1.20 to $2.29—now more expensive than the pricier Opus 4.8 at $1.97 per task.
Why it matters
Anthropic has repeated this pattern before; when Opus 4.7 launched, a new tokenizer inflated token counts by approximately 30%, and developer measurements found a 1.325x to 1.47x cost increase. With Sonnet 5, the combination of higher token consumption and more agentic behavior compounds the hidden cost creep. For developers and businesses evaluating AI models, real per-task pricing is now substantially higher than the official rate card suggests, making model selection less transparent—especially compared to competitors like Deepseek V4 Pro and GLM-5.2 that offer competitive performance at a lower cost in the mid-range segment.
What to watch
Anthropic is running a promotional rate of $2 or $10 per million tokens through September 1, but analyses cited in the article are based on regular prices. Sonnet 5 still lags larger models on reasoning-heavy benchmarks like CritPt (17 percent), where it scored 14 points above Sonnet 4.6 but below GLM-5.2, Claude Opus, Fable, and GPT-5.5 in their higher configurations.

FAQ

Why did Sonnet 5's cost per task nearly double if token prices stayed the same?: Sonnet 5 consumes roughly 40 percent more output tokens per task at maximum performance than Sonnet 4.6, and runs about three times as many agent loops on knowledge work benchmarks. The model's more agentic behavior—combined with potential tokenizer changes—inflated the real bill per task from about $1.20 to $2.29, even though official rates remained at $3 per million input tokens and $15 per million output tokens.
How does Sonnet 5's performance compare to other Anthropic models?: Sonnet 5 scored 53 points in Artificial Analysis's Intelligence Index, a six-point jump over Sonnet 4.6 (47 points) and beating Opus 4.8 on some agent-based knowledge work tasks. However, it ranked fifth overall and falls short of larger models on reasoning-heavy benchmarks like CritPt, where it scored 17 percent, below GLM-5.2, Claude Opus, Fable, and GPT-5.5 in their higher configurations.
Is Anthropic offering any discount on Sonnet 5?: Anthropic is running a promotional rate of $2 or $10 per million tokens through September 1, but analyses cited in the article are based on regular prices of $3 per million input tokens and $15 per million output tokens.

Discussion

No comments yet. Be the first to share your thoughts!

Genesis AI model PEARL shows drug discovery can finally work—hitting real-world accuracy thresholds

Latent Space36m ago

Meta's non-invasive brain-reading AI cuts word errors to 39%, closing gap with implants

THE DECODER36m ago

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

1 minute a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →

Anthropic hides price hikes in Sonnet 5 through higher token burn, not rate increases

Key takeaway

3 Key Points

FAQ

Discussion

Related Articles

Genesis AI model PEARL shows drug discovery can finally work—hitting real-world accuracy thresholds

Meta's non-invasive brain-reading AI cuts word errors to 39%, closing gap with implants

Springboards' Flint AI breaks LLM groupthink with wilder responses

Google brings Gemini Spark AI agent to Mac, adds task automation

OpenAI unveils three-tier Pro models for ChatGPT in benchmark paper

Anthropic quietly tracked Chinese users in Claude Code, now rolling back

Stay ahead with AI news