How much does Claude Sonnet 5 cost?

List pricing is $3/million input tokens and $15/million output tokens, with an introductory discount of $2/$10 through 31st August. However, the new tokenizer produces approximately 30% more tokens than Sonnet 4.6 for the same input, effectively raising costs by 30%.

What is Claude Sonnet 5's context window size?

It has a 1 million token context window and 128,000 maximum output tokens.

How does the tokenizer efficiency differ by language?

The same input text produces 1.42× more tokens for English, 1.33× more for Spanish, 1.27× more for Python code, and approximately the same cost for Simplified Mandarin compared to Sonnet 4.6.

Back to articlesLarge Language Models

Large Language Models

Claude Sonnet 5 launches with 1M token window, but tokenizer inflates costs

Simon Willison's Weblog8h ago5 min read

Key takeaway

Anthropic released Claude Sonnet 5 with a 1 million token context window and headline pricing matching Sonnet 4.6. However, the new tokenizer produces approximately 30% more tokens for the same input text, creating an effective 30% price increase despite lower list rates. The model performs close to the more expensive Opus 4.8 but is significantly less capable at cyber tasks than Mythos 5, allowing Anthropic to release it without US government delays.

Summaries like this, in your inbox every morning.

3 Key Points

What happened
Anthropic released Claude Sonnet 5, a new AI model with performance close to its more expensive Opus 4.8 variant. The model has a 1 million token context window and 128,000 maximum output tokens. However, the new tokenizer produces approximately 30% more tokens than Claude Sonnet 4.6 for the same input, effectively raising the price by 30% despite list pricing remaining at $3/million input and $15/million output (with an introductory discount to $2/$10 through 31st August).
Why it matters
Sonnet 5 offers high performance at lower list prices, but the tokenizer change means real-world costs are higher than they appear. The model is less capable at cyber tasks than Anthropic's more powerful Mythos 5, which allowed the company to release it without US government restrictions. For developers and businesses using Claude, the effective price increase may offset the appeal of lower headline rates.
What to watch
The tokenizer efficiency varies by language—English text costs 1.42× more per token than Sonnet 4.6, Spanish 1.33×, Python code 1.27×, and Simplified Mandarin roughly the same. Adaptive thinking is enabled by default, and the model uses the same tools and platform features as Claude Sonnet 4.6.

FAQ

How much does Claude Sonnet 5 cost?: List pricing is $3/million input tokens and $15/million output tokens, with an introductory discount of $2/$10 through 31st August. However, the new tokenizer produces approximately 30% more tokens than Sonnet 4.6 for the same input, effectively raising costs by 30%.
What is Claude Sonnet 5's context window size?: It has a 1 million token context window and 128,000 maximum output tokens.
How does the tokenizer efficiency differ by language?: The same input text produces 1.42× more tokens for English, 1.33× more for Spanish, 1.27× more for Python code, and approximately the same cost for Simplified Mandarin compared to Sonnet 4.6.

Discussion

No comments yet. Be the first to share your thoughts!

AWS launches $1 billion（約1600億円） unit to embed AI engineers inside customer organizations

Yahoo Finance AI8h ago

AI startups warned: plan for success or infrastructure will collapse

Hacker News8h ago

Mneme: single-binary memory tool for AI agents, now MCP-native

Hacker News8h ago

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

1 minute a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →

Claude Sonnet 5 launches with 1M token window, but tokenizer inflates costs

Key takeaway

3 Key Points

FAQ

Discussion

Related Articles

Genpact launches AI tool to recover lost revenue for consumer goods firms

Google Cloud restricts Gemini AI access as compute capacity tightens

Anthropic wins approval to restore Claude Fable 5 after Trump ban

AWS launches $1 billion（約1600億円） unit to embed AI engineers inside customer organizations

AI startups warned: plan for success or infrastructure will collapse

Mneme: single-binary memory tool for AI agents, now MCP-native

Stay ahead with AI news