AI model providers are raising prices sharply ahead of new hardware efficiency gains expected in early to mid 2027

Hacker NewsMay 23, 2026

Summaries like this, in your inbox every morning.

3 Key Points

OpenAI doubled prices for GPT-5.5 to $5 (input), $0.50 (cached input), and $30 (output) per million tokens. Google followed with Gemini Flash 3.5 priced between 3x and 6x more expensive than Gemini 3.1 Flash-Lite and Gemini 3 Flash Preview.
New AI-optimized GPUs and accelerators from Nvidia, AMD, AWS, Intel, and Google are being rearchitected to reduce cost per token (the expense of inference—where an AI produces an answer). Most of these systems won't see widespread deployment until early to mid 2027.
Agent systems (AI tools that make decisions and perform tasks autonomously) built on top of these models are consuming tokens orders of magnitude faster than typical chatbots, making usage-based pricing increasingly necessary. Microsoft abandoned seat-based pricing for GitHub Copilot in favor of usage-based billing.

AI-summarized, only the topics you pick — one digest a day via Email, Slack, or Discord.

Free · takes 30 seconds · unsubscribe anytime

No discussion yet for this article

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Free · takes 30 seconds · unsubscribe anytime

1 minute a day. The AI essentials.

200+ sources · Email / LINE / Slack