
Summaries like this, in your inbox every morning.
Sign up free →OpenAI doubled prices for GPT-5.5 to $5 (input), $0.50 (cached input), and $30 (output) per million tokens. Google followed with Gemini Flash 3.5 priced between 3x and 6x more expensive than Gemini 3.1 Flash-Lite and Gemini 3 Flash Preview.
New AI-optimized GPUs and accelerators from Nvidia, AMD, AWS, Intel, and Google are being rearchitected to reduce cost per token (the expense of inference—where an AI produces an answer). Most of these systems won't see widespread deployment until early to mid 2027.
Agent systems (AI tools that make decisions and perform tasks autonomously) built on top of these models are consuming tokens orders of magnitude faster than typical chatbots, making usage-based pricing increasingly necessary. Microsoft abandoned seat-based pricing for GitHub Copilot in favor of usage-based billing.
No discussion yet for this article
Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.
Get Started Free5 minutes a day. The AI essentials.
200+ sources · Email / LINE / Slack