AIToday

AWS redesigns cloud infrastructure for AI agents, decoupling compute from storage to enable instant scaling and zero idle costs

TechCrunch AI5d ago3 min read
AWS redesigns cloud infrastructure for AI agents, decoupling compute from storage to enable instant scaling and zero idle costs

Summaries like this, in your inbox every morning.

Sign up free →

3 Key Points

  1. 1

    AWS launched next-generation OpenSearch Serverless on Thursday, a fully managed search and vector database designed specifically for agentic workloads that can instantly scale up when agents trigger tasks and scale back down to zero when idle.

  2. 2

    The key technical change decouples compute from storage, allowing compute to scale up in seconds and down to zero so customers pay $0 when agents are idle—a shift from the prior Serverless version where at least one instance had to remain operational because storage and compute were coupled.

  3. 3

    At launch, OpenSearch Serverless will integrate natively with AI development platforms like Vercel and Kiro, enabling developers to deploy production-ready search and vector backends for agents without managing infrastructure.

  4. 4

    Cloudflare reported that bots accounted for 31% of overall HTTP traffic over the last six months, with AI crawlers, search engines, and assistants making up roughly a quarter of all bot requests during that period; Li Yi Ohlsen, senior product manager at Cloudflare, stated that non-human traffic will exceed human traffic sometime in the first half of 2027.

Discussion

No comments yet. Be the first to share your thoughts!

Log in to join the discussion

Related Articles

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →