AIToday

Anthropic publishes documentation of sandbox techniques used across Claude.ai, Claude Code, and Cowork

Simon Willison's Weblog3d ago1 min read

Summaries like this, in your inbox every morning.

Sign up free →

3 Key Points

  1. 1

    Anthropic released a detailed overview explaining how it constrains agent behavior using process sandboxes, virtual machines (VMs), filesystem boundaries, and egress controls—with the goal of preventing agents from accessing credentials or exfiltrating data regardless of the cause.

  2. 2

    Each Claude product uses different sandbox implementations: Claude.ai runs gVisor; Claude Code uses Seatbelt on macOS and Bubblewrap on Linux; Claude Cowork runs a full VM via Apple's Virtualization framework on macOS or HCS on Windows.

  3. 3

    The documentation includes examples of previously missed security risks, such as an api.anthropic.com/v1/files exfiltration vector that Anthropic had identified.

Discussion

No comments yet. Be the first to share your thoughts!

Log in to join the discussion

Related Articles

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →