Anthropic will now visibly tell users when its most powerful AI model downgrades requests for security reasons, after criticism that it had been silently restricting certain uses.

Fortune AI, Jun 11, 2026

3 Key Points

What happened
Anthropic released Fable 5, its most capable model to date, this week. The company's safety document revealed that the model would silently downgrade requests related to advanced AI development, for example when a researcher used it to build their own AI system. After backlash from AI researchers, Anthropic said on Wednesday that it would make these restrictions visible: flagged requests will now visibly fall back to a less capable model (Opus 4.8), and on the API, refusals will include a reason.
Why it matters
Anthropic had buried the restriction in a 319-page safety document and did not alert users when it was applied. Critics, including Jeremy Howard of Fast.ai, argued that silently downgrading access would slow AI development. The initial approach touched a nerve because it obscured how the company's own safeguards worked, a transparency problem that arrives just as AI safety measures are becoming part of national security policy.
What to watch
The company said it will continue downgrading some requests, citing both its terms of service (which prohibit using its models to build competing AI systems) and national security concerns (preventing foreign adversaries from improving their AI capabilities). Anthropic emphasized that the restrictions do not affect the vast majority of coding and ML work. Separately, the company filed confidentially for an IPO earlier this month and is in an unresolved federal court battle with the Department of War over its designation as a national security supply-chain risk.
