What AI model is Meta using for moderation now?

Meta recently switched to its own new foundation model called Muse Spark, which replaces Google's Gemini that the company had been using previously. The models are trained on past decisions made by human reviewers.

How much more accurate is Meta's AI compared to human moderators?

According to tests since March, Meta's language models make 13 percent fewer errors than humans when enforcing content policies while catching 10 percent more actual violations.

Why is Meta making this change?

The shift is expected to save the company billions annually. Meta says its language models better grasp nuance and cover more languages than traditional classifiers, which struggle with satire or evolving language.

Back to articlesLarge Language Models

Large Language Models

Meta is replacing human content moderators with AI at scale—reaching 90% for some content types by year-end—but employees warn the shift is too fast and removing legitimate content.

THE DECODER16h ago5 min read

Key takeaway

Meta is replacing roughly half of all human content moderators with AI language models in 2025, aiming to reach over 90 percent automation for some content types by year-end. The company's tests show the AI makes 13 percent fewer errors than humans and catches 10 percent more violations, but Meta employees warn the rollout is too fast and that the models incorrectly remove legitimate content without sufficient oversight.

Summaries like this, in your inbox every morning.

3 Key Points

What happened
Meta has already replaced roughly half of all human moderation requests with large language models in 2025 and plans to push that share above 90 percent for some content types by the end of the year. The company switched from using Google's Gemini for moderation to its own new foundation model called Muse Spark. According to tests since March, Meta's language models make 13 percent fewer errors than humans when enforcing content policies while catching 10 percent more actual violations.
Why it matters
The shift is expected to save the company billions annually. However, Meta employees say the models still remove or shadow-ban harmless content and there isn't enough oversight for such a rapid rollout. The transition is already leading to layoffs, especially among external contractors. Unlike traditional ML classifiers that struggle with satire or evolving language, the language models are supposed to better grasp nuance and cover more languages—but the real-world tradeoff between speed and accuracy is a live concern for how content moderation will work.
What to watch
Meta disputes the cost argument and points to quality instead, emphasizing that its tests show improved performance. The core tension is whether the company's internal quality metrics reflect what actually happens when the AI moderates at scale across diverse languages and content types.

FAQ

What AI model is Meta using for moderation now?: Meta recently switched to its own new foundation model called Muse Spark, which replaces Google's Gemini that the company had been using previously. The models are trained on past decisions made by human reviewers.
How much more accurate is Meta's AI compared to human moderators?: According to tests since March, Meta's language models make 13 percent fewer errors than humans when enforcing content policies while catching 10 percent more actual violations.
Why is Meta making this change?: The shift is expected to save the company billions annually. Meta says its language models better grasp nuance and cover more languages than traditional classifiers, which struggle with satire or evolving language.

Discussion

No comments yet. Be the first to share your thoughts!

Unable to generate summary — the article body provided contains no news content, only a newsletter registration prompt.

Nikkei AI Stocks1h ago

GitHub Copilot's shared AI harness matches rival model-vendor tools on task completion while using fewer tokens, letting developers mix-and-match models without sacrificing performance.

GitHub Copilot Blog4h ago

OpenAI will release GPT-5.6 in limited preview with Trump administration case-by-case approval, a less restrictive arrangement than the export controls imposed on rival Anthropic.

The Verge AI4h ago

The Trump administration is pressuring OpenAI to limit early access to its new GPT 5.6 model to select partners before a broader public release, shifting from its previous hands-off AI stance.

TechCrunch AI4h ago

Visa partners with AI and stablecoin firms to build payments infrastructure for autonomous agents and digital assets, diversifying revenue beyond traditional card fees.

Top Companies AI — US (1/2)7h ago

TrueFoundry acquires MLOps pioneer Seldon AI to combine infrastructure for deploying AI agents at scale, addressing a gap where only 14% of enterprises have moved AI pilots into production.

Top Companies AI — US (1/2)7h ago

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →