How is OCR 4 different from earlier OCR versions?

OCR 4 not only extracts text but also identifies where each element sits on the page and its structural role—whether it's a title, table, equation, or signature—and provides confidence scores for each word or page it reads.

How much does OCR 4 cost?

OCR 4 costs $4 per 1,000 pages in standard mode, or $2 per 1,000 pages in batch mode.

Where can I access OCR 4?

The model is available through the API, Mistral Studio, and Microsoft Foundry.

Back to articlesLarge Language Models

Large Language Models

Mistral AI releases OCR 4, a document-reading model that outperformed competitors in blind tests and is now available through multiple platforms at $4 per 1,000 pages.

THE DECODER7h ago4 min read

Key takeaway

Mistral AI has released OCR 4, a document-reading model that identifies text location and structural role within PDFs, Word, and PowerPoint files. In a blind test comparing it against competitors on over 600 documents, independent reviewers preferred OCR 4 72 percent of the time. The model supports 170 languages and is available now through multiple platforms at $4 per 1,000 pages ($2 in batch mode).

Summaries like this, in your inbox every morning.

3 Key Points

What happened
Mistral AI launched OCR 4, a model that reads text from PDFs, Word files, and PowerPoint presentations while also identifying the structural role of each element (titles, tables, equations, signatures) and assigning confidence scores to its readings. In blind tests with over 600 documents, independent reviewers preferred OCR 4's output 72 percent of the time over competing models.
Why it matters
Unlike earlier OCR versions that extract raw text only, OCR 4 organizes documents into meaningful sections automatically—useful for feeding data into search systems or letting AI agents process them. The model supports 170 languages, including less common ones, making it potentially valuable for businesses that handle documents in multiple languages.
What to watch
OCR 4 is available now through the API, Mistral Studio, and Microsoft Foundry. Pricing is $4 per 1,000 pages in standard mode or $2 in batch mode.

FAQ

How is OCR 4 different from earlier OCR versions?: OCR 4 not only extracts text but also identifies where each element sits on the page and its structural role—whether it's a title, table, equation, or signature—and provides confidence scores for each word or page it reads.
How much does OCR 4 cost?: OCR 4 costs $4 per 1,000 pages in standard mode, or $2 per 1,000 pages in batch mode.
Where can I access OCR 4?: The model is available through the API, Mistral Studio, and Microsoft Foundry.

Discussion

No comments yet. Be the first to share your thoughts!

OpenAI and Broadcom have jointly developed Jalapeño, a custom chip designed specifically for running large language models, with deployment planned for late 2026 at gigawatt scale.

THE DECODER1h ago

AI text detector Pangram says language models are detectable because they produce uniform arguments, clustering in narrow bands unlike the diversity of human reasoning.

THE DECODER4h ago

Seltz, an AI-native search startup, raises $12.5 million（約20億円） to build an alternative search engine designed for AI agents rather than human users.

Fortune AI4h ago

Tecan integrates AI agents into its lab analytics platform using NVIDIA's toolkit, enabling laboratories to prevent operational issues proactively rather than react to them after they occur.

Yahoo Finance AI7h ago

Anthropic launched Claude Tag, a Slack integration that lets teams delegate work to Claude as a persistent team member rather than a chat interface.

Latent Space7h ago

Anthropic launches Claude Tag, embedding its AI directly into Slack for team collaboration, with internal testing showing the tool already generates 65 percent of code on the company's product team.

THE DECODER7h ago

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →