The Circuitry
Verification
VERIFIED · Confidence: HIGH
Source identified
Claims cross-referenced
No discrepancies found
Fact-check summary

Mistral AI's official June 23 announcement confirms OCR 4 launch with bounding boxes, typed block classification, and confidence scores.

Sourcing
1 source

via Mistral

VERIFIED · By Xavier Rivera · 2 min read

Mistral launches OCR 4 featuring bounding boxes and typed block classification

Mistral launched OCR 4 on June 23, 2026, adding bounding boxes, typed-block classification, and inline confidence scores across 170 languages in 10 groups. The single-container model serves as a self-hostable ingestion component for enterprise search, RAG, and domain-specific pipelines while posting top benchmark results.

Source: Mistral
TL;DR · AI · 60 sec read

Mistral AI launched OCR 4 on June 23, 2026. The model extracts text from documents with bounding boxes, typed-block classifications, and confidence scores. Independent annotators preferred it in 72 percent of comparisons on average, and it scores 85.20 on OlmOCRBench. The structured output integrates directly with the Mistral Search Toolkit to improve RAG and enterprise document workflows.

Mistral AI introduced OCR 4 on June 23, 2026. The compact model extracts document content while supplying bounding boxes for localization, typed-block classification for elements such as titles, tables, equations and signatures, plus inline confidence scores at both page and word levels.
Mistral OCR 4 announcement graphic showing bounding boxes and block classification · Mistral AI
Mistral reports strong benchmark results for OCR 4. Independent annotators preferred its output over that of the other leading OCR and document-AI systems evaluated, with an average win rate of 72 percent. It also posted the highest overall score on OlmOCRBench, at 85.20. The company noted known limitations of those evaluation methods in its release notes.
Unlike earlier versions centered on producing clean text and tables, this iteration returns a structured document view. Each identified block carries spatial coordinates, a functional category, and reliability metrics. Those additions reportedly support tasks including in-context highlighting, source-grounded citations, redactions, and human-in-the-loop review.
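
As a rough illustration of how such a structured view might be consumed, the sketch below models a block with coordinates, a category, and a confidence score, then routes low-confidence blocks to human review. The `Block` class, its field names, the block-type strings, the sample values, and the 0.80 threshold are all assumptions made for this example; they are not Mistral's actual response schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical structured block; field names are illustrative only."""
    block_type: str                          # e.g. "title", "table", "signature"
    bbox: tuple[float, float, float, float]  # assumed (x0, y0, x1, y1) page coordinates
    text: str
    confidence: float                        # assumed to lie in [0.0, 1.0]


def split_for_review(blocks, threshold=0.80):
    """Separate blocks into auto-accepted and needs-review lists."""
    accepted, review = [], []
    for block in blocks:
        (accepted if block.confidence >= threshold else review).append(block)
    return accepted, review


# Made-up sample data, not real model output.
blocks = [
    Block("title", (72.0, 40.0, 540.0, 70.0), "Master Services Agreement", 0.99),
    Block("table", (72.0, 120.0, 540.0, 380.0), "Fee schedule", 0.91),
    Block("signature", (300.0, 700.0, 520.0, 760.0), "J. Doe", 0.62),
]

accepted, review = split_for_review(blocks)
for block in review:
    # The bounding box tells a reviewer where on the page to look.
    print(f"review {block.block_type} at {block.bbox}: {block.text!r}")
```

The same coordinates could in principle drive in-context highlighting or redaction, since each flagged block carries its own location on the page.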
Comparison chart or benchmark results for OCR 4 vs other systems · Mistral AI
Integration with Mistral Search Toolkit is now available in public preview. The model functions as an ingestion layer inside the open-source, composable search framework that Mistral unveiled at the AI Now Summit. Its structured output feeds directly into the toolkit’s retrieval, evaluation, and RAG workflows.
Post from @MistralAI: official announcement tweet from Mistral AI introducing OCR 4
https://x.com/MistralAI/status/2069420263825895917
OCR 4 handles standard enterprise file types such as PDF, DOC, PPT, and OpenDocument formats. It shows measurable improvements on rare and low-resource languages where many rival tools reportedly falter. The model fits inside a single container, enabling fully self-hosted operation that satisfies data residency, sovereignty, and compliance needs while permitting efficient batch processing at scale.
Pricing and access paths target both developers and enterprise teams. Access through the API costs $4 per 1,000 pages, with a 50 percent discount available for Batch-API calls. Developers call the service programmatically; non-technical teams can reach the same engine via the no-code Document AI interface in Mistral Studio. Self-managed deployment remains restricted to enterprise customers.
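
To make the pricing concrete, here is a small cost estimate built from the two figures above: $4 per 1,000 pages and a 50 percent Batch-API discount. The function name is hypothetical, and the sketch assumes charges scale linearly per page, which the announcement as reported here does not spell out.

```python
from decimal import Decimal

PRICE_PER_1000_PAGES = Decimal("4.00")  # USD, as stated in the article
BATCH_DISCOUNT = Decimal("0.50")        # 50 percent off for Batch-API calls


def estimate_cost(pages: int, batch: bool = False) -> Decimal:
    """Estimate the API cost in USD, assuming linear per-page pricing."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    cost = PRICE_PER_1000_PAGES * pages / 1000
    if batch:
        cost *= 1 - BATCH_DISCOUNT
    return cost.quantize(Decimal("0.01"))  # round to whole cents


print(estimate_cost(250_000))              # 1000.00
print(estimate_cost(250_000, batch=True))  # 500.00
```

Under these assumptions, a 250,000-page archive would cost about $1,000 through the standard API and about $500 through the Batch API.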

The release also positions the model for semantic chunking inside RAG systems, for agentic primitives that enable form filling or compliance checks, and for consistent typed feeds into indexing pipelines. Mistral provided additional direction on selecting between the model API and Document AI depending on use case.
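
One way typed blocks could support semantic chunking is to start a new chunk at every title block, so each retrieval unit follows the document's own sections instead of a fixed character count. The sketch below is a generic illustration of that idea, not Mistral's method; the `(block_type, text)` pair format, the type names, and the sample data are assumptions.

```python
def chunk_by_title(blocks):
    """Group (block_type, text) pairs into chunks that begin at each title."""
    chunks = []
    current = None
    for block_type, text in blocks:
        is_title = block_type == "title"
        if is_title or current is None:
            # Open a new chunk; content before any title gets heading None.
            current = {"heading": text if is_title else None, "body": []}
            chunks.append(current)
            if is_title:
                continue
        current["body"].append((block_type, text))
    return chunks


# Made-up sample data, not real model output.
blocks = [
    ("title", "1. Scope"),
    ("paragraph", "This agreement covers hosting services."),
    ("table", "Fee schedule"),
    ("title", "2. Term"),
    ("paragraph", "The term is 12 months."),
]

for chunk in chunk_by_title(blocks):
    print(chunk["heading"], len(chunk["body"]))
```

Keeping the block type alongside the text means a downstream indexer could treat tables and paragraphs differently without re-parsing the page.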
AI · OCR · Mistral

MORE IN TECH

O2 sets summer 2029 start for UK 2G switch-off

Virgin Media O2 will begin switching off its 2G network in summer 2029, joining BT/EE and Vodafone in a government-coordinated UK phase-out. The move affects not only legacy phones but also smart meters, telecare alarms and other IoT devices that still rely on the 32-year-old technology.

Claude outage resolved after spiking thousands of reports

Anthropic's Claude chatbot suffered a widespread outage on June 23, 2026 that affected the chat interface and Claude Code before being fixed within the hour. The incident highlights recurring instability for the AI service this month.

Meta Debuts First $299 Own-Brand Smart Glasses

Meta introduced its first own-brand smart glasses with the $299 Adventurer and Fury plus a $399 Starfire model created with Kylie Jenner. The release targets expansion of the company's camera-equipped lineup before Apple's anticipated 2027 debut.