AI-powered OCR for images and PDFs. Extracts structured text, Markdown, and tables from scanned documents, forms, invoices, receipts, and spreadsheets, powered by Mistral AI.
Traditional OCR (like Tesseract) struggles with tables, multi-column layouts, handwriting, and low-quality scans. Mistral AI OCR understands document structure, preserves table formatting in Markdown, and handles PDFs with mixed text and images, including scanned Brazilian notas fiscais, contracts, and forms.
| Field | Type | Description |
|---|---|---|
| image | string | JPEG or PNG as base64 (data:image/... prefix) or public URL. Max 10MB. |
| | string | PDF as base64 (data:application/pdf;base64,...) or public URL. Max 10MB. |
| pages | number[] | Optional. Array of 0-based page indices to process. Default: all pages. |
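
The request fields above could be assembled along these lines. This is a minimal sketch, not the service's client: the helper names `to_data_uri` and `build_payload` are hypothetical, the 10MB limit is assumed to mean 10 MiB of raw file bytes, and because the PDF field's name is not shown in the table, the field name is passed in as an argument rather than hard-coded. Sending the request and handling payment are left out.

```python
import base64
import json
from typing import List, Optional

# Assumption: "Max 10MB" is read as 10 MiB of raw (pre-base64) file bytes.
MAX_BYTES = 10 * 1024 * 1024


def to_data_uri(data: bytes, mime: str) -> str:
    """Encode raw file bytes as a base64 data URI, e.g. data:image/png;base64,..."""
    if len(data) > MAX_BYTES:
        raise ValueError(f"file is {len(data)} bytes; limit is {MAX_BYTES}")
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def build_payload(field: str, source: str, pages: Optional[List[int]] = None) -> dict:
    """Build a JSON-serializable body; `source` is a data URI or a public URL."""
    payload = {field: source}
    if pages is not None:
        # Page indices are 0-based, so negative values are never valid.
        if any(p < 0 for p in pages):
            raise ValueError("page indices are 0-based and must be non-negative")
        payload["pages"] = pages
    return payload


if __name__ == "__main__":
    placeholder = b"\x89PNG\r\n\x1a\n"  # PNG signature only, not a real image
    body = build_payload("image", to_data_uri(placeholder, "image/png"))
    print(json.dumps(body))
```

Omitting `pages` leaves the key out of the body entirely, which matches the documented default of processing all pages.
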
| Feature | /image/ocr (Tesseract) | /document/ocr (Mistral AI) |
|---|---|---|
| Tables | Plain text | Structured Markdown |
| PDF support | Images only | Native PDF |
| Multi-column layouts | Often garbled | Preserves structure |
| Price | $0.002/req | $0.015/req |
| Best for | Simple text images | Complex docs, PDFs, forms |
$0.015 per request via x402 micropayment on Base (USDC). No monthly fees. Pay per use.
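
Because pricing is purely per request, budgeting is a single multiplication. The sketch below uses the two per-request prices listed in the comparison table; the function name `total_cost` is hypothetical, and `Decimal` is used only to avoid floating-point rounding on small dollar amounts.

```python
from decimal import Decimal

# Per-request prices in USD, taken from the comparison table.
PRICE_PER_REQUEST = {
    "/image/ocr": Decimal("0.002"),
    "/document/ocr": Decimal("0.015"),
}


def total_cost(endpoint: str, requests: int) -> Decimal:
    """Return the total USD cost of `requests` calls to `endpoint`."""
    if requests < 0:
        raise ValueError("requests must be non-negative")
    return PRICE_PER_REQUEST[endpoint] * requests


if __name__ == "__main__":
    for endpoint in PRICE_PER_REQUEST:
        print(f"{endpoint}: ${total_cost(endpoint, 1000):.2f} per 1,000 requests")
```

At these prices, 1,000 requests cost $15.00 on `/document/ocr` and $2.00 on `/image/ocr`.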