Running OCR...
This may take a moment for multi-page documents
Back to Home

Drop your scanned PDF here

or click to browse files (max 50 MB)

file.pdf 0 KB

How it works

1

PDF pages are converted to images using Ghostscript

2

Tesseract OCR reads text from each page image

3

Extracted text is compiled into a .txt file for download

Server Requirements

Ghostscript — Required for PDF-to-image conversion. Download from ghostscript.com
Tesseract OCR — Required for text recognition. Download from tesseract-ocr.github.io. Language packs must be installed separately for non-English languages.

Extracted Text Preview