Extract text from scanned documents and image-based PDFs using optical character recognition.
or click to browse files (max 50 MB)
PDF pages are converted to images using Ghostscript
Tesseract OCR reads text from each page image
Extracted text is compiled into a .txt file for download