Learn / OCR

How to Extract Text from a Scanned PDF

Scanned PDFs store pages as images — you can't copy the text. OCR changes that. Here's how to extract text from any scanned document for free.

Ready to extract text? Use the free OCR tool.

Extract Text with OCR

Why You Can't Copy Text from a Scanned PDF

Scanned PDFs are different from regular PDFs. When a scanner digitizes a paper document, it photographs each page and embeds the photos inside a PDF container. The resulting file looks like a document, but it contains no actual text — just images.

That's why clicking on a word doesn't select it, and Ctrl+F finds nothing. OCR (Optical Character Recognition) analyzes those images pixel by pixel, identifies characters, and converts them into a real, selectable text layer.

Two Ways to Extract Text from a Scanned PDF

Option 1: Make the PDF Searchable (Keeps Original Layout)

Use this if you want to keep the PDF looking the same but gain the ability to search and select text within it.

1. Upload to OCR Scanner

2. Select language → Run OCR

3. Download the searchable PDF

Option 2: Export Text as a Plain TXT File

Use this if you want the raw text to paste into another app, analyze, or edit freely.

1. Upload to OCR Scanner → Run OCR

2. Download the searchable PDF

3. Upload that PDF to PDF to TXT

4. Download the plain text file

How to Improve Text Extraction Accuracy

  • Scan at 300 DPI or higher. Most scanner apps let you change the resolution setting. Higher DPI means sharper images, which means better OCR.
  • Keep the document flat and straight. Curved pages (from book scanning) and tilted pages confuse OCR engines. A flatbed scanner gives the best results.
  • Use good lighting. For phone photos, make sure there are no shadows falling across the text. Use Phone Scan Cleanup to improve contrast before OCR.
  • Select the right language. OCR engines use language dictionaries to resolve ambiguous characters — matching the language improves results noticeably.

Common Use Cases

  • Contracts: Extract clause text to paste into a summary or compare documents
  • Receipts and invoices: Pull amounts and dates into a spreadsheet
  • Research papers: Quote specific passages without re-typing them
  • Medical records: Copy doctor's notes into a health app
  • Historical documents: Digitize and make archives searchable

Extract Text from Your Scanned PDF

Free OCR — 10 conversions/day. No software or signup required.

Extract Text Now

Frequently Asked Questions

How do I extract text from a scanned PDF?

Upload your scanned PDF to PDF.it's OCR Scanner. The tool reads every page and adds a real text layer, so you can then select and copy any text, or export the full text as a TXT file.

Can I extract text from a scanned PDF for free?

Yes. PDF.it gives you 10 free OCR conversions per day. The first 3 don't require an account.

What if I want the text as a plain TXT file?

First run OCR to make the scanned PDF searchable, then use the PDF to TXT tool to extract the text layer as a plain text file. Two steps, both free.

Why does the extracted text have errors?

OCR accuracy depends on scan quality. Low-resolution scans, skewed pages, or faded ink reduce accuracy. Rescan at 300 DPI or use Phone Scan Cleanup to improve image quality before running OCR.

Can I extract text from a multi-page scanned PDF?

Yes. PDF.it's OCR Scanner processes every page in your document in a single pass. There's no page limit — larger documents just take a few seconds longer.