Can GrabText process scanned PDFs without a text layer?
Yes. GrabText automatically detects whether a PDF contains a native text layer. Scanned or image-based PDFs without one are run through Tesseract OCR. The OCR language can be set manually or left on Auto.
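As a rough illustration of the text-layer check, the decision can be thought of as "does this page yield any meaningful text when read directly?". The sketch below is not GrabText's actual detection logic: the function name `needs_ocr` and the `min_chars` threshold are hypothetical, and it assumes the page text has already been extracted by some PDF library.

```python
def needs_ocr(extracted_text: str, min_chars: int = 20) -> bool:
    """Return True when a page's text layer looks empty or near-empty.

    Hypothetical heuristic: the threshold of 20 characters is an assumption,
    not a documented GrabText setting.
    """
    # Count only non-whitespace characters so a blank text layer does not pass.
    visible = sum(1 for ch in extracted_text if not ch.isspace())
    return visible < min_chars
```

A page that returns an empty string or only whitespace would be routed to OCR, while a page with ordinary selectable text would be read directly.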
How do I select specific pages?
In the advanced options, use the Page Selection field. Enter ranges like 1-3 or individual pages like 1,3,5 to extract only the relevant part of the PDF.
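To make the syntax concrete, here is a minimal sketch of how a value such as 1-3 or 1,3,5 could be expanded into page numbers. It is not GrabText's own parser: the function name `parse_page_selection` is hypothetical, and the sketch assumes that ranges and single pages can be mixed in one entry (for example 1-3,5) and that pages are numbered from 1.

```python
def parse_page_selection(spec: str, page_count: int) -> list[int]:
    """Expand a selection like "1-3,5" into a sorted list of page numbers.

    Hypothetical helper; mixing ranges and single pages is an assumption.
    """
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas such as "1,,3"
        if "-" in part:
            start_s, end_s = part.split("-", 1)
            start, end = int(start_s), int(end_s)
        else:
            start = end = int(part)
        # Reject pages outside the document and reversed ranges.
        if start < 1 or end > page_count or start > end:
            raise ValueError(f"invalid page selection: {part!r}")
        pages.update(range(start, end + 1))
    return sorted(pages)


print(parse_page_selection("1-3", 10))    # [1, 2, 3]
print(parse_page_selection("1,3,5", 10))  # [1, 3, 5]
```

Duplicates and overlapping ranges collapse into a single entry, so each page is extracted at most once.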
Which languages does OCR support?
OCR supports German, English, Spanish, French, Italian, Portuguese and Dutch, either individually or combined for multilingual documents.
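For readers who know Tesseract directly: its language option takes three-letter codes and joins several languages with a plus sign, for example deu+eng. The sketch below maps the languages listed above to those codes. How GrabText passes the selection to Tesseract internally is an assumption, and the helper name `tesseract_lang` is hypothetical.

```python
# Tesseract language codes for the languages listed above.
TESSERACT_CODES = {
    "German": "deu",
    "English": "eng",
    "Spanish": "spa",
    "French": "fra",
    "Italian": "ita",
    "Portuguese": "por",
    "Dutch": "nld",
}


def tesseract_lang(*languages: str) -> str:
    """Build a Tesseract language string such as "deu+eng" (hypothetical helper)."""
    return "+".join(TESSERACT_CODES[name] for name in languages)


print(tesseract_lang("German", "English"))  # deu+eng
```

An unsupported language name raises a `KeyError`, which mirrors the fact that only the seven languages above are available.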
What is the maximum file size?
Up to 20 MB per file. Larger PDFs can be split before uploading. Multi-page PDFs are processed page by page.
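A quick local check before uploading can save a failed attempt. The sketch below assumes that 1 MB means 1,048,576 bytes; the FAQ does not say whether the limit is counted in decimal or binary megabytes, so treat the constant as an assumption. The function name `fits_upload_limit` is hypothetical.

```python
import os

# Assumption: 20 MB counted as 20 * 1024 * 1024 bytes.
MAX_BYTES = 20 * 1024 * 1024


def fits_upload_limit(path: str, max_bytes: int = MAX_BYTES) -> bool:
    """Return True if the file at `path` is within the upload limit."""
    return os.path.getsize(path) <= max_bytes
```

If the check returns False, split the PDF into smaller parts and upload them separately.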
What is the difference between Markdown and Plain Text?
Markdown preserves document structure: headings are marked with # and list items with -. Plain Text contains only the raw text, without formatting. Markdown is recommended for AI tools.
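The difference is easiest to see side by side. The sketch below takes a small Markdown sample and strips the heading and list markers to show what the equivalent plain text looks like. It only illustrates the two formats; it is not how GrabText generates its Plain Text output, and the function name `markdown_to_plain` is hypothetical.

```python
import re


def markdown_to_plain(markdown: str) -> str:
    """Strip leading heading (#) and list (-) markers from each line."""
    lines = []
    for line in markdown.splitlines():
        line = re.sub(r"^\s*#{1,6}\s+", "", line)  # heading markers
        line = re.sub(r"^\s*-\s+", "", line)       # list markers
        lines.append(line)
    return "\n".join(lines)


sample = "# Invoice\n- Item A\n- Item B"
print(markdown_to_plain(sample))
# Invoice
# Item A
# Item B
```

In the Markdown version a downstream tool can still tell that "Invoice" is a heading and that the two items form a list; in the plain version that information is gone.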