Not all documents read the same.
All four can become data with a codebook.
Text
Prose that's already text. Emails, articles, transcripts, clean PDFs.
Read the words.
Scientific
Manuscripts, patents, reports. Figures, tables, equations — the evidence isn't only in the prose.
Read the text — and see the figures.
Visual
Forms, invoices, typed letters, well-scanned pages. The layout carries meaning.
Look before you read.
Squint
Ledgers, registries, field notes. The same structure on every page — clean or damaged.
Read each page the same way.
On Data Mint, we add more reader modes as we find document types that need them.
Four disciplines of reading. One extraction, on Data Mint.
No matter where you interface with AI: pick the reader that fits the document.
@karlrohe