Skip to content

Interface: OcrConfigDto

@kortexya/reasoninglayer


@kortexya/reasoninglayer / Ingestion / OcrConfigDto

Interface: OcrConfigDto

Defined in: src/types/ingestion.ts:352

Configuration for document parsing (OCR).

Properties

enableFallback?

optional enableFallback: boolean

Defined in: src/types/ingestion.ts:354

Fall back to alternative parser on failure.


extractImages?

optional extractImages: boolean

Defined in: src/types/ingestion.ts:356

Extract embedded images.


extractTables?

optional extractTables: boolean

Defined in: src/types/ingestion.ts:358

Extract tables as markdown.


forceOcr?

optional forceOcr: boolean

Defined in: src/types/ingestion.ts:360

Force OCR even for text-based PDFs.


languages?

optional languages: string[]

Defined in: src/types/ingestion.ts:362

Language hints for OCR (ISO 639-1 codes).


parser?

optional parser: DocumentParser

Defined in: src/types/ingestion.ts:364

Primary parser to use.


useLlmEnhancement?

optional useLlmEnhancement: boolean

Defined in: src/types/ingestion.ts:366

Use LLM to improve table/formula accuracy.