DCR-CORE - Application - Document Language
1. Overview
DCR-CORE supports the processing of documents in different languages.
A language can only be used if Pandoc (or Babel), spaCy, and Tesseract OCR all support it.
2. Default Document Language
The default document language is English.
| Application | Default value |
|---|---|
| Pandoc | en |
| spaCy | en_core_web_trf |
| Tesseract OCR | eng |
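
As an illustration only, the default values in the table could be grouped into one record per language, so that all three tools are always configured together. The sketch below is not taken from the DCR-CORE code base: the names `LanguageCodes`, `LANGUAGES`, and `codes_for` are hypothetical, and only the three English values come from this document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LanguageCodes:
    """Per-tool identifiers for one document language (hypothetical helper)."""

    pandoc: str      # language code passed to Pandoc
    spacy: str       # name of the spaCy model
    tesseract: str   # language code used by Tesseract OCR


# Default values as documented in the table above.
DEFAULT_LANGUAGE = LanguageCodes(
    pandoc="en",
    spacy="en_core_web_trf",
    tesseract="eng",
)

# Hypothetical registry keyed by language name; only English is documented here.
LANGUAGES = {"English": DEFAULT_LANGUAGE}


def codes_for(language=None):
    """Return the identifiers for a language, or the default if none is given."""
    if language is None:
        return DEFAULT_LANGUAGE
    try:
        return LANGUAGES[language]
    except KeyError:
        raise ValueError(f"unsupported document language: {language!r}") from None
```

Calling `codes_for()` without an argument returns the English defaults, while an unregistered language raises a `ValueError` instead of silently falling back, which mirrors the requirement that a language must be supported by all three tools.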