DCR-CORE - Application - Document Language

1. Overview

DCR-CORE can process documents in different languages. A language can only be used if it is supported by Pandoc (or Babel), spaCy and Tesseract OCR.

2. Default Document Language

The default document language is English.

Application    Content
Pandoc         en
spaCy          en_core_web_trf
Tesseract OCR  eng
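
As an illustration only, the default values listed above could be held in a small lookup structure like the following. This is a minimal sketch, not DCR-CORE's actual implementation: the names `LanguageCodes`, `LANGUAGES`, `DEFAULT_LANGUAGE` and `codes_for` are hypothetical, and only the English entry is taken from this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LanguageCodes:
    """Per-application codes for one document language (hypothetical helper)."""

    pandoc: str     # language code used for Pandoc
    spacy: str      # name of the spaCy model
    tesseract: str  # Tesseract OCR language code


# Only the English entry comes from this page; further languages would be
# added the same way once all three applications support them.
DEFAULT_LANGUAGE = "English"
LANGUAGES = {
    DEFAULT_LANGUAGE: LanguageCodes(
        pandoc="en",
        spacy="en_core_web_trf",
        tesseract="eng",
    ),
}


def codes_for(language: str = DEFAULT_LANGUAGE) -> LanguageCodes:
    """Return the codes for a language, or raise if it is not configured."""
    try:
        return LANGUAGES[language]
    except KeyError:
        raise ValueError(f"Unsupported document language: {language!r}") from None
```

Calling `codes_for()` without an argument returns the English defaults, while an unconfigured language raises a `ValueError` instead of silently falling back.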