Add tesseract homepage link and note on how to add extra languages.
@@ -2,8 +2,9 @@
 OCR backend
 ===========
 
-Mayan EDMS ships an OCR backend that uses the FLOSS engine Tesseract, but it can
-use other engines. To support other engines a wrapper that subclasess the
+Mayan EDMS ships an OCR backend that uses the FLOSS engine Tesseract
+(https://github.com/tesseract-ocr/tesseract/), but it can
+use other engines. To support other engines crate a wrapper that subclasess the
 ``OCRBackendBase`` class defined in mayan/apps/ocr/classes. This subclass should
 expose the ``execute`` method. For an example of how the Tesseract backend
 is implemented take a look at the file ``mayan/apps/ocr/backends/tesseract.py``
@@ -13,3 +14,8 @@ OCR_BACKEND and point it to your new OCR backend class path.
 
 The default value of OCR_BACKEND is ``"ocr.backends.tesseract.Tesseract"``
 
+To add support to OCR more languages when using Tesseract, install the
+corresponding language file. If using a Debian based OS, this command will
+display the available language files:
+
+apt-cache search tesseract-ocr
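Review note: the documentation touched by this commit describes a custom backend only in prose, so a rough sketch of the idea may help readers. The code below is an illustration under stated assumptions, not Mayan EDMS code. ``OCRBackendBase`` is redefined here as a stand-in so the example runs on its own, the ``execute`` signature is a guess, and ``CommandLineBackend`` is a hypothetical name. The shipped ``mayan/apps/ocr/backends/tesseract.py`` remains the reference for the real interface.

```python
import subprocess
import sys


class OCRBackendBase:
    """Stand-in for the base class in mayan/apps/ocr/classes (assumed shape)."""

    def execute(self, *args, **kwargs):
        raise NotImplementedError('Subclasses must provide execute()')


class CommandLineBackend(OCRBackendBase):
    """Hypothetical backend that delegates OCR to an external program."""

    def __init__(self, command):
        # command: list of argv tokens; the input path is appended at run time.
        self.command = list(command)

    def execute(self, input_path):
        # Run the engine and return whatever text it prints to stdout.
        result = subprocess.run(
            self.command + [input_path],
            capture_output=True, text=True, check=True,
        )
        return result.stdout.strip()


if __name__ == '__main__':
    # The Python interpreter stands in for an OCR engine here:
    # it simply echoes the path it is given.
    backend = CommandLineBackend(
        [sys.executable, '-c', 'import sys; print(sys.argv[1])']
    )
    print(backend.execute('page-1.png'))
```

Under these assumptions, OCR_BACKEND would be set to the dotted path of the new class, in the same form as the default ``"ocr.backends.tesseract.Tesseract"``.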
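Review note: after installing a language file (on Debian the packages follow the ``tesseract-ocr-<code>`` pattern, for example ``tesseract-ocr-spa``), it can be useful to confirm that Tesseract actually sees it. The sketch below relies on the ``tesseract --list-langs`` command-line option; the helper names are hypothetical, and the header-line handling is an assumption about the output format, which varies a little between Tesseract releases.

```python
import shutil
import subprocess


def parse_language_list(output):
    """Return language codes from `tesseract --list-langs` style output."""
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    # Assumption: the header line starts with "List of available languages".
    return [line for line in lines if not line.lower().startswith('list of')]


def installed_languages():
    """Ask the tesseract binary which languages it can use; [] if not installed."""
    if shutil.which('tesseract') is None:
        return []
    result = subprocess.run(
        ['tesseract', '--list-langs'], capture_output=True, text=True
    )
    # Some releases print the list on stderr, so read both streams.
    return parse_language_list(result.stdout + '\n' + result.stderr)


if __name__ == '__main__':
    print(installed_languages())
```

If the newly installed language code is missing from the result, the language file was probably installed somewhere Tesseract does not look.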