Roberto Rosario
799d47054e
Load the converter class on demand
...
Signed-off-by: Roberto Rosario <roberto.rosario.gonzalez@gmail.com >
2019-04-27 02:47:23 -04:00
Roberto Rosario
36a51eeb73
Switch to full app paths
...
Instead of inserting the path of the apps into the Python app,
the apps are now referenced by their full import path.
This solves name clashes with external or native Python libraries.
Example: Mayan statistics app vs. Python new statistics library.
Every app reference is now prepended with 'mayan.apps'.
Existing config.yml files need to be updated manually.
Signed-off-by: Roberto Rosario <roberto.rosario.gonzalez@gmail.com >
2019-04-05 02:02:57 -04:00
Roberto Rosario
317d07a355
Refactor OCR app. Removes document parsing. Moves OCR processing to
...
model manager. Add submit and finish events.
Signed-off-by: Roberto Rosario <roberto.rosario.gonzalez@gmail.com >
2017-08-23 02:04:57 -04:00
Roberto Rosario
43d2539c95
Update OCR backend to work with the new document image caching system.
2016-11-02 05:05:25 -04:00
Roberto Rosario
d04117d345
PEP8 and code style cleanups. Replace lists with tuples.
2015-08-12 04:41:59 -04:00
Roberto Rosario
bec85f38f4
Text parsers and OCR backends are now used in tandem for each document.
2015-08-08 04:49:08 -04:00
Roberto Rosario
8382df91a6
Update PDF text parser classes. Remove SlateParser and substitute with a PDFMiner based parser.
2015-07-31 02:09:48 -04:00
Roberto Rosario
4527563d89
PEP8 cleanups, specially E501 line too long.
2015-07-22 18:21:37 -04:00
Roberto Rosario
47a74360dd
Remove double execution of backend. Store the language in the instance.
2015-07-08 04:15:58 -04:00
Roberto Rosario
48df3dcafa
PEP8 cleanups
2015-06-24 17:11:24 -04:00
Roberto Rosario
e4623fadcd
PEP8 cleanups
2015-06-23 02:23:23 -04:00
Roberto Rosario
78198f3398
Smart settings refactor
2015-06-22 21:04:06 -04:00
Roberto Rosario
08a8ae2554
Move document page content code to the OCR app. Prep work for issue #186 .
2015-06-17 00:21:35 -04:00
Roberto Rosario
5275061f9f
Refactor OCR backend class to be file object based and use images from document page not the actual file. Use pytesseract instead of calling the CLI directly.
2015-06-09 03:28:38 -04:00