Roberto Rosario
9564db398f
Backport configuration file improvements
...
Remove support for quoted entried. Support unquoted entries. Support
custom location for the config files.
Signed-off-by: Roberto Rosario <roberto.rosario.gonzalez@gmail.com >
2019-07-09 15:40:20 -04:00
Roberto Rosario
ea3b513ae3
Add new app to handle all dependencies
...
Signed-off-by: Roberto Rosario <roberto.rosario.gonzalez@gmail.com >
2019-05-03 01:12:20 -04:00
Roberto Rosario
32cf0a0595
Add new default Tesseract OCR backend
...
This new backend uses a command call to avoid
Tesseract bug 1670
(https://github.com/tesseract-ocr/tesseract/issues/1670 ).
Signed-off-by: Roberto Rosario <roberto.rosario.gonzalez@gmail.com >
2019-04-27 15:44:09 -04:00
Roberto Rosario
6bfdb053e3
Add new OCR backend using PyOCR. Remove current direct call Tesseract backend.
2016-12-30 00:36:45 -04:00
Roberto Rosario
8712c6ee37
PEP8 cleanups.
2016-05-17 05:08:21 -04:00
Roberto Rosario
27aae995f0
The Tesseract OCR backend now reports if the requested language file is missing. GitLab issue #289 .
2016-05-17 03:49:23 -04:00
Roberto Rosario
3b728328ad
PEP8 cleanups, E501.
2015-07-23 04:05:29 -04:00
Roberto Rosario
ba7cb433d4
Don't hide OCR errors by doing a fallback try without language option. gh-issue #211
2015-07-18 03:25:36 -04:00
Roberto Rosario
2033f85874
Log OCR subclass errors.
2015-07-08 04:16:35 -04:00
Roberto Rosario
e4623fadcd
PEP8 cleanups
2015-06-23 02:23:23 -04:00
Roberto Rosario
78198f3398
Smart settings refactor
2015-06-22 21:04:06 -04:00
Roberto Rosario
5275061f9f
Refactor OCR backend class to be file object based and use images from document page not the actual file. Use pytesseract instead of calling the CLI directly.
2015-06-09 03:28:38 -04:00
Roberto Rosario
e6754c9a6f
Update the OCR app to work based on document versions not documents, document version are the module which hold the document pages instances. Remove old OCR document queue and replace with a single module for OCR processing error entries. Increase compatibility with Django 1.7 and Python 3.
2015-01-15 03:01:43 -04:00
Roberto Rosario
5f6438ac55
Merge branch 'feature/polish_PR_52' into development
2014-09-30 15:48:42 -04:00
Roberto Rosario
4968051b6d
Don't silence OCR errors even if Tesseract is optional otherwise the user won't know happened.
...
Catch the OSError generic exception and return a friendlier "Tesseract not found" exception
2014-09-30 15:41:00 -04:00
Roberto Rosario
b761037d99
Move all settings files from <app>/conf/settings.py to <app>/settings.py
2014-09-11 05:02:40 -04:00
Roberto Rosario
a9390d55ba
Unify the way backends are defined and loaded, unify the fs_cleanup function
2014-07-01 00:22:31 -04:00
Roberto Rosario
ac061f2203
PEP8 Cleanups, simple sintax errors fixes
2014-06-25 02:53:12 -04:00
Roberto Rosario
e0347785b7
Initial implementation of OCR pluggable backends
2014-06-24 22:48:16 -04:00