Refactor the OCR backend class to be file-object based and to use images from the document page, not the actual file. Use pytesseract instead of calling the CLI directly.

Roberto Rosario
2015-06-09 03:27:02 -04:00
parent 931bdfd113
commit 5275061f9f
6 changed files with 81 additions and 45 deletions

@@ -1,3 +1 @@
class BackendBase(object):
    def execute(self, input_filename, language=None):
        raise NotImplementedError
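
For illustration only, here is a minimal sketch of what a file-object-based backend along the lines of the commit message could look like. It is not taken from this commit: the `file_object` parameter name and the `TesseractBackend` and `EchoBackend` classes are hypothetical. The only third-party calls assumed are `PIL.Image.open` and `pytesseract.image_to_string`, both imported lazily so the base class works without them.

```python
import io


class BackendBase(object):
    """Base class for OCR backends that read from a file object."""

    def execute(self, file_object, language=None):
        # Subclasses must return the recognized text as a string.
        raise NotImplementedError


class TesseractBackend(BackendBase):
    """Hypothetical backend that uses pytesseract instead of the CLI."""

    def execute(self, file_object, language=None):
        # Lazy imports: Pillow and pytesseract are only needed here.
        from PIL import Image
        import pytesseract

        # file_object is assumed to hold an image of a document page.
        image = Image.open(file_object)
        return pytesseract.image_to_string(image, lang=language)


class EchoBackend(BackendBase):
    """Stand-in backend (assumption) that only shows the calling convention."""

    def execute(self, file_object, language=None):
        return file_object.read().decode('utf-8')
```

A caller would pass any binary file-like object, for example `EchoBackend().execute(io.BytesIO(b'hello'))`, rather than a path on disk. That is the main design difference from the filename-based `execute` signature shown in the hunk above.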