Commit Graph

5 Commits

Author SHA1 Message Date
Roberto Rosario
f9a3c4611b PEP8 cleanups, remove OCR_CACHE_URI 2012-01-18 13:53:02 -04:00
Roberto Rosario
1e38369919 Update parser to use the latest version of a document when extracting text 2011-12-02 05:56:34 -04:00
Roberto Rosario
922971274f Add office document text extractor 2011-12-01 04:54:14 -04:00
Roberto Rosario
90e876ca93 Code cleanup 2011-07-21 11:46:15 -04:00
Roberto Rosario
d566dfbb1d Added the first text parser backend (PDF) and updated the requirements files and README 2011-07-18 04:06:59 -04:00