Roberto Rosario
|
c18cb099c6
|
Improved tesseract execution handling
|
2011-02-17 23:31:54 -04:00 |
|
Roberto Rosario
|
77b8a432a2
|
Added distributed OCR queue support
|
2011-02-17 04:37:35 -04:00 |
|
Roberto Rosario
|
478fb3502e
|
Changed from python's multiprocessing to celery to handle concurrency
|
2011-02-17 03:45:30 -04:00 |
|
Roberto Rosario
|
409a52af95
|
First commit to support ocr subprocess
|
2011-02-17 01:57:14 -04:00 |
|
Roberto Rosario
|
dfd101c33b
|
Cleanup file after ocr
|
2011-02-16 20:54:11 -04:00 |
|
Roberto Rosario
|
b1e2f64617
|
Apply transformation before doing OCR, added unpaper to the OCR pre processing pipe
|
2011-02-16 03:32:21 -04:00 |
|
Roberto Rosario
|
fbc8bc960a
|
Decoupled page transformation interface, added default transformation support
|
2011-02-14 02:11:39 -04:00 |
|
Roberto Rosario
|
06d7e5a46a
|
Added multipage document support and document page transformation
|
2011-02-14 00:18:16 -04:00 |
|
Roberto Rosario
|
d6afcc64bb
|
Changed file permissions
|
2011-02-09 13:55:01 -04:00 |
|
Roberto Rosario
|
6569faad11
|
Added OCR capabilites
|
2011-02-09 02:12:14 -04:00 |
|