Commit Graph

28 Commits

Author SHA1 Message Date
Roberto Rosario
7e83a1074e Change the OCR_AUTOMATIC_OCR default to True, as per version 0.12.x 2012-07-28 04:45:05 -04:00
Roberto Rosario
24130cf716 Added sprites for several namespaces 2012-07-03 11:00:21 -04:00
Roberto Rosario
58f027db60 Clean up (unused imports, PEP8, etc) 2012-06-08 16:43:54 -04:00
Roberto Rosario
c24a3f935e Merge branch 'hotfix/v0.12.1' into development
Conflicts:
	apps/converter/conf/settings.py
	apps/documents/__init__.py
	apps/main/__init__.py
	apps/metadata/__init__.py
	apps/navigation/templatetags/navigation_tags.py
	apps/ocr/__init__.py
	apps/ocr/conf/settings.py
	docs/intro/installation.rst
	docs/releases/index.rst
	requirements/production.txt
2012-06-03 22:50:33 -04:00
Roberto Rosario
f2f99bf0ad Change default value of OCR_AUTOMATIC_OCR to True 2012-05-30 19:27:23 -04:00
Roberto Rosario
0cf8fe2dec Update PDFTOTEXT_PATH config option description 2012-05-30 18:53:28 -04:00
Roberto Rosario
200e5f3947 Add setting to specify the location of the pdftotext binary 2012-05-30 16:15:30 -04:00
Roberto Rosario
c9dcda3a5c Convert all apps to use the new class based settings 2012-03-06 16:08:28 -04:00
Roberto Rosario
f9a3c4611b PEP8 cleanups, remove OCR_CACHE_URI 2012-01-18 13:53:02 -04:00
Roberto Rosario
31ea558b60 Update the REPLICATION_DELAY default to be 0 seconds 2011-12-01 04:46:00 -04:00
Roberto Rosario
648be556a6 Finished adapting the OCR app to the new transformations refactor 2011-07-19 04:21:36 -04:00
Roberto Rosario
5bfd607b31 Removed pdftotext from the requirements, move unpaper calling to the OCR app 2011-07-18 04:06:19 -04:00
Roberto Rosario
9e61213241 Created new smart_settings app and move everything related to app settings to it 2011-05-07 01:32:02 -04:00
Roberto Rosario
7f2c563192 Converted whole project to a smarter method of defining app settings 2011-05-07 01:15:40 -04:00
Roberto Rosario
7469fe991f Made the OCR cache backend used for locking configurable, move ocr locking to queued document from periodic task, added again a random delay fallback in case no cache backend is used 2011-05-06 15:31:49 -04:00
Roberto Rosario
ebdcede59f Made the queue processing interval configurable by means of a new setting: OCR_QUEUE_PROCESSING_INTERVAL 2011-04-23 05:38:59 -04:00
Roberto Rosario
eaaaa5b645 Added support for the command line program pdftotext from the poppler-utils packages to extract text from PDF documents without doing OCR 2011-04-15 23:59:52 -04:00
Roberto Rosario
6b5a17af39 Made English the default language for Tesseract if none is specified 2011-04-13 03:25:45 -04:00
Roberto Rosario
71a3c218f4 PEP8, pylint and django-lint cleanups 2011-04-08 02:09:39 -04:00
Roberto Rosario
283df926d1 Made automatic OCR a function of the OCR app and not of Documents app (via signals)
Renamed setup option DOCUMENT_AUTOMATIC_OCR to OCR_AUTOMATIC_OCR
2011-04-04 15:36:00 -04:00
Roberto Rosario
3cb0f37b5b Made the concurrent ocr code more granular, per node, every node can handle different amounts of concurrent ocr tasks 2011-03-22 04:17:48 -04:00
Roberto Rosario
f9ab61647e Reduced default delay time 2011-03-22 03:43:18 -04:00
Roberto Rosario
bbcc0ead65 * Added a new option OCR_REPLICATION_DELAY to allow the storage some time for replication before attempting to do OCR to a document 2011-03-21 12:24:42 -04:00
Roberto Rosario
6a9e114acb Set all *.py files permissions to 644 2011-03-07 12:15:25 -04:00
Roberto Rosario
595d7227a2 Added navigation link from document page view and document page transformation back to document view 2011-02-17 23:27:25 -04:00
Roberto Rosario
478fb3502e Changed from python's multiprocessing to celery to handle concurrency 2011-02-17 03:45:30 -04:00
Roberto Rosario
d6afcc64bb Changed file permissions 2011-02-09 13:55:01 -04:00
Roberto Rosario
6569faad11 Added OCR capabilites 2011-02-09 02:12:14 -04:00