Roberto Rosario
922971274f
Add office document text extractor
2011-12-01 04:54:14 -04:00
Roberto Rosario
6d9b6f9ada
Add more debugging logging and update to only process 1 queue item per execution
...
Previously the task_process_document_queues processed all the pending queue
items it could find, this could lead to the of inexisting queue items
from a stale queryset
2011-12-01 04:48:24 -04:00
Roberto Rosario
c63721cbf6
Move OCR queue document requeueing from the view to the model and add proper exception
2011-12-01 04:47:27 -04:00
Roberto Rosario
31ea558b60
Update the REPLICATION_DELAY default to be 0 seconds
2011-12-01 04:46:00 -04:00
Roberto Rosario
29f547ee48
Update the queue signal processor to only trigger ocr queue processing on newly submitted documents and not requeued documents
2011-12-01 04:44:38 -04:00
Roberto Rosario
deb09d3d83
Re enabled tesseract language specific OCR processing and added a 1 time language neutral retry for failed language specific OCR
2011-11-22 17:46:18 -04:00
Roberto Rosario
667af2a442
Added multiple document OCR submit link
2011-11-22 17:45:56 -04:00
Roberto Rosario
290fcc925b
Added signal processing to the ocr queue to speed up ocr queue processing
2011-11-22 15:42:41 -04:00
Roberto Rosario
dc63c3225e
Updated ocr task to use the new lock manager abstracted class
2011-11-22 15:42:04 -04:00
Roberto Rosario
78685b9fc5
Reduce the ocr lock name size
2011-11-22 15:22:20 -04:00
Roberto Rosario
c9e8f2fac0
Updated the ocr app to use the lock manager
2011-11-22 15:07:29 -04:00
Roberto Rosario
eabc694b56
Updated language source file
2011-11-22 11:29:21 -04:00
Roberto Rosario
21927c00bb
Updated the ocr and document indexing apps to the new maintenante register function
2011-11-21 05:42:53 -04:00
Roberto Rosario
1f8c180567
Spanish translation updates
2011-11-07 00:34:15 -04:00
Roberto Rosario
7577f5b0e4
Added Russian locale post OCR cleanup backend (Сергей Глита [Sergei Glita])
2011-11-06 01:21:19 -04:00
Roberto Rosario
f0c019f6fc
Reduce severity of the messages displayed when no OCR backend is found for a language
2011-11-06 01:06:43 -04:00
Roberto Rosario
6d81185fc1
Updated compiled language files
2011-11-04 13:16:59 -04:00
Roberto Rosario
e58e6f8d8a
Spanish translation source file updates
2011-11-04 13:14:17 -04:00
Roberto Rosario
5a26ccc4ab
Complete Russian translation source messages
2011-11-04 13:13:36 -04:00
Roberto Rosario
b39f5c4ba1
Updated compiles translation files
2011-11-03 21:10:42 -04:00
Roberto Rosario
f71e2a4b62
Further Russian translation updates
2011-11-03 16:57:53 -04:00
Roberto Rosario
eea1abbc80
Russian translation update
2011-11-03 16:57:04 -04:00
Roberto Rosario
8e2210b799
Added initial Russian translation files
2011-11-03 16:40:19 -04:00
Roberto Rosario
fd92a1cd78
Portuguese translation updates
2011-11-03 16:19:55 -04:00
Roberto Rosario
0f72ed5d0d
Spanish translation updates
2011-09-30 01:30:51 -04:00
Roberto Rosario
85349bea03
Updated Spanish .po files, added English .po files to add project to Transifex.com
2011-09-29 18:52:11 -04:00
Roberto Rosario
c7e13576bc
Moved OCR links to the tools main menu
2011-08-18 19:45:48 -04:00
Roberto Rosario
18899f78f2
Improved tools menu using horizontal button widget just like the project_setup app
2011-08-18 11:24:25 -04:00
Roberto Rosario
0a2591d58f
Removed unused import, PEP8 cleanups
2011-08-12 02:13:23 -04:00
Roberto Rosario
08bc9ebf0e
Improved handling of Issue #10
2011-08-08 23:38:36 -04:00
Roberto Rosario
84e12efb43
Added special case handling for DjangoZoom, which executes collectstatic
...
management command before executing syncdb first to create the db
structure. Handles issue #10
2011-08-08 23:24:31 -04:00
Roberto Rosario
2169bbd0d2
Finished adding encapsulation to lambda functions to get around Django bug #15791
2011-08-05 09:46:28 -04:00
Roberto Rosario
1b7183be85
Added encapsulate factory function to get around Django bug #15791
2011-08-05 09:30:26 -04:00
Roberto Rosario
529a9e7eca
Added the ability to unschedule jobs to the scheduler
2011-07-27 01:27:16 -04:00
Roberto Rosario
1507f3afaa
Use the new model's tranformation namespace
2011-07-25 05:04:44 -04:00
Roberto Rosario
828ecd2a33
Use a different namespace for the transformation manager's method, but restoring the original 'objects' namespace
2011-07-25 05:03:03 -04:00
Roberto Rosario
055f64c1cf
Updated OCR models to use the identical source manager SourceTransformationManager
2011-07-25 03:41:28 -04:00
Roberto Rosario
bcb61c3ca3
Enabled OCR queue transformation processing
2011-07-25 03:40:15 -04:00
Roberto Rosario
1321491c1f
Migrated same solution to ocr queue transformation too
2011-07-25 02:59:39 -04:00
Roberto Rosario
a7204ee38f
Added a new ocr_queue_edit permission
2011-07-25 02:55:14 -04:00
Roberto Rosario
842d0c8868
Added job_processors app to abstract background job processing
2011-07-23 16:54:45 -04:00
Roberto Rosario
8462341533
Added new scheduler app to abstract job scheduling
2011-07-23 16:05:31 -04:00
Roberto Rosario
90e876ca93
Code cleanup
2011-07-21 11:46:15 -04:00
Roberto Rosario
89fc258a59
Adapter the OCR app to the new pre cache and preview generation methods
2011-07-21 03:49:27 -04:00
Roberto Rosario
8579c5081d
Improved OCR file conversion
2011-07-19 20:56:21 -04:00
Roberto Rosario
8a017e2af0
Added PDF file support to the python converter backend via ghostscript
2011-07-19 20:55:08 -04:00
Roberto Rosario
648be556a6
Finished adapting the OCR app to the new transformations refactor
2011-07-19 04:21:36 -04:00
Roberto Rosario
1d579e965c
Added convinience model method get_transformation_list to the ocr queue model
2011-07-19 04:20:08 -04:00
Roberto Rosario
d566dfbb1d
Added the first text parser backend (PDF) and updated the requirements files and README
2011-07-18 04:06:59 -04:00
Roberto Rosario
5bfd607b31
Removed pdftotext from the requirements, move unpaper calling to the OCR app
2011-07-18 04:06:19 -04:00