Roberto Rosario
1e38369919
Update parser to use the latest version of a document when extracting text
2011-12-02 05:56:34 -04:00
Roberto Rosario
8c02c4c426
Disable ocr document queue signal
...
It appears Django executes signals using the same process
as the caller, effectively blocking the view until the OCR process
completes which could take several minutes :/
2011-12-02 05:55:42 -04:00
Roberto Rosario
d83e8b5428
Initial set of model, form and API changes to support document versions
2011-12-02 02:51:59 -04:00
Roberto Rosario
922971274f
Add office document text extractor
2011-12-01 04:54:14 -04:00
Roberto Rosario
6d9b6f9ada
Add more debugging logging and update to only process 1 queue item per execution
...
Previously the task_process_document_queues processed all the pending queue
items it could find, this could lead to the of inexisting queue items
from a stale queryset
2011-12-01 04:48:24 -04:00
Roberto Rosario
c63721cbf6
Move OCR queue document requeueing from the view to the model and add proper exception
2011-12-01 04:47:27 -04:00
Roberto Rosario
31ea558b60
Update the REPLICATION_DELAY default to be 0 seconds
2011-12-01 04:46:00 -04:00
Roberto Rosario
29f547ee48
Update the queue signal processor to only trigger ocr queue processing on newly submitted documents and not requeued documents
2011-12-01 04:44:38 -04:00
Roberto Rosario
deb09d3d83
Re enabled tesseract language specific OCR processing and added a 1 time language neutral retry for failed language specific OCR
2011-11-22 17:46:18 -04:00
Roberto Rosario
667af2a442
Added multiple document OCR submit link
2011-11-22 17:45:56 -04:00
Roberto Rosario
290fcc925b
Added signal processing to the ocr queue to speed up ocr queue processing
2011-11-22 15:42:41 -04:00
Roberto Rosario
dc63c3225e
Updated ocr task to use the new lock manager abstracted class
2011-11-22 15:42:04 -04:00
Roberto Rosario
78685b9fc5
Reduce the ocr lock name size
2011-11-22 15:22:20 -04:00
Roberto Rosario
c9e8f2fac0
Updated the ocr app to use the lock manager
2011-11-22 15:07:29 -04:00
Roberto Rosario
eabc694b56
Updated language source file
2011-11-22 11:29:21 -04:00
Roberto Rosario
21927c00bb
Updated the ocr and document indexing apps to the new maintenante register function
2011-11-21 05:42:53 -04:00
Roberto Rosario
1f8c180567
Spanish translation updates
2011-11-07 00:34:15 -04:00
Roberto Rosario
7577f5b0e4
Added Russian locale post OCR cleanup backend (Сергей Глита [Sergei Glita])
2011-11-06 01:21:19 -04:00
Roberto Rosario
f0c019f6fc
Reduce severity of the messages displayed when no OCR backend is found for a language
2011-11-06 01:06:43 -04:00
Roberto Rosario
6d81185fc1
Updated compiled language files
2011-11-04 13:16:59 -04:00
Roberto Rosario
e58e6f8d8a
Spanish translation source file updates
2011-11-04 13:14:17 -04:00
Roberto Rosario
5a26ccc4ab
Complete Russian translation source messages
2011-11-04 13:13:36 -04:00
Roberto Rosario
b39f5c4ba1
Updated compiles translation files
2011-11-03 21:10:42 -04:00
Roberto Rosario
f71e2a4b62
Further Russian translation updates
2011-11-03 16:57:53 -04:00
Roberto Rosario
eea1abbc80
Russian translation update
2011-11-03 16:57:04 -04:00
Roberto Rosario
8e2210b799
Added initial Russian translation files
2011-11-03 16:40:19 -04:00
Roberto Rosario
fd92a1cd78
Portuguese translation updates
2011-11-03 16:19:55 -04:00
Roberto Rosario
0f72ed5d0d
Spanish translation updates
2011-09-30 01:30:51 -04:00
Roberto Rosario
85349bea03
Updated Spanish .po files, added English .po files to add project to Transifex.com
2011-09-29 18:52:11 -04:00
Roberto Rosario
c7e13576bc
Moved OCR links to the tools main menu
2011-08-18 19:45:48 -04:00
Roberto Rosario
18899f78f2
Improved tools menu using horizontal button widget just like the project_setup app
2011-08-18 11:24:25 -04:00
Roberto Rosario
0a2591d58f
Removed unused import, PEP8 cleanups
2011-08-12 02:13:23 -04:00
Roberto Rosario
08bc9ebf0e
Improved handling of Issue #10
2011-08-08 23:38:36 -04:00
Roberto Rosario
84e12efb43
Added special case handling for DjangoZoom, which executes collectstatic
...
management command before executing syncdb first to create the db
structure. Handles issue #10
2011-08-08 23:24:31 -04:00
Roberto Rosario
2169bbd0d2
Finished adding encapsulation to lambda functions to get around Django bug #15791
2011-08-05 09:46:28 -04:00
Roberto Rosario
1b7183be85
Added encapsulate factory function to get around Django bug #15791
2011-08-05 09:30:26 -04:00
Roberto Rosario
529a9e7eca
Added the ability to unschedule jobs to the scheduler
2011-07-27 01:27:16 -04:00
Roberto Rosario
1507f3afaa
Use the new model's tranformation namespace
2011-07-25 05:04:44 -04:00
Roberto Rosario
828ecd2a33
Use a different namespace for the transformation manager's method, but restoring the original 'objects' namespace
2011-07-25 05:03:03 -04:00
Roberto Rosario
055f64c1cf
Updated OCR models to use the identical source manager SourceTransformationManager
2011-07-25 03:41:28 -04:00
Roberto Rosario
bcb61c3ca3
Enabled OCR queue transformation processing
2011-07-25 03:40:15 -04:00
Roberto Rosario
1321491c1f
Migrated same solution to ocr queue transformation too
2011-07-25 02:59:39 -04:00
Roberto Rosario
a7204ee38f
Added a new ocr_queue_edit permission
2011-07-25 02:55:14 -04:00
Roberto Rosario
842d0c8868
Added job_processors app to abstract background job processing
2011-07-23 16:54:45 -04:00
Roberto Rosario
8462341533
Added new scheduler app to abstract job scheduling
2011-07-23 16:05:31 -04:00
Roberto Rosario
90e876ca93
Code cleanup
2011-07-21 11:46:15 -04:00
Roberto Rosario
89fc258a59
Adapter the OCR app to the new pre cache and preview generation methods
2011-07-21 03:49:27 -04:00
Roberto Rosario
8579c5081d
Improved OCR file conversion
2011-07-19 20:56:21 -04:00
Roberto Rosario
8a017e2af0
Added PDF file support to the python converter backend via ghostscript
2011-07-19 20:55:08 -04:00
Roberto Rosario
648be556a6
Finished adapting the OCR app to the new transformations refactor
2011-07-19 04:21:36 -04:00