Commit Graph

82 Commits

Author SHA1 Message Date
Roberto Rosario
1c6e90a37d Initialize the converter backend later to allow proper registration of app 2012-09-07 02:15:30 -04:00
Roberto Rosario
7e44247319 Update converter app to new settings and icon apps 2012-09-05 14:08:47 -04:00
Roberto Rosario
576a2cc643 Support passing MIMETypes and actual document filenames to TextParser for better lexer guessing 2012-08-06 03:00:09 -04:00
Roberto Rosario
6fa16ebb7e Add TextParser to the converter pipeline 2012-07-28 04:44:23 -04:00
Roberto Rosario
eecf7c7751 Import and PEP8 cleanups 2012-07-26 22:42:36 -04:00
Roberto Rosario
c7ea5271e4 Add logging to converter.api 2012-05-31 01:42:01 -04:00
Roberto Rosario
f026ef8bd4 Handle unicode filenames in staging folder preview and upload (Sergei Glita) 2012-02-07 14:40:56 -04:00
Roberto Rosario
a13f048242 Fix indentation 2012-02-04 17:20:13 -04:00
Roberto Rosario
e069243f7c Add missing import 2012-01-18 14:43:17 -04:00
Roberto Rosario
970cb74d35 PEP8 cleanups 2012-01-18 14:37:15 -04:00
Roberto Rosario
34311fb17e Cleanups, permissions separation into explicit module, absolute import update 2012-01-02 03:48:26 -04:00
Roberto Rosario
fafadfaca2 Fix document print view 2011-12-04 02:37:11 -04:00
Roberto Rosario
60c05317b3 Fix indentation 2011-12-01 23:38:29 -04:00
Roberto Rosario
da4457b258 Improve office documents page number detection 2011-11-22 05:51:02 -04:00
Roberto Rosario
ba7016b0bc Don't call office converter if not initialized 2011-11-22 05:08:04 -04:00
Roberto Rosario
c598e4ceb3 Instantiate OfficeConverter just once 2011-11-21 08:22:00 -04:00
Roberto Rosario
bd24353886 Updated office converter to accept a mimetype argument to avoid yet another file mimetype detection 2011-11-21 06:14:07 -04:00
Roberto Rosario
2bae05f39c Updated the convert app api to reuse the office converter detected mimetype 2011-11-21 05:50:34 -04:00
Roberto Rosario
e590cb041c Finished office converter using MIME type detection 2011-11-21 02:47:52 -04:00
Roberto Rosario
67b3e19031 Initial commit of the new office converter class 2011-11-20 02:48:34 -04:00
Roberto Rosario
7ec7ed499e Detect invalid transformation argument and reset to empty list [] 2011-11-07 02:05:06 -04:00
Roberto Rosario
c46c52ebdf Added converter backend agnostic image file format descriptions 2011-08-12 04:06:48 -04:00
Roberto Rosario
0a2591d58f Removed unused import, PEP8 cleanups 2011-08-12 02:13:23 -04:00
Roberto Rosario
beea100cd9 Updated the get_page_count logic to let the converter raise UnknownFileFormat and let the document model handle the exception, defaulting to one page count and saving a comment on the document description 2011-08-05 03:48:51 -04:00
Roberto Rosario
cd0b1577a7 Renamed the exception UnknownFormat to UnkownFileFormat and updated all backends get_page_count method to raise it, having the api's get_page_count function decide how to handle the exception 2011-08-05 03:24:01 -04:00
Roberto Rosario
01de394b88 Properly import COMMON_TEMPORARY_DIRECTORY, removed more QUALITY related code 2011-07-21 03:45:14 -04:00
Roberto Rosario
8a017e2af0 Added PDF file support to the python converter backend via ghostscript 2011-07-19 20:55:08 -04:00
Roberto Rosario
7ea1b87ee0 Made size an optional argument of the convert function 2011-07-19 04:21:12 -04:00
Roberto Rosario
5bfd607b31 Removed pdftotext from the requirements, move unpaper calling to the OCR app 2011-07-18 04:06:19 -04:00
Roberto Rosario
5829bbde4d Added per OCR queue transformation models and CRUD views to replace the CONVERTER_OCR_OPTIONS with the new refactored converter transformations systems 2011-07-17 01:32:46 -04:00
Roberto Rosario
29adcce2a3 flake8 cleanups 2011-07-16 01:15:58 -04:00
Roberto Rosario
0fe032f7c9 Finished fixing new document transformations 2011-07-16 01:09:36 -04:00
Roberto Rosario
389253385c Source, document page and thumbnails working, new document transformations and OCR yet to convert 2011-07-15 20:25:49 -04:00
Roberto Rosario
743ae0fce0 Initial commit of the converter image transformation refactor 2011-07-15 06:16:14 -04:00
Roberto Rosario
415f0c8daa Refactored the converter backend system 2011-07-13 22:53:33 -04:00
Roberto Rosario
9250a6bbdc Added view to list supported file formats as reported by the converter backend 2011-06-18 00:51:32 -04:00
Roberto Rosario
07e9b12e78 flake8 cleanups, unused imports and variables cleanup, changed register_diagnostics to use reverse_lazy instead of reverse 2011-05-06 10:39:54 -04:00
Roberto Rosario
9c17a627ec Initial commit of the new_printing branch 2011-05-04 16:59:14 -04:00
Roberto Rosario
bfa70f114b Converted app cleanups, document pre-cache, magic number removal 2011-04-26 15:02:27 -04:00
Roberto Rosario
425e4a0086 Fix document previews 2011-04-25 11:33:16 -04:00
Roberto Rosario
f88b011365 Quick update to fix staging file previews 2011-04-25 10:56:04 -04:00
Roberto Rosario
06900f7cd9 Fixed typo from previous PEP8 cleanup 2011-04-24 04:34:51 -04:00
Roberto Rosario
700bd7071c Removed redundant transformation calculation 2011-04-24 04:08:02 -04:00
Roberto Rosario
b2d0f7c310 Added doc extension to office document format list 2011-04-24 03:59:36 -04:00
Roberto Rosario
5b5a90100c flake8 cleanups 2011-04-23 23:03:56 -04:00
Roberto Rosario
46ec25b139 Added initial support for converting office documents (only ods and docx tested) 2011-04-23 22:20:30 -04:00
Roberto Rosario
221049100c Improved document conversion API 2011-04-23 19:42:15 -04:00
Roberto Rosario
75bed35d2c Fixed an error introduced in the last PEP8 cleanup 2011-04-23 05:36:17 -04:00
Roberto Rosario
2a744cefea PEP8, pylint cleanups and removal of relative imports 2011-04-23 02:49:07 -04:00
Roberto Rosario
ec2b313755 Lower image conversion quality if the format is jpg 2011-04-22 02:32:41 -04:00