595d7227a229aa6f6f381aa1b57dba32e78b5599
Added navigation link from document page view and document page transformation back to document view
Mayan
Open source, Django-based document manager with custom metadata indexing, file serving integration, and OCR capabilities.
Features
- User-defined metadata fields
- Dynamic default values for metadata
- Lookup support for metadata
- Filesystem integration by means of metadata indexing directories
- User-defined document UUID generation
- Local file or server-side staging file uploads
- Batch upload many documents with the same metadata
- User-defined document checksum algorithm
- Previews for a wide range of image formats, as well as PDF
- Document OCR and searching
- Group documents by metadata automatically
- Permissions and roles support
- Multi-page document support
- Page transformations
- OCR queue (via Celery)
- Multilingual (English, Spanish)
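The user-defined checksum and UUID features listed above can be pictured as plain callables that the project is pointed at. The following is a minimal sketch under that assumption, written for Python 3; the function names and the two setting names are hypothetical and are not taken from Mayan's actual configuration.

```python
import hashlib
import uuid


def sha256_checksum(data: bytes) -> str:
    """Return the hexadecimal SHA-256 digest of a document's contents."""
    return hashlib.sha256(data).hexdigest()


def random_uuid() -> str:
    """Return a random (version 4) UUID as a string."""
    return str(uuid.uuid4())


# Hypothetical setting names, shown only to illustrate the idea of
# swapping in a different algorithm; check the project's settings
# documentation for the real names.
DOCUMENT_CHECKSUM_FUNCTION = sha256_checksum
DOCUMENT_UUID_FUNCTION = random_uuid
```

Replacing either callable with another one that keeps the same signature (bytes in, string out for the checksum; no arguments, string out for the UUID) would be enough to change the algorithm in this sketch.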
Requirements
Python:
- Django - A high-level Python Web framework that encourages rapid development and clean, pragmatic design.
- django-pagination
- django-filetransfers - File upload/download abstraction
- django-celery
- celery
Or execute pip install -r requirements/production.txt to install the dependencies automatically.
Executables:
- ImageMagick - Convert, Edit, Or Compose Bitmap Images
- libmagic
- tesseract-ocr - An OCR Engine that was developed at HP Labs between 1985 and 1995... and now at Google.
- unpaper - Post-processing tool for scanned and photocopied book pages
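Before running the application, it can help to confirm that the external programs are reachable on the PATH. The sketch below is one way to do that with the Python 3 standard library. The command names are assumptions: `convert` is the usual ImageMagick binary on many systems (newer releases may ship `magick` instead), and libmagic is left out because it is a library rather than an executable.

```python
import shutil

# Assumed command names for the executables listed above; adjust them
# to match how the tools are installed on your system.
REQUIRED_COMMANDS = ("convert", "tesseract", "unpaper")


def missing_commands(commands=REQUIRED_COMMANDS):
    """Return the names in `commands` that cannot be found on the PATH."""
    return [name for name in commands if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_commands()
    if missing:
        print("Missing executables: " + ", ".join(missing))
    else:
        print("All required executables were found.")
```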
License
See docs/LICENSE file
Author
Roberto Rosario - Twitter [E-mail](roberto.rosario.gonzalez at gmail)
Credits
See docs/CREDITS file
