Paperless-ngx, the open source document management system that scans, OCRs, indexes and archives your documents, preinstalled for AWS with PostgreSQL, Redis, tesseract OCR and nginx. Per instance administrator and database credentials are generated on first boot. Backed by 24/7 cloudimg support.
## Paperless-ngx by cloudimg
Paperless-ngx is the popular open source document management system: a self-hosted, searchable digital archive for your paperless office. Drop a scan or PDF into the consume directory and Paperless OCRs it, extracts the text, tags and indexes it, then archives a searchable original. This Amazon Machine Image delivers Paperless-ngx fully installed and configured, so a complete document management platform is running within minutes of launch. The current release available is Paperless-ngx 2.20.
## Application Stack
The image runs Paperless-ngx bare-metal from the official release: the granian webserver serves the bundled Angular frontend, a Celery worker and scheduler run the background task pipeline, and a consumer watches the input folder. PostgreSQL provides the database and Redis the task broker. The OCR pipeline is built on tesseract, Ghostscript and OCRmyPDF. nginx terminates HTTP on port 80 and reverse-proxies the application.
## Secure First Boot
On the first boot of every instance a one-shot service generates a fresh PostgreSQL password, a fresh Django secret key and a fresh Paperless administrator password, unique to that instance, and writes them to a root only file. The instance address is added to the trusted hosts automatically. No shared or default credentials and no documents ship in the image.
## Ready To Use
The application, web server, database, cache and OCR pipeline are all configured, with the document archive and database on dedicated, independently resizable storage volumes. Browse to the instance address, sign in as the administrator, and start scanning, tagging and searching your documents.
## cloudimg Support
24/7 technical support by email and chat for deployment, upgrades, OCR language configuration, consume-folder automation, object storage, TLS termination and database administration.