DuckDB preinstalled for AWS. DuckDB 1.5 in a JupyterLab notebook environment with a sample parquet dataset on Ubuntu 24.04, with a per instance admin password generated on first boot. Backed by 24/7 cloudimg support.
## DuckDB by cloudimg
DuckDB is the open source in-process analytical database engine designed for fast queries against large columnar datasets. This Amazon Machine Image delivers DuckDB inside a complete analytics environment, so you can connect, load data and run queries within minutes of launch. The release available is DuckDB 1.5.
## Application Stack
The image runs DuckDB 1.5 with the CLI on every user's PATH and the Python DuckDB client preinstalled. A JupyterLab notebook server is fronted by nginx on port 80 with HTTP basic authentication. Notebooks, sample datasets and persistent databases live on a dedicated, independently resizable EBS volume.
## Sample Dataset and Notebook
A one million row New York City yellow taxi trips parquet file ships on the data volume, alongside a starter notebook that opens a persistent DuckDB database and runs three analytical queries so you can see the engine in action before writing any code.
## Secure First Boot
On the first boot of every instance a one shot service generates a fresh JupyterLab basic authentication password, unique to that instance, and writes it to a root only file. No shared or default credentials ship in the image.
## Ready To Use
Browse to the instance address, sign in to JupyterLab and start querying. The DuckDB CLI is also available directly over SSH for terminal driven analytics.
## cloudimg Support
cloudimg provides 24/7 technical support for this image, covering DuckDB deployment, notebook configuration, dataset loading, performance tuning and engine upgrades.