Installation¶

Prerequisites¶

Python: 3.12 (the only supported version)
OS: Linux (tested). macOS and Windows (WSL2) are untested; may work for CPU-only use.
Optional: NVIDIA GPU with CUDA for GPU acceleration

From source¶

Clone the repository and install:

# codeberg mirror
git clone https://codeberg.org/piotrkrzysztof/quantum-pipeline.git

# or github
git clone https://github.com/straightchlorine/quantum-pipeline.git

cd quantum-pipeline

# Install with PDM
pdm install

# Or with pip
pip install -e .

Docker¶

Pre-built images are available on Docker Hub:

CPUGPU

docker pull straightchlorine/quantum-pipeline:cpu

docker run --rm straightchlorine/quantum-pipeline:cpu \
    --file data/molecules.json \
    --basis sto3g \
    --max-iterations 100

docker pull straightchlorine/quantum-pipeline:gpu

docker run --rm --gpus all straightchlorine/quantum-pipeline:gpu \
    --file data/molecules.json \
    --basis sto3g \
    --gpu \
    --max-iterations 100

GPU images require the NVIDIA Container Toolkit and Docker configured with the nvidia runtime. See the GPU Acceleration Guide for setup.

The runs above leave nothing behind once the container exits. To keep generated reports and plots, pass --report and mount the output directory. The container writes artifacts to gen/ under its working directory (/usr/src/quantum_pipeline):

docker run --rm \
    -v "$(pwd)/gen:/usr/src/quantum_pipeline/gen" \
    straightchlorine/quantum-pipeline:cpu \
    --file data/molecules.json \
    --molecule-index 0 \
    --basis sto3g \
    --max-iterations 100 \
    --report

To build images locally:

just docker-build cpu    # or: gpu, all

# in case of gpu:
# set CUDA_ARCH env var to match your GPU (default: 8.6/Ampere)
CUDA_ARCH=7.5 just docker-build gpu   # Turing (RTX 20xx)

Full platform with Docker Compose¶

This deploys the simulation module alongside Kafka, Spark, Airflow, Garage, monitoring and MLflow. The stack is defined in compose/docker-compose.ml.yaml.

The fastest way to get started is the setup script. It generates .env with random secrets, configures Garage (S3 storage), creates buckets, and sets up access keys:

# git clone https://github.com/straightchlorine/quantum-pipeline.git
git clone https://codeberg.org/piotrkrzysztof/quantum-pipeline.git

cd quantum-pipeline

# Automated first-time setup (generates .env, configures Garage)
just setup
# bash scripts/ml-setup.sh

# Start the stack
just up
# docker compose --env-file .env -f compose/docker-compose.ml.yaml up -d

# Stop the stack
just down

If .env already exists, it skips what is already configured. Otherwise copy .env.ml.example to .env and fill in the values yourself before running just up.

Services started:

Service	URL
Airflow	`http://localhost:8084`
Spark Master	`http://localhost:8080`
Garage S3 API	`http://localhost:3901`
Schema Registry	`http://localhost:8081`
MLflow	`http://localhost:5000`

See Docker Compose Guide for details.

Verify installation¶

quantum-pipeline --file data/molecules.json --molecule-index 0 --basis sto3g --max-iterations 10

If this completes without errors, the installation is working.

Optional dependency groups¶

Install extra groups with PDM or pip:

Group	Command	What it adds
`dev`	`pdm install -G dev`	pytest, ruff, mypy, debugpy, testcontainers
`docs`	`pdm install -G docs`	mkdocs, mkdocs-material, pymdown-extensions
`ml`	`pdm install -G ml`	scikit-learn, xgboost, mlflow, jupyterlab, seaborn

With pip, use pip install -e ".[dev]", pip install -e ".[docs]", or pip install -e ".[ml]".

After installing docs, run mkdocs serve and open http://127.0.0.1:8000.

Troubleshooting¶

See the Troubleshooting Guide for more.

Next steps¶

Quick Start - run your first VQE simulation
Configuration - all available parameters
Docker Compose - deploy the full platform