Installation

Requirements

  • Python 3.12

  • uv package manager (recommended)
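Before installing, it can help to confirm that both prerequisites are on your PATH. The following is a minimal sketch, not part of Dataflow: the helper name is_python_312 is hypothetical, the interpreter is assumed to be available as python3 (on Windows it is usually python), and the requirement above is read as "any 3.12.x release". Adjust the pattern if the project also supports later versions.

```shell
# Succeeds only for version strings of the form 3.12 or 3.12.x.
# (Hypothetical helper, not shipped with Dataflow.)
is_python_312() {
  case "$1" in
    3.12|3.12.*) return 0 ;;
    *) return 1 ;;
  esac
}

# Ask the interpreter on PATH for its version; empty if python3 is missing.
found="$(python3 -c 'import platform; print(platform.python_version())' 2>/dev/null || true)"

if is_python_312 "$found"; then
  echo "Python $found: OK"
else
  echo "Python 3.12 not found (got: ${found:-none})" >&2
fi

# uv is recommended rather than required, so only report its absence.
if command -v uv >/dev/null 2>&1; then
  uv --version
else
  echo "uv not found; the pip instructions still apply" >&2
fi
```

The script only reports what it finds and never exits, so it is safe to paste into an interactive shell.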

Installing with uv

The recommended way to install Dataflow is with uv:

# Clone the repository
git clone <repository-url>
cd dataflow

# Install dependencies with uv
uv sync

Installing with pip

Alternatively, you can install using pip:

# Clone the repository
git clone <repository-url>
cd dataflow

# Create a virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install in development mode
pip install -e .

Running Dataflow

Important

Before running a workflow, make sure your .env file defines the database connection details and any other environment variables that workflow requires.
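The variable names Dataflow expects are not listed in this section, so the snippet below only sketches the general shape of a .env file. DATABASE_URL and its value are hypothetical placeholders, not names confirmed by the project. Replace them with the variables your workflow actually reads.

```shell
# .env (example only: the name and value below are hypothetical placeholders)
DATABASE_URL=postgresql://user:password@localhost:5432/dbname
```

Writing each entry as plain KEY=value, with no quotes and no spaces around the equals sign, keeps the file usable with docker run --env-file as well, since Docker passes quote characters through as part of the value.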

Once Dataflow is installed and configured, select a workflow and run it:

# Set the workflow to run
export WORKFLOW=your_workflow_name

# Run with Python (activate the virtual environment first if you installed with pip)
python -m dataflow.main

# Or with uv
uv run python -m dataflow.main
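Because the workflow is selected through the WORKFLOW environment variable, a forgotten export is an easy mistake to make. One way to catch it early is a small guard like the sketch below. The function name require_workflow is hypothetical and not part of Dataflow, and how Dataflow itself behaves when WORKFLOW is missing is not described here.

```shell
# Return non-zero with a message when WORKFLOW is unset or empty.
# (Hypothetical helper, not shipped with Dataflow.)
require_workflow() {
  if [ -z "${WORKFLOW:-}" ]; then
    echo "WORKFLOW is not set; export it before running Dataflow" >&2
    return 1
  fi
}

# Usage (run from the repository root):
#   export WORKFLOW=your_workflow_name
#   require_workflow && python -m dataflow.main
```

The guard uses return rather than exit, so sourcing it into an interactive shell will not close the session when the check fails.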

Docker

Dataflow includes Docker support for containerized deployments:

# Build the image
docker build -t dataflow .

# Run, passing the variables from your .env file into the container
docker run --env-file .env dataflow
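A variable exported in your host shell is not visible inside the container unless it is passed explicitly, so WORKFLOW needs to come from the .env file or from an -e flag. The sketch below assumes the image reads WORKFLOW from its environment in the same way as a local run, which this section does not confirm. The helper name env_file_has is hypothetical.

```shell
# Succeeds if FILE exists and contains a line that assigns KEY (KEY=...).
# (Hypothetical helper, not shipped with Dataflow.)
env_file_has() {
  [ -f "$1" ] && grep -q "^$2=" "$1"
}

# Usage:
#   if env_file_has .env WORKFLOW; then
#     docker run --env-file .env dataflow
#   else
#     docker run --env-file .env -e WORKFLOW=your_workflow_name dataflow
#   fi
```

The pattern is anchored at the start of the line, so commented-out entries such as #WORKFLOW=... and longer names that merely end in WORKFLOW are not counted.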