Fluid Forge
Get Started
See it run
  • Local (DuckDB)
  • Source-Aligned (Postgres → DuckDB)
  • AI Forge + Data Models
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
  • 11-Stage Production Pipeline
  • Catalog Forge End-to-End
CLI Reference
  • Overview
  • Quickstart
  • Examples
  • Your own CI
  • Your own scaffolding
  • Custom validator
  • Apply hook
  • Reference
Demos
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
Get Started
See it run
  • Local (DuckDB)
  • Source-Aligned (Postgres → DuckDB)
  • AI Forge + Data Models
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
  • 11-Stage Production Pipeline
  • Catalog Forge End-to-End
CLI Reference
  • Overview
  • Quickstart
  • Examples
  • Your own CI
  • Your own scaffolding
  • Custom validator
  • Apply hook
  • Reference
Demos
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
  • Introduction

    • Home
    • Getting Started
    • Snowflake Quickstart
    • See it run
    • Forge Data Model
    • Vision & Roadmap
    • Playground
    • FAQ
  • Concepts

    • Concepts
    • Builds, Exposes, Bindings
    • What is a contract?
    • Quality, SLAs & Lineage
    • Governance & Policy
    • Agent Policy (LLM/AI governance)
    • Providers vs Platforms
    • Fluid Forge vs alternatives
  • Data Products

    • Product Types — SDP, ADP, CDP
  • Walkthroughs

    • Walkthrough: Local Development
    • Source-Aligned: Postgres → DuckDB → Parquet
    • AI Forge And Data-Model Journeys
    • Walkthrough: Deploy to Google Cloud Platform
    • Walkthrough: Snowflake Team Collaboration
    • Declarative Airflow DAG Generation - The FLUID Way
    • Generating Orchestration Code from Contracts
    • Jenkins CI/CD for FLUID Data Products
    • Universal Pipeline
    • The 11-Stage Pipeline
    • End-to-End Walkthrough: Catalog → Contract → Transformation
  • CLI Reference

    • CLI Reference
    • fluid init
    • fluid demo
    • fluid forge
    • fluid skills
    • fluid status
    • fluid validate
    • fluid plan
    • fluid apply
    • fluid generate
    • fluid generate artifacts
    • fluid validate-artifacts
    • fluid verify-signature
    • fluid generate-airflow
    • fluid generate-pipeline
    • fluid viz-graph
    • fluid odps
    • fluid odps-bitol
    • fluid odcs
    • fluid export
    • fluid export-opds
    • fluid publish
    • fluid datamesh-manager
    • fluid market
    • fluid import
    • fluid policy
    • fluid policy check
    • fluid policy compile
    • fluid policy apply
    • fluid contract-tests
    • fluid contract-validation
    • fluid diff
    • fluid test
    • fluid verify
    • fluid product-new
    • fluid product-add
    • fluid workspace
    • fluid ide
    • fluid ai
    • fluid memory
    • fluid mcp
    • fluid scaffold-ci
    • fluid scaffold-composer
    • fluid scaffold-ide
    • fluid docs
    • fluid config
    • fluid split
    • fluid bundle
    • fluid auth
    • fluid doctor
    • fluid providers
    • fluid provider-init
    • fluid roadmap
    • fluid version
    • fluid runs
    • fluid retention
    • fluid secrets
    • fluid stats
    • fluid contract
    • fluid ship
    • fluid rollback
    • fluid schedule-sync
    • Catalog adapters

      • Source Catalog Integration (V1.5)
      • BigQuery Catalog
      • Snowflake Horizon Catalog
      • Databricks Unity Catalog
      • Google Dataplex Catalog
      • AWS Glue Data Catalog
      • DataHub Catalog
      • Data Mesh Manager Catalog
    • CLI by task

      • CLI by task
      • Add quality rules
      • Add agent governance
      • Debug a failed pipeline run
      • Switch clouds with one line
  • Recipes

    • Recipes
    • Recipe — add a quality rule
    • Recipe — switch clouds with one line
    • Recipe — tag PII in your schema
  • SDK & Plugins

    • SDK & Plugins
    • Quickstart — your first plugin
    • Examples

      • Runnable examples
      • Example: hello-scaffold — the minimal viable plugin
      • Example: gitlab-ci-scaffold — generate a complete CI project
      • Example: steward-validator — a custom governance rule
      • Example: prod-key-guard — apply-time invariant check
    • Journeys

      • Journeys
      • Your own CI/CD

        • You have your own CI/CD setup, no problem
        • GitLab CI — the bundle template
        • GitHub Actions — the bundle template
        • Jenkins — the bundle template
        • CircleCI — the bundle template
      • You have a strict project layout, no problem
      • You have governance rules, no problem
      • You want a check at apply time, no problem
    • Reference

      • Reference
      • Roles reference
      • Entry points reference
      • Trust model
      • Packaging
      • Companion packages
  • Providers

    • Providers
    • Provider Architecture
    • GCP Provider
    • AWS Provider
    • Snowflake Provider
    • Local Provider
    • Creating Custom Providers
    • Provider Roadmap
  • Advanced

    • Blueprints
    • Governance & Compliance
    • Airflow Integration
    • Built-in And Custom Forge Guidance
    • FLUID Forge Contract GPT Packet
    • Forge Discovery Guide
    • Forge Memory Guide
    • LLM Providers
    • Capability Warnings
    • LiteLLM Backend (opt-in)
    • MCP Server
    • Credential Resolver — Security Model
    • Cost Tracking
    • Agentic Primitives
    • Typed Errors
    • Typed CLI Errors
    • Authoring Forge Tools
    • Source-Aligned Acquisition
    • API Stability — fluid_build.api
    • Guided fluid forge UX
    • V1.5 Catalog Integration — Architecture Deep-Dive
    • V1.5 + V2 Hardening — Release Notes
  • Project

    • Contributing to Fluid Forge
    • Fluid Forge Docs Baseline: CLI 0.8.3
    • Fluid Forge Docs Baseline: CLI 0.8.0
    • Fluid Forge Docs Baseline: CLI 0.7.11
    • Fluid Forge Docs Baseline: CLI 0.7.9
    • Fluid Forge v0.7.1 - Multi-Provider Export Release

The 11-Stage Pipeline

A complete end-to-end walkthrough of the production pipeline promoted in forge-cli 0.8.0. Every stage is a CI gate that exits non-zero on failure; each stage maps to exactly one fluid command.

Overview

┌──────────── STRUCTURAL ────────────┐    ┌─ PUBLICATION ─┐
  1. bundle     → 2. validate                10. publish
        ↓                                          ↓
  3. generate   → 4. validate-artifacts       11. schedule-sync
     artifacts                                    (Path A only)
        ↓
  5. diff       → 6. plan → 7. apply
                                 ↓
  8. policy-apply → 9. verify
└────────────────────────────────────┘

Two structural invariants are enforced cryptographically:

  • Input integrity — every stage after bundle re-verifies the bundle's MANIFEST SHA-256 before running.
  • Apply isolation — fluid apply refuses to run unless the plan it was handed has matching bundleDigest + planDigest.

Prerequisites

# Install the stable release
pip install data-product-forge

# Health check
fluid doctor
fluid version

For the supply-chain stages (1 with --sign / --attest; 11 with --verify-signature), install cosign: https://docs.sigstore.dev/cosign/installation/.

The full flow (one product, one provider)

CONTRACT="contract.fluid.yaml"

# Stage 1 — bundle
fluid bundle $CONTRACT --format tgz --out runtime/bundle.tgz

# Stage 2 — validate
fluid validate runtime/bundle.tgz --strict --report runtime/validate-report.json

# Stage 3 — generate artifacts (catalog + DAGs + policy bindings)
fluid generate artifacts $CONTRACT --out dist/artifacts/

# Stage 4 — validate-artifacts (MANIFEST SHA-256 re-verify + per-format schemas)
fluid validate-artifacts dist/artifacts/ --manifest dist/artifacts/MANIFEST.json

# Stage 5 — drift gate
fluid diff $CONTRACT --env dev --out runtime/diff-report.json --exit-on-drift

# Stage 6 — plan (with mermaid-rendered HTML DAG + cryptographic digests)
fluid plan $CONTRACT --out runtime/plan.json --html --env dev

# Stage 7 — apply (verifies bundleDigest + planDigest before any DDL)
fluid apply runtime/plan.json --mode amend --env dev --yes --report runtime/apply-report.html

# Stage 8 — policy apply (stage between apply + verify so GRANTs land before transforms run)
fluid policy apply dist/artifacts/policy/bindings.json --mode enforce --env dev

# Stage 9 — verify (multi-dimensional schema / types / constraints / location)
fluid verify $CONTRACT --env dev --strict --out runtime/verify-report.json

# Stage 10 — publish (repeatable --target)
fluid publish $CONTRACT --target datamesh-manager --target fluid-command-center

# Stage 11 — schedule-sync (Path A only)
fluid schedule-sync \
  --scheduler airflow \
  --dags-dir dist/artifacts/schedule/ \
  --destination s3://my-airflow-dags/team-x/ \
  --env dev

Stage-by-stage detail

Stage 1 — bundle

fluid bundle packages the contract + its source files into a deterministic content-addressable tgz with a MANIFEST.json (SHA-256 per file + merkle root). Two independent runs produce byte-identical output.

fluid bundle contract.fluid.yaml --format tgz --out runtime/bundle.tgz

Optional supply-chain flags:

fluid bundle contract.fluid.yaml --format tgz --out runtime/bundle.tgz --sign           # Sigstore cosign
fluid bundle contract.fluid.yaml --format tgz --out runtime/bundle.tgz --sign --attest  # + SLSA L2 provenance

Stage 2 — validate

fluid validate checks the contract against the FLUID schema (auto-selected from fluidVersion) plus provider-specific rules.

fluid validate runtime/bundle.tgz --strict --report runtime/validate-report.json

Stage 3 — generate artifacts

fluid generate artifacts produces catalog + execution artifacts:

  • odcs/ — ODCS v3.1.0 contracts per expose
  • odps-bitol/ — ODPS-Bitol v1.0.0 product files
  • schedule/ — DAG files (when orchestration.engine is set)
  • policy/bindings.json — compiled IAM / GRANT bindings
  • MANIFEST.json — SHA-256 per file + merkle root

Reference-only contracts (builds[].pattern: hybrid-reference) auto-skip schedule and policies.

Stage 4 — validate-artifacts

fluid validate-artifacts re-hashes every file against MANIFEST.json and runs per-format schema validation (ODCS, ODPS-Bitol, DAG syntax, policy-bindings key-check). Self-gates on MANIFEST.json existence — skipped cleanly when stage 3 was off.

Stage 5 — diff (drift gate)

fluid diff compares the desired state against the deployed state (or a saved apply_report.json baseline). --exit-on-drift only hard-fails when a --state baseline was supplied — otherwise the gate downgrades to a warning, because every desired resource looks "new" on the first-ever run.

Stage 6 — plan

fluid plan emits plan.json with bundleDigest + planDigest fields. --html renders a mermaid DAG (colour-coded by action mode) alongside a raw-JSON drill-down.

Stage 7 — apply

fluid apply verifies both digests against the bundle MANIFEST before executing any DDL. --mode picks the dispatch semantic:

  • dry-run — render only
  • create-only — fail if target exists
  • amend (default) — ADD COLUMN IF NOT EXISTS; non-destructive
  • amend-and-build — amend + dbt run
  • replace — destructive; requires --allow-data-loss in non-dev
  • replace-and-build — destructive + dbt --full-refresh

Destructive modes auto-snapshot before mutating. Use fluid rollback to restore.

Stage 8 — policy apply

fluid policy apply deploys the IAM bindings from dist/artifacts/policy/bindings.json. Runs AFTER apply (GRANTs need the target schema objects) and BEFORE verify (so transforms on under-authorised objects surface as policy failures, not build errors). Self-gated on bindings.json existence.

Stage 9 — verify

fluid verify performs four-dimensional checks against the deployed warehouse:

  • Schema structure
  • Data types
  • Constraints (nullable / required)
  • Location

Reference-only contracts downgrade "table not found" to INFO under --strict (external pipeline owns creation).

Stage 10 — publish

fluid publish ships contracts + catalog artifacts to one or more targets. The new --target <name>[:<endpoint>] flag is repeatable; one call can publish to multiple catalogs with a per-target result block for partial-failure visibility.

For Data Mesh Manager / Entropy Data, ODPS product-to-product dependencies are published as Access agreements by default. Use DMM_AUTO_APPROVE_ACCESS=true or fluid dmm publish --auto-approve-access only in environments where those Access agreements should be approved automatically.

Stage 11 — schedule-sync (Path A only)

fluid schedule-sync pushes DAG files to the scheduler control plane (Airflow / MWAA / Composer / Astronomer / Prefect / Dagster). Path-B engines (EventBridge / Snowflake Tasks) apply their schedules inside stage-7 apply via SchedulePlanner, so stage 11 is a no-op for those contracts. Self-gated on dist/artifacts/schedule/ presence.

Generating a CI pipeline

The easiest way to run the 11 stages end-to-end is to let fluid generate ci emit a parameterised pipeline for your CI system:

fluid generate ci --system jenkins --install-mode pypi
# emits Jenkinsfile with all 11 stages + per-stage RUN_STAGE_N_X toggles

For Jenkins jobs that should publish to Entropy Data on the first Pipeline-from-SCM build, generate the defaults directly instead of patching the Jenkinsfile:

fluid generate ci contract.fluid.yaml \
  --system jenkins \
  --install-mode pypi \
  --default-publish-target datamesh-manager \
  --no-verify-strict-default \
  --publish-stage-default \
  --no-publish-include-env \
  --out Jenkinsfile

Those switches set VERIFY_STRICT=false, turn Stage 10 publish on by default, omit the Stage 10 --env flag, and bake a shell fallback of ${PUBLISH_TARGETS:-datamesh-manager} into the publish loop. They are Jenkins-only generation controls; other CI systems ignore them.

Supported systems: jenkins, github, gitlab, azure, bitbucket, circleci, tekton.

Each template exposes the full surface as build parameters:

  • CONTRACT, FLUID_ENV
  • RUN_STAGE_1_BUNDLE … RUN_STAGE_11_SCHEDULE_SYNC (booleans)
  • BUNDLE_FORMAT, APPLY_MODE, APPLY_BUILD_ID, ALLOW_DATA_LOSS, NO_VERIFY_DIGEST, POLICY_APPLY_MODE, VERIFY_STRICT, DIFF_EXIT_ON_DRIFT, PUBLISH_TARGETS, SCHEDULER, SCHEDULER_DESTINATION, …

Operators run selected stages for a deploy (e.g. only 1–9 for a dev smoke) by flipping the booleans in Jenkins Build-With-Parameters.

Failure posture

Per stage, the exit code semantics are:

ExitMeaning
0Stage passed (or legitimately skipped — e.g. ref-only contract at stage 3/8/11).
1Gate fired — drift detected, schema mismatch, policy violation, etc.
2Config error — missing file, bad flag value, etc.

Stages exit fast with clear error events (apply_mode_data_loss_blocked, schedule_sync_dags_dir_missing, validate_artifacts_sha_mismatch, …). CI log parsers can key off the event slugs for red/amber/green dashboards.

Related

  • fluid generate ci — generate a 7-system parameterised pipeline with all 11 stages wired
  • fluid rollback — restore from the auto-snapshot when stage 7 replaces went wrong
  • fluid verify-signature — verify supply-chain signatures on a bundle before letting stage 11 push DAGs
Edit this page on GitHub
Last Updated: 5/17/26, 6:10 PM
Contributors: fas89, Claude Opus 4.7 (1M context)
Prev
Universal Pipeline
Next
End-to-End Walkthrough: Catalog → Contract → Transformation