Fluid Forge
Get Started
See it run
  • Local (DuckDB)
  • Source-Aligned (Postgres → DuckDB)
  • AI Forge + Data Models
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
  • 11-Stage Production Pipeline
  • Catalog Forge End-to-End
CLI Reference
  • Overview
  • Quickstart
  • Examples
  • Your own CI
  • Your own scaffolding
  • Custom validator
  • Apply hook
  • Reference
Demos
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
Get Started
See it run
  • Local (DuckDB)
  • Source-Aligned (Postgres → DuckDB)
  • AI Forge + Data Models
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
  • 11-Stage Production Pipeline
  • Catalog Forge End-to-End
CLI Reference
  • Overview
  • Quickstart
  • Examples
  • Your own CI
  • Your own scaffolding
  • Custom validator
  • Apply hook
  • Reference
Demos
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
  • Introduction

    • Home
    • Getting Started
    • Snowflake Quickstart
    • See it run
    • Forge Data Model
    • Vision & Roadmap
    • Playground
    • FAQ
  • Concepts

    • Concepts
    • Builds, Exposes, Bindings
    • What is a contract?
    • Quality, SLAs & Lineage
    • Governance & Policy
    • Agent Policy (LLM/AI governance)
    • Providers vs Platforms
    • Fluid Forge vs alternatives
  • Data Products

    • Product Types — SDP, ADP, CDP
  • Walkthroughs

    • Walkthrough: Local Development
    • Source-Aligned: Postgres → DuckDB → Parquet
    • AI Forge And Data-Model Journeys
    • Walkthrough: Deploy to Google Cloud Platform
    • Walkthrough: Snowflake Team Collaboration
    • Declarative Airflow DAG Generation - The FLUID Way
    • Generating Orchestration Code from Contracts
    • Jenkins CI/CD for FLUID Data Products
    • Universal Pipeline
    • The 11-Stage Pipeline
    • End-to-End Walkthrough: Catalog → Contract → Transformation
  • CLI Reference

    • CLI Reference
    • fluid init
    • fluid demo
    • fluid forge
    • fluid skills
    • fluid status
    • fluid validate
    • fluid plan
    • fluid apply
    • fluid generate
    • fluid generate artifacts
    • fluid validate-artifacts
    • fluid verify-signature
    • fluid generate-airflow
    • fluid generate-pipeline
    • fluid viz-graph
    • fluid odps
    • fluid odps-bitol
    • fluid odcs
    • fluid export
    • fluid export-opds
    • fluid publish
    • fluid datamesh-manager
    • fluid market
    • fluid import
    • fluid policy
    • fluid policy check
    • fluid policy compile
    • fluid policy apply
    • fluid contract-tests
    • fluid contract-validation
    • fluid diff
    • fluid test
    • fluid verify
    • fluid product-new
    • fluid product-add
    • fluid workspace
    • fluid ide
    • fluid ai
    • fluid memory
    • fluid mcp
    • fluid scaffold-ci
    • fluid scaffold-composer
    • fluid scaffold-ide
    • fluid docs
    • fluid config
    • fluid split
    • fluid bundle
    • fluid auth
    • fluid doctor
    • fluid providers
    • fluid provider-init
    • fluid roadmap
    • fluid version
    • fluid runs
    • fluid retention
    • fluid secrets
    • fluid stats
    • fluid contract
    • fluid ship
    • fluid rollback
    • fluid schedule-sync
    • Catalog adapters

      • Source Catalog Integration (V1.5)
      • BigQuery Catalog
      • Snowflake Horizon Catalog
      • Databricks Unity Catalog
      • Google Dataplex Catalog
      • AWS Glue Data Catalog
      • DataHub Catalog
      • Data Mesh Manager Catalog
    • CLI by task

      • CLI by task
      • Add quality rules
      • Add agent governance
      • Debug a failed pipeline run
      • Switch clouds with one line
  • Recipes

    • Recipes
    • Recipe — add a quality rule
    • Recipe — switch clouds with one line
    • Recipe — tag PII in your schema
  • SDK & Plugins

    • SDK & Plugins
    • Quickstart — your first plugin
    • Examples

      • Runnable examples
      • Example: hello-scaffold — the minimal viable plugin
      • Example: gitlab-ci-scaffold — generate a complete CI project
      • Example: steward-validator — a custom governance rule
      • Example: prod-key-guard — apply-time invariant check
    • Journeys

      • Journeys
      • Your own CI/CD

        • You have your own CI/CD setup, no problem
        • GitLab CI — the bundle template
        • GitHub Actions — the bundle template
        • Jenkins — the bundle template
        • CircleCI — the bundle template
      • You have a strict project layout, no problem
      • You have governance rules, no problem
      • You want a check at apply time, no problem
    • Reference

      • Reference
      • Roles reference
      • Entry points reference
      • Trust model
      • Packaging
      • Companion packages
  • Providers

    • Providers
    • Provider Architecture
    • GCP Provider
    • AWS Provider
    • Snowflake Provider
    • Local Provider
    • Creating Custom Providers
    • Provider Roadmap
  • Advanced

    • Blueprints
    • Governance & Compliance
    • Airflow Integration
    • Built-in And Custom Forge Guidance
    • FLUID Forge Contract GPT Packet
    • Forge Discovery Guide
    • Forge Memory Guide
    • LLM Providers
    • Capability Warnings
    • LiteLLM Backend (opt-in)
    • MCP Server
    • Credential Resolver — Security Model
    • Cost Tracking
    • Agentic Primitives
    • Typed Errors
    • Typed CLI Errors
    • Authoring Forge Tools
    • Source-Aligned Acquisition
    • API Stability — fluid_build.api
    • Guided fluid forge UX
    • V1.5 Catalog Integration — Architecture Deep-Dive
    • V1.5 + V2 Hardening — Release Notes
  • Project

    • Contributing to Fluid Forge
    • Fluid Forge Docs Baseline: CLI 0.8.3
    • Fluid Forge Docs Baseline: CLI 0.8.0
    • Fluid Forge Docs Baseline: CLI 0.7.11
    • Fluid Forge Docs Baseline: CLI 0.7.9
    • Fluid Forge v0.7.1 - Multi-Provider Export Release

Source Catalog Integration (V1.5)

Forge data products directly from your existing catalog metadata — no re-typing descriptions, tags, lineage, ownership, or sensitivity classifications you've already invested in. V1.5 turns forge-cli into a vendor-neutral modeling layer that consumes the seven major catalog ecosystems and emits one Fluid contract.

fluid forge data-model from-source \
  --source snowflake \
  --credential-id snowflake-prod \
  --database BIZ_LAB --schema SEEDED \
  --technique data-vault-2 \
  -o biz_lab.fluid.yaml

This page is the catalog-by-catalog index. Each linked page is a self-contained walkthrough: privileges to grant, env-vars to set, auth methods, end-to-end demo, and error reference.

Supported catalogs

CatalogWhat it bringsOptional install extra
Snowflake HorizonINFORMATION_SCHEMA, OBJECT_TAGS, OBJECT_DEPENDENCIES (lineage), SYSTEM$CLASSIFY (auto-PII), business descriptions, certified/sandbox markerspip install "data-product-forge[snowflake]"
Databricks UnityTables / columns / lineage, column tags, column masks, business glossary, certificationspip install "data-product-forge[databricks]"
BigQueryINFORMATION_SCHEMA, table description + labels, partition keys, primary keyspip install "data-product-forge[gcp]"
DataplexAspect-types (data-quality scores, freshness SLAs), business glossary, lineage entriespip install "data-product-forge[gcp]"
AWS GlueTables / databases / partitions / classifiers + Lake Formation tagspip install "data-product-forge[aws]"
DataHubDatasets, schemas, lineage, business glossary, ownership, tags, domains, business attributespip install "data-product-forge[datahub]"
Data Mesh ManagerRegistered data products, owner, domain, contracts, lineage, certificationsalready in core deps (REST over httpx)

Combined: pip install "data-product-forge[catalogs]" installs every optional dep at once. Default pip install data-product-forge ships with zero catalog deps — you opt into only the catalogs you use.

Two ways to drive a catalog forge

1. CLI: fluid forge data-model from-source

fluid forge data-model from-source \
  --source <catalog>               # snowflake, unity, bigquery, dataplex, glue, datahub, datamesh_manager
  --credential-id <name>           # saved name from ~/.fluid/sources.yaml
  --database <db> --schema <sch> \
  --tables orders customers \
  --technique data-vault-2 | dimensional \
  -o contract.fluid.yaml

Non-interactive; reproducible; works in CI. The --source value selects the catalog adapter; --credential-id points at a row in ~/.fluid/sources.yaml configured via fluid ai setup --source CATALOG --name NAME.

2. MCP: forge_from_source

// .mcp.json (Claude Code, Cursor, any MCP client)
{
  "fluid": {
    "command": "fluid",
    "args": ["mcp", "serve"],
    "env": { "SNOWFLAKE_PRIVATE_KEY_PATH": "/path/to/key.p8" }
  }
}

Then tell the agent:

Forge a data-vault-2 model from my Snowflake BIZ_LAB.SEEDED schema. Use the snowflake adapter with credential id snowflake-prod. Save the contract to biz_lab.fluid.yaml.

The agent calls the MCP forge_from_source tool; same pipeline, same output.

Setup wizard: fluid ai setup --source CATALOG --name NAME

For first-time setup of a new source:

fluid ai setup --source snowflake --name snowflake-prod
# ? Account: myorg-abc12345
# ? Auth method:                 # ★ marks recommended path
#   ★ key_pair (recommended)     # — most secure, no password
#     password
#     externalbrowser (SSO)
# ? Private key path: /Users/me/.ssh/snowflake_key.p8
# ? Default warehouse: COMPUTE_WH
# ? Default role: ANALYST
# ✓ Saved to ~/.fluid/sources.yaml

This writes a single row to ~/.fluid/sources.yaml. Secrets land in the OS keyring (macOS Keychain / Windows Credential Manager / Linux secret-service); only non-sensitive fields land on disk.

Three guarantees that hold across every catalog

  1. Read-only. Adapters request INFORMATION_SCHEMA-equivalent privileges only. No SELECT * against any data table — only metadata.
  2. No data values fetched. Tag values, descriptions, classifications, schema — never row data. Defends against an upstream agent escalating "list tables" into "dump customer PII."
  3. Audited. Every catalog tool call writes an audit event via ~/.fluid/store/audit/. Credentials are scrubbed before write. Use fluid memory show audit to query.

Auto-detection: fluid ai status lists what you have

fluid ai status
# Configured sources:
#   snowflake-prod  (snowflake / key_pair, ANALYST role)
#   unity-staging   (databricks / pat)
#   datahub-corp    (datahub / pat)

When you run fluid forge without --source, AI mode's interview auto-suggests the configured source — same UX as fluid ai setup auto-discovering provider env vars.

Error model

Every catalog adapter raises one of three typed exceptions, each carrying a suggestions: list[str] field with the next-action:

ExceptionWhenExample suggestion
CatalogConfigErrorMissing dep, wrong shape, missing required fieldpip install "data-product-forge[snowflake]"
CatalogConnectionErrorNetwork / hostname / not-foundVerify SNOWFLAKE_ACCOUNT="myorg-abc12345" matches the URL in the Snowsight UI.
CatalogPermissionErrorMissing privilege / wrong roleGRANT USAGE ON SCHEMA BIZ_LAB.SEEDED TO ROLE ANALYST;

The next-action is in the message — not buried in the docs.

See also

  • Forge data-model overview
  • MCP server walkthrough
  • Add your own catalog adapter
Edit this page on GitHub
Last Updated: 5/17/26, 6:10 PM
Contributors: fas89, Claude Opus 4.7, Claude Opus 4.7 (1M context)
Next
BigQuery Catalog