Fluid ForgeFluid Forge
Home
Get Started
  • Local (DuckDB)
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
CLI Reference
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
Home
Get Started
  • Local (DuckDB)
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
CLI Reference
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
  • Introduction

    • /
    • Getting Started
    • Snowflake Quickstart
    • Vision & Roadmap
  • Walkthroughs

    • Walkthrough: Local Development
    • Walkthrough: Deploy to Google Cloud Platform
    • Walkthrough: Snowflake Team Collaboration
    • Declarative Airflow DAG Generation - The FLUID Way
    • Generating Orchestration Code from Contracts
    • Jenkins CI/CD for FLUID Data Products
    • Universal Pipeline
  • CLI Reference

    • CLI Reference
    • fluid init
    • fluid forge
    • fluid status
    • fluid validate
    • fluid plan
    • fluid apply
    • fluid generate
    • fluid publish
    • fluid market
    • fluid import
    • fluid policy-check
    • fluid diff
    • fluid test
    • fluid verify
    • fluid config
    • fluid split
    • fluid bundle
    • fluid auth
    • fluid doctor
    • fluid providers
    • fluid version
  • Providers

    • Providers
    • Provider Architecture
    • GCP Provider
    • AWS Provider
    • Snowflake Provider
    • Local Provider
    • Creating Custom Providers
    • Provider Roadmap
  • Advanced

    • Blueprints
    • Governance & Compliance
    • Airflow Integration
    • Built-in And Custom Forge Guidance
    • FLUID Forge Contract GPT Packet
    • Forge Discovery Guide
    • Forge Memory Guide
  • Project

    • Contributing to Fluid Forge
    • Fluid Forge Docs Baseline: CLI 0.7.9
    • Fluid Forge v0.7.1 - Multi-Provider Export Release

Forge Discovery Guide

This guide explains how the current fluid forge flow discovers local context before it scaffolds a project.

Current public entry point

Use fluid forge, not the older fluid forge --mode copilot examples that may still appear in archived material.

Examples:

fluid forge
fluid forge --discovery-path ./data
fluid forge --discovery-path ../shared-schemas
fluid forge --no-discover

What discovery is for

Discovery gives Forge grounded local context so it can:

  • ask fewer follow-up questions
  • infer useful provider or domain hints
  • summarize local schemas without sending raw rows
  • generate a stronger first contract draft

What Forge scans

Forge may inspect local:

  • SQL files
  • dbt projects
  • Terraform files
  • existing FLUID contracts
  • README headings
  • sample CSV, JSON, JSONL, Parquet, and Avro files

Optional helpers

If you want richer Parquet and Avro schema inspection, install the optional discovery helpers:

pip install "data-product-forge[copilot]"

That adds local readers such as pyarrow and fastavro. Without them, Forge can still discover files, but schema extraction is more limited.

Privacy boundary

Discovery is metadata-first.

Forge sends distilled summaries such as:

  • column names
  • inferred scalar types
  • referenced tables
  • provider hints
  • existing contract ids and expose ids

Forge does not send:

  • raw sample rows
  • full file contents
  • secrets
  • API keys
  • passwords

How discovery affects generation

Discovery is combined with:

  • CLI flags
  • current-run answers
  • optional project memory
  • built-in defaults

That combined context shapes the contract draft and any follow-up prompts.

Related guides

  • Forge memory guide
  • CLI reference for fluid forge
Edit this page on GitHub
Last Updated: 4/16/26, 11:38 AM
Contributors: Jeff Watson, jeffwatson-ai
Prev
FLUID Forge Contract GPT Packet
Next
Forge Memory Guide