Fluid Forge
Get Started
See it run
  • Local (DuckDB)
  • Source-Aligned (Postgres → DuckDB)
  • AI Forge + Data Models
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
  • 11-Stage Production Pipeline
  • Catalog Forge End-to-End
CLI Reference
  • Overview
  • Quickstart
  • Examples
  • Your own CI
  • Your own scaffolding
  • Custom validator
  • Apply hook
  • Reference
Demos
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
Get Started
See it run
  • Local (DuckDB)
  • Source-Aligned (Postgres → DuckDB)
  • AI Forge + Data Models
  • GCP (BigQuery)
  • Snowflake Team Collaboration
  • Declarative Airflow
  • Orchestration Export
  • Jenkins CI/CD
  • Universal Pipeline
  • 11-Stage Production Pipeline
  • Catalog Forge End-to-End
CLI Reference
  • Overview
  • Quickstart
  • Examples
  • Your own CI
  • Your own scaffolding
  • Custom validator
  • Apply hook
  • Reference
Demos
  • Overview
  • Architecture
  • GCP (BigQuery)
  • AWS (S3 + Athena)
  • Snowflake
  • Local (DuckDB)
  • Custom Providers
  • Roadmap
GitHub
GitHub
  • Introduction

    • Home
    • Getting Started
    • Snowflake Quickstart
    • See it run
    • Forge Data Model
    • Vision & Roadmap
    • Playground
    • FAQ
  • Concepts

    • Concepts
    • Builds, Exposes, Bindings
    • What is a contract?
    • Quality, SLAs & Lineage
    • Governance & Policy
    • Agent Policy (LLM/AI governance)
    • Providers vs Platforms
    • Fluid Forge vs alternatives
  • Data Products

    • Product Types — SDP, ADP, CDP
  • Walkthroughs

    • Walkthrough: Local Development
    • Source-Aligned: Postgres → DuckDB → Parquet
    • AI Forge And Data-Model Journeys
    • Walkthrough: Deploy to Google Cloud Platform
    • Walkthrough: Snowflake Team Collaboration
    • Declarative Airflow DAG Generation - The FLUID Way
    • Generating Orchestration Code from Contracts
    • Jenkins CI/CD for FLUID Data Products
    • Universal Pipeline
    • The 11-Stage Pipeline
    • End-to-End Walkthrough: Catalog → Contract → Transformation
  • CLI Reference

    • CLI Reference
    • fluid init
    • fluid demo
    • fluid forge
    • fluid skills
    • fluid status
    • fluid validate
    • fluid plan
    • fluid apply
    • fluid generate
    • fluid generate artifacts
    • fluid validate-artifacts
    • fluid verify-signature
    • fluid generate-airflow
    • fluid generate-pipeline
    • fluid viz-graph
    • fluid odps
    • fluid odps-bitol
    • fluid odcs
    • fluid export
    • fluid export-opds
    • fluid publish
    • fluid datamesh-manager
    • fluid market
    • fluid import
    • fluid policy
    • fluid policy check
    • fluid policy compile
    • fluid policy apply
    • fluid contract-tests
    • fluid contract-validation
    • fluid diff
    • fluid test
    • fluid verify
    • fluid product-new
    • fluid product-add
    • fluid workspace
    • fluid ide
    • fluid ai
    • fluid memory
    • fluid mcp
    • fluid scaffold-ci
    • fluid scaffold-composer
    • fluid scaffold-ide
    • fluid docs
    • fluid config
    • fluid split
    • fluid bundle
    • fluid auth
    • fluid doctor
    • fluid providers
    • fluid provider-init
    • fluid roadmap
    • fluid version
    • fluid runs
    • fluid retention
    • fluid secrets
    • fluid stats
    • fluid contract
    • fluid ship
    • fluid rollback
    • fluid schedule-sync
    • Catalog adapters

      • Source Catalog Integration (V1.5)
      • BigQuery Catalog
      • Snowflake Horizon Catalog
      • Databricks Unity Catalog
      • Google Dataplex Catalog
      • AWS Glue Data Catalog
      • DataHub Catalog
      • Data Mesh Manager Catalog
    • CLI by task

      • CLI by task
      • Add quality rules
      • Add agent governance
      • Debug a failed pipeline run
      • Switch clouds with one line
  • Recipes

    • Recipes
    • Recipe — add a quality rule
    • Recipe — switch clouds with one line
    • Recipe — tag PII in your schema
  • SDK & Plugins

    • SDK & Plugins
    • Quickstart — your first plugin
    • Examples

      • Runnable examples
      • Example: hello-scaffold — the minimal viable plugin
      • Example: gitlab-ci-scaffold — generate a complete CI project
      • Example: steward-validator — a custom governance rule
      • Example: prod-key-guard — apply-time invariant check
    • Journeys

      • Journeys
      • Your own CI/CD

        • You have your own CI/CD setup, no problem
        • GitLab CI — the bundle template
        • GitHub Actions — the bundle template
        • Jenkins — the bundle template
        • CircleCI — the bundle template
      • You have a strict project layout, no problem
      • You have governance rules, no problem
      • You want a check at apply time, no problem
    • Reference

      • Reference
      • Roles reference
      • Entry points reference
      • Trust model
      • Packaging
      • Companion packages
  • Providers

    • Providers
    • Provider Architecture
    • GCP Provider
    • AWS Provider
    • Snowflake Provider
    • Local Provider
    • Creating Custom Providers
    • Provider Roadmap
  • Advanced

    • Blueprints
    • Governance & Compliance
    • Airflow Integration
    • Built-in And Custom Forge Guidance
    • FLUID Forge Contract GPT Packet
    • Forge Discovery Guide
    • Forge Memory Guide
    • LLM Providers
    • Capability Warnings
    • LiteLLM Backend (opt-in)
    • MCP Server
    • Credential Resolver — Security Model
    • Cost Tracking
    • Agentic Primitives
    • Typed Errors
    • Typed CLI Errors
    • Authoring Forge Tools
    • Source-Aligned Acquisition
    • API Stability — fluid_build.api
    • Guided fluid forge UX
    • V1.5 Catalog Integration — Architecture Deep-Dive
    • V1.5 + V2 Hardening — Release Notes
  • Project

    • Contributing to Fluid Forge
    • Fluid Forge Docs Baseline: CLI 0.8.3
    • Fluid Forge Docs Baseline: CLI 0.8.0
    • Fluid Forge Docs Baseline: CLI 0.7.11
    • Fluid Forge Docs Baseline: CLI 0.7.9
    • Fluid Forge v0.7.1 - Multi-Provider Export Release

LLM Providers

Forge data-model runs use one active LLM provider per run. The provider is selected from the CLI flag, environment, or saved AI config:

fluid forge data-model from-intent intent.yaml \
  -o customer_orders.fluid.yaml \
  --llm-provider gemini
FLUID_LLM_PROVIDER=ollama \
FLUID_OLLAMA_MODEL=gemma4:latest \
fluid forge data-model from-intent intent.yaml -o customer_orders.fluid.yaml

Supported providers

ProviderDefault / common modelNotes
Anthropicclaude-sonnet-4-6Tool-forced structured output and provider-native prompt caching. Streamed runs report accurate token usage in cost summaries (was previously "missing usage" on every streamed call).
OpenAIgpt-4.1-miniStrict JSON Schema output where available; seed support. Tiered runs use gpt-4.1 for deep logical modeling. Set FLUID_OPENAI_STRICT_SCHEMA=1 to harden the response-format schema for gpt-4o/gpt-4.1/o-series models that reject permissive nested objects.
Geminigemini-2.5-proUses Gemini response schema where suitable and validator repair when needed
OllamaFLUID_OLLAMA_MODEL such as gemma4:latestLocal-only; JSON mode is model-gated. Capability + token-budget catalogs cover gemma 1–4, qwen3-coder, qwen3, qwen2.5, llama3.1/3.2/3.3, mistral, mixtral, deepseek, phi. See Capability Warnings for tool-use accuracy notes per family.
Azure OpenAIFLUID_AZURE_DEPLOYMENTOpenAI-compatible wire shape with deployment names

Inspect the active catalog with:

fluid ai models
fluid ai models --provider gemini --json

Tiered mode

--tiered chooses different models within the same provider, never across providers. A typical layout is:

TierRole
deephardest reasoning and planning
balancedmain model-building execution
fastrouting, clarification, and light evaluation

If a provider has no distinct tier models configured, the CLI collapses tiered mode to a single-model run and emits a one-line warning. Ollama commonly runs this way unless the local model catalog is configured with separate fast, balanced, and deep models.

The deterministic stages stay deterministic even in tiered mode:

StageModel use
InterviewFast routing model
Logical modelerDeep model
Contract forgeNo model, deterministic
TransformationNo model, deterministic from .model.json
ValidatorNo model, deterministic
Self-evaluationFast routing model

Strict provider testing

For normal user experience, forge can fall back to deterministic heuristics if an LLM call fails. For provider certification and E2E testing, use:

fluid forge data-model from-intent intent.yaml \
  -o customer_orders.fluid.yaml \
  --llm-provider anthropic \
  --require-llm

--require-llm fails loudly if the provider cannot run. This prevents a green-looking smoke test that actually used heuristics.

Deterministic runs

fluid forge data-model from-intent intent.yaml \
  -o customer_orders.fluid.yaml \
  --deterministic

--deterministic disables cache and tiering for replayable output. Providers pin temperature=0; OpenAI, Ollama, and Azure OpenAI also pin seed where supported.

Environment variables

Provider + credentials

Env varPurpose
FLUID_LLM_PROVIDERActive provider for the run
FLUID_LLM_MODELSpecific model override
FLUID_LLM_TIMEOUT_SECONDSProvider HTTP timeout
OPENAI_API_KEYOpenAI key
ANTHROPIC_API_KEYAnthropic key
GOOGLE_API_KEY or GEMINI_API_KEYGemini key
OLLAMA_HOSTOllama endpoint; local addresses only
FLUID_OLLAMA_MODELOllama model name

Agent-loop tuning

Env varPurpose
FLUID_AGENT_COMPACT_AFTERIteration count after which the multi-turn agent loop compacts older tool results to stay under the model's context window. Default 6. Set to a higher number for long-context Anthropic / Gemini runs; lower for tight-context Ollama models.
FLUID_COMPACTION_STRATEGYtruncate (default — char/token-aware truncation), summarize (LLM-backed; calls your provider's fast tier once per compaction trigger), or hybrid (truncate first, then summarize the rest if still over budget). See Agentic primitives → Token-budget pre-flight & compaction.
FLUID_TOKEN_COUNTERInternal — selects the token-counting backend. Default is the pure-Python char-based heuristic; the CLI does not require an external tokenizer.
FLUID_OPENAI_STRICT_SCHEMA1 to enable the recursive strict-schema walker for OpenAI's response_format = json_schema mode. Closes the "Invalid schema for response_format 'ForgeContract'" 400 some gpt-4o/gpt-4.1/o-series deployments return when nested objects are free-form. Free-form fields are rewritten to JSON-encoded strings under strict mode.
FLUID_QUIET / FLUID_NONINTERACTIVE1 to silence the v2-preview banner and capability-degradation warnings. The warnings are still recorded to telemetry.

Use fluid ai setup for interactive setup and key storage. Provider and model choices are saved in ~/.fluid/ai_config.json; API keys go to the OS keyring by default. Plaintext API-key persistence requires explicit opt-in with FLUID_ALLOW_PLAINTEXT_AI_SECRETS=1.

Run-start capability warnings

When you pick a provider/model whose declared capabilities don't satisfy what the run needs (e.g. gpt-3.5 in agent-loop mode, an Ollama model with no tool-use support, or a model not yet in the capability catalog), the CLI prints a one-paragraph warning at the start of fluid forge data-model from-intent and continues with degraded behaviour. See Capability Warnings for the full matrix and the exact banner shape.

Operator-facing errors

When a provider call fails, the CLI raises a typed exception that distinguishes rate limits, context-overflow, auth failures, transient server errors, and schema-validation failures so retries honor Retry-After and the agent loop can route corrective feedback to the LLM. See Typed Errors for the full reference.

For complete command journeys, see AI Forge And Data-Model Journeys.

Edit this page on GitHub
Last Updated: 5/17/26, 6:10 PM
Contributors: fas89, Claude Opus 4.7, Claude Opus 4.7 (1M context)
Prev
Forge Memory Guide
Next
Capability Warnings