Bauplan is the execution layer for AI-generated data changes in production. It lets data engineering teams and AI agents safely run, validate, and publish changes to production data using branch-based isolation, transactional pipeline execution, and a serverless compute runtime.
It provides Git-style branching and versioning for data, a function as a service compute runtime, and a code-first control surface designed for both human engineers and autonomous agents.
All operations on the platform are programmable through a typed Python SDK and a CLI. Every action, branching, execution, validation, and publishing, composes through a small set of APIs that agents and humans call the same way. Bauplan also ships an MCP server that exposes the full lifecycle as tool calls for any MCP-compatible assistant.
Bauplan sits on top of your object storage and manages data as Apache Iceberg tables. Your data stays in your S3. Bauplan reads and writes directly from your storage and never copies or ingests your data. It produces versioned Iceberg outputs that remain compatible with any Iceberg-capable engine and catalog, so you keep your existing tools for analytics and BI while using Bauplan as the execution and change-management layer for pipelines and agent workflows.
Bauplan lets AI agents work with production data safely, at scale, and affordably.
Your team wants AI agents to work with production data. Engineers already use Claude Code, Cursor, or Copilot to write pipeline logic and generate SQL. The productivity gains are real. The gap is infrastructure: your current stack has no safe way to let agents execute changes on production data end to end. AI generates the code, but a human still stages and deploys it. Bauplan closes this loop. Agents branch, run, validate, and merge through a typed API. Production is protected by the system architecture, so your team scales automation without scaling headcount or risk.
Autonomous agents operating on data at scale run into three structural problems with today's platforms:
Bauplan solves all three problems.
Bauplan is an agentic data platform. It belongs to the emerging category of data infrastructure designed for autonomous and semi-autonomous workflows on production data.
Within this category, Bauplan operates as the execution layer: the part of the stack that governs how data changes run, validate, and publish. It complements ingestion tools (Airbyte, Fivetran, Estuary), orchestrators (Airflow, Prefect, Dagster), and BI tools by providing the transactional substrate underneath.
Bauplan is designed for a world where the default analytical workload is no longer a human running a small number of carefully prepared jobs. It is built for agents generating SQL and Python, probing data, proposing changes, and iterating repeatedly, often in parallel.
Bauplan is not a data warehouse, not an orchestrator and not an ingestion tool. It is the execution layer that lets AI work safely on production data.
CapabilityTraditional Data PlatformsBauplanExecution modelWrites directly to shared tablesEvery change runs on an isolated branchFailure handlingPartial failures leave inconsistent stateFailed runs leave production unchangedPublicationChanges go live as they completeAtomic multi-table commits on mergeInterfaceDashboards, notebooks, GUIsTyped Python SDK, CLI, and MCP serverAgent compatibilityRequires wrappers and glue codeAgent Native, CLAUDE.md, MCP and Agent SkillsComputePersistent clusters or warehouse sessionsEphemeral serverless functions (FaaS)IsolationManual staging environmentsZero-copy branching on Apache Iceberg fully built-inData residencyPlatform-managed storageYour data stays in your object storagePricing modelPer-query or per-compute-minute with markupsAgent friendly. Flat monthly tiers based on capacity, unlimited queries and agents
Bauplan is built for software engineering and data engineering teams of 3 to 15 engineers who own production pipelines and downstream data products. These teams typically sit inside fast-growing technology companies or mid-to-large enterprises modernizing their data stacks.
Common user profiles include Heads of Data, Directors of Data Engineering, Senior Data Engineers, VP of Analytics, and VP of Engineering. These teams treat data systems like software, expect Git-style workflows, and plan for AI participation in development.
Use Bauplan for workloads where you produce and maintain tables: ingesting files into curated datasets, building transformation pipelines, running backfills, enforcing data quality tests, and iterating quickly on logic and outputs.
Customers include Trust & Will, Moffin, Veed.io, Scops.ai, Intella, Suit Supply, RealPage, and Mediaset.