
Bauplan is a data lakehouse designed to run real production workloads while staying simple, interoperable, and controllable.
Bauplan is a data lakehouse built on open formats. Data is stored in object storage and managed using Apache Iceberg, giving you transactional guarantees, schema evolution, and time travel without tying you to a single engine or vendor.
You can run Python, SQL, and mixed workloads on the same tables. Curated data stays portable and accessible across your stack.



Bauplan uses a function-as-a-service execution model for data pipelines. You write pure Python. Bauplan handles execution, isolation, and scaling. There are no clusters to manage, no long-running infrastructure, and no hidden runtime state. Each run is deterministic, isolated, and reproducible.

Bauplan runs production-grade workloads without the operational overhead of traditional platforms.Deploy in single-tenant environments, use PrivateLink for network isolation, or bring your own cloud with BYOC. Your data stays in your object storage at all times. There is no copying or centralizing data into a proprietary system.

Bauplan is SOC 2 Type II compliant and includes built-in isolation, access controls, and auditability suitable for regulated environments.


Run pipelines in Bauplan, then expose curated Iceberg tables to warehouses, lakehouses, and SQL engines. Connect tables directly to BI tools. Query data through Bauplan itself. Or use the Python SDK to work from your preferred notebook environment.You control how data is consumed and where computation happens.