Bauplan blog

Engineering

Apr 16, 2025

Written by Ciro Greco

Bauplan is a serverless data platform that treats pipelines, models, and tables like software — versioned, testable, and ready for agents.

Announcement

Apr 16, 2025

Written by Bauplan Team

Announcing $7.5M seed round led by Innovation Endeavors

Case study

Apr 9, 2025

Written by Jacopo Tagliabue and Marco Reni

From legacy sprawl to lightning-fast pipelines: how Mediaset rebuilt their data stack with Bauplan and Temporal—and cut dashboard refresh times from 60 minutes to 5.

Reference

Mar 26, 2025

Written by Ciro Greco

End-to-end RAG system for a conversational service agent

Engineering

Mar 12, 2025

Written by Ciro Greco

Build simple, robust data apps with software engineering principles.

Engineering

Jan 13, 2025

Written by Nathan LeClaire

Lessons learned crafting a Serverless Lakehouse from spare parts

Reference

Dec 20, 2024

Written by Ciro Greco

Full-stack recommender system with Bauplan for data preparation and training, and MongoDB Atlas for real-time inference.

Research

Dec 2, 2024

Written by J. Tagliabue, T. Caraza-Harter and C. Greco

Paper presented at WoSC10 2024. In collaboration with the University of Wisconsin.

Engineering

Nov 22, 2024

Written by Nathan LeClaire and Ciro Greco

Making the experience of running data workflows in the cloud indistinguishable from running them locally.

Engineering

Nov 20, 2024

Written by Luca Bigon and Jacopo Tagliabue

DAG planning using an in-memory graph database. In collaboration with Kùzu.

Research

Nov 12, 2024

Written by J. Tagliabue, R. Curtin and C. Greco

Reference

Oct 21, 2024

Written by J. Tagliabue and C. White

A reference implementation of the Write-Audit-Publish (WAP) pattern with Bauplan and Prefect 3.0.

Engineering

Sep 18, 2024

Written by Ciro Greco

Find the right balance between cost control and fast startup time for your Spark clusters.

Research

Jun 9, 2024

Written by Jacopo Tagliabue and Ciro Greco

Paper presented at SIGMOD/PODS 2024. Awarded best paper at DEEM@SIGMOD.

Open Source

Apr 11, 2024

Written by Ciro Greco

An open-source implementation of WAP using Apache Iceberg, Lambdas, and Nessie, all running entirely in Python.

Engineering

Mar 1, 2024

Written by Ciro Greco

Working on production data is the only way to know whether our applications will work.

Engineering

Nov 27, 2023

Written by Ciro Greco

Why production cloud environments are too slow and too hard to develop in.

Engineering

Sep 6, 2023

Written by Ciro Greco and Jacopo Tagliabue

The greatest invention since sliced Virtual Machines.

Research

Aug 10, 2023

Written by Jacopo Tagliabue, Ciro Greco, and Luca Bigon

Open Source

Jun 4, 2023

Written by Ciro Greco

An open-source implementation of a Data Lake with DuckDB and AWS Lambdas.
