
"Before Bauplan, every major breakking news event created sudden traffic spikes that threatened to crash our dashboards.Our infrastructure couldn't handle the load, causing unpredictable outages precisely when our users were most active. With Bauplan's medallion architecture, we've eliminated the bottlenecks - the system runs efficiently even during peak traffic and we can finally focus on our product, not our servers."
Mediaset, Europe’s largest free-to-air broadcaster (~€6.8 billion in annual revenue) and reaching over 65 million viewers daily, faced a serious operational challenge.
Whenever a major story broke on its flagship digital news product TGCOM24 (celebrity scandals, political announcements, breaking sports news) traffic would surge by order of magnitude in minutes and the analytics infrastructure couldn’t respond adequately.
When dashboards stalled, editors couldn’t track live metrics like click-throughs, dwell time, or referral mix. In other words, editors were forced to make key publishing decisions without data.
Mediaset’s analytics workflow relied on a multi-layer setup: Spark jobs running on AWS EMR processed data into S3, Athena provided SQL access on top, and dashboards were built in Qlik. The combination required:
Even with that expertise, the system struggled under peak load. During breaking-news surges, query failures propagated across the stack, dashboards took minutes to load, and editors went blind to audience behavior.
Mediaset’s data team wanted to adopt a medallion architecture, separating pipelines into bronze, silver, and gold layers to isolate compute stages, ensure data quality at each step, and optimize the final gold layer for fast queries.
In practice, implementing that pattern across Spark jobs, Athena queries, and Qlik transformations required significant engineering effort and ongoing coordination between backend and BI specialists.
Bauplan solve this problem by abstracting away all the infrastructure and providing simple abstractions for the engineering team.
First of all, Bauplan allowed to replace a multi-tool stack with a unified Python-first lakehouse architecture on S3.
All transformations, from raw ingestion to gold aggregations, are now expressed as declarative Bauplan models in SQL and Python.
Each layer became a self-contained, version-controlled function:
Thanks to declarative I/O, built-in isolation, and zero-copy branching, these stages could be chained safely without additional orchestration code or data movement.
🚀 Instead of a team of senior engineers and a Qlik specialist, two junior engineers assisted by a data lead were able to implement the full medallion workflow in weeks.

