Recent Posts

Data plateform infrastructure bootstrap

2 minute read

Introduction The motivation of this article is to show how easily one can setup a resiliant, scalable and budget data processing plateform infrastucture in c...

Spark ETL – Airflow on Kubernetes, Part 1

4 minute read

Introduction Airflow is nowadays a widely used ETL scheduler. I really like that one can write ETL pipelines in python code. My colleague Laurent said : Airf...