Posts by Category

Kubernetes

Data platform infrastructure bootstrap

2 minute read

Introduction: The motivation of this article is to show how easily one can set up a resilient, scalable, and budget-friendly data processing platform infrastructure in c...

Spark ETL – Airflow on Kubernetes, Part 1

4 minute read

Introduction: Airflow is now a widely used ETL scheduler. I really like that one can write ETL pipelines in Python code. My colleague Laurent said: Airf...

Spark Structured Streaming on Kubernetes

5 minute read

Introduction: There are many ways to process real-time data. At the company I work for, we use Kafka as the messaging service. Naturally, this leads to using Kafka s...

Behind the Docker Daemon Is the OOM-Killer

1 minute read

I recently wrote an article about Datadog on Kubernetes. The Datadog jmxfetch (Java) process got killed by the host OOM-killer due to the JVM heap size ...

Spark

Spark ETL – Airflow on Kubernetes, Part 1

4 minute read

Introduction: Airflow is now a widely used ETL scheduler. I really like that one can write ETL pipelines in Python code. My colleague Laurent said: Airf...

Spark Structured Streaming on Kubernetes

5 minute read

Introduction: There are many ways to process real-time data. At the company I work for, we use Kafka as the messaging service. Naturally, this leads to using Kafka s...

Docker

Behind the Docker Daemon Is the OOM-Killer

1 minute read

I recently wrote an article about Datadog on Kubernetes. The Datadog jmxfetch (Java) process got killed by the host OOM-killer due to the JVM heap size ...

Airflow

Spark ETL – Airflow on Kubernetes, Part 1

4 minute read

Introduction: Airflow is now a widely used ETL scheduler. I really like that one can write ETL pipelines in Python code. My colleague Laurent said: Airf...

Terraform

Data platform infrastructure bootstrap

2 minute read

Introduction: The motivation of this article is to show how easily one can set up a resilient, scalable, and budget-friendly data processing platform infrastructure in c...

DevOps

Data platform infrastructure bootstrap

2 minute read

Introduction: The motivation of this article is to show how easily one can set up a resilient, scalable, and budget-friendly data processing platform infrastructure in c...

Datadog

Monitoring

JMX

Kernel

Behind the Docker Daemon Is the OOM-Killer

1 minute read

I recently wrote an article about Datadog on Kubernetes. The Datadog jmxfetch (Java) process got killed by the host OOM-killer due to the JVM heap size ...

General

I am moving away from Medium

less than 1 minute read

It has been more than a year since I last updated my Medium space. Unfortunately, Medium does not seem to be a suitable tool for me. As a tech person, it is way ...

AWS

Data platform infrastructure bootstrap

2 minute read

Introduction: The motivation of this article is to show how easily one can set up a resilient, scalable, and budget-friendly data processing platform infrastructure in c...

GCP

MacOS
