When applications go in production one of our main concerns is to ensure that they are properly monitored, notably with appropriate checks and suitable metrics to report about their availability over time. This article comes to tackle this concern. Particularly focused on applications running on Kubernetes, it sets up a standard to monitor, measure, and observe the availability of applications. The goal being to help organizations to define Service Level Objectives (SLO) and/or Service Level Agreements (SLA) while be able to track them through factual KPIs over time.
This article is compose of two main sections. The first one is conceptual, introducing our fundamentals and assumptions to define, monitor, and measure application availability on Kubernetes. The second one is practical, demonstrating an implementation powered by RealOpInsight — an open source application operations monitoring framework designed to work atop of Kubernetes while leveraging the basic probe capabilities of the later.
Given one or more instances of Kubernetes, the goal is to be able to monitor, measure, and track the availability of applications as established by the following tenets:
#kubernetes #prometheus #microservices #monitoring