Webinars

Master Prometheus on Kubernetes: Spotting every issue without alert fatigue

August 14, 2024

7:00 pm

CET

CEST

1:00 PM

EDT

45 min

Bi-weekly

Prometheus is the de-facto standard for gathering metrics on Kubernetes and making sure that you discover problems early - before they impact users. But setting up a well-oiled Prometheus stack is surprisingly hard! Many companies have such a high volume of daily Prometheus alerts that they end up missing critical issues among the noise.

Apply now

Watch webinar

Apply now

In this talk, we will cover the architectural details you must understand about Kube-Prometheus-Stack to be successful. Then we will dive into best practices and techniques that you can use to make sure that no critical issue goes undetected, and that your team is not overwhelmed with alerts no matter how big your environment and how many clusters you have.

What are the key components of Kube-Prometheus-Stack and what is the ideal setup and common gotchas
How you can reduce the chance of missing critical alerts and take control of alerting volume
What other tools are present in the open-source Prometheus ecosystem which can help you

Audience - who should join?

Platform engineers, platform architects, DevOps engineers, site reliability engineers (SREs), infrastructure and operations, security engineers, enterprise and solution architects, application developers with an affinity for platform engineering and technical management focusing on improving DevEx and ops efficiency.

Apply now

Watch webinar

Apply now

Natan Yellin

Cofounder, Robusta.dev

Natan Yellin is the creator of HolmesGPT and Robusta - the leading open source AIOps platform for reducing your company’s mean-time-to-response with automatic AI investigations of Prometheus alerts. He works with enterprises running Kubernetes at scale, and helps them succeed with Prometheus monitoring and Kubernetes incident response.