Monitoring Nomad with Prometheus and Icinga
Learn how to integrate icinga2 with HashiCorp Consul and Nomad, then build Prometheus visualizations with Grafana.
Things like infrastructure as code, service discovery, and config management have helped us quickly build and rebuild infrastructure but we haven't spent nearly enough time to train ourselves to review, monitor, and respond to outages. Does your platform degrade in a graceful way? What does a high CPU load really mean? What can we learn from level 1 outages to be able to run our platforms more reliably?
How's Your SRE?
We all love infrastructure as code, we all "automate everything™" but how many of us can really say we could destroy and recreate our core infrastructure without human intervention. A container scheduler adds application life cycle management, scheduling, and placement based on available resources and connectivity features to your cloud system. It takes away the responsibility from you to take care for these tasks.
What You'll Learn
This talk will focus on making sure you have alerts and metrics in this quickly changing infrastructure landscape. You'll learn how to integrate icinga2 with HashiCorp Consul and Nomad. Lastly, you'll learn how to build and visualize the Prometheus data in a way that resembles Netflix's vizceral using freely available Grafana dashboards and plugins.