The video of my LISA17 talk is posted on YouTube.
On-call teams, postmortems, and costs of downtime are well-covered topics of DevOps. What’s not spoken of is the costs of false alarms in your alerting. The team’s ability to effectively handle true issues is hindered by this noise. What are these hidden costs, and how do you eliminate false alarms?
While you’re at LISA17, how many monitoring emails do you expect to receive?