RUBY 480: The Sounds of Silence: Lessons From an API Outage with Paul Zaich

Episode 489 · December 1st, 2020 · 47 mins 40 secs

About this Episode

Paul Zaich from Checkr tells us about a critical outage that occurred, what caused it and how they tracked down and fixed the issue. The conversation ranges through troubleshooting complex systems, building team culture, blameless post-mortems, and monitoring the right things to make sure your applications don't fail or alert you when they do.

Panel

  • Charles Max Wood
  • Dave Kimura
  • Luke Stutters

Guest

  • Paul Zaich

Links

Picks