Case Study

How HashiCorp SREs Built HCP's Incident Management Program

Martin Smith and Michael Main reveal how HashiCorp built an incident management program for the HashiCorp Cloud Platform.

»Some History

Back in 2019 and 2020, HashiCorp’s Cloud SREs worked with their engineering teams to build an incident management program for the then soon-to-be-launched HashiCorp Cloud Platform, a core platform on which cloud service offerings for Terraform, Vault, Consul, Packer, and more would be managed.

»What You'll Learn

HashiCorp engineers Martin Smith and Michael Main look back at how they initially approached this task, including a walkthrough of the HashiCorp RFC process and some of the questions they encountered along the way. For example:

  • Tooling selection challenges

  • Whether to use words like retrospective instead of postmortem

  • and more

The speakers will then talk about how that process has continued to evolve into 2022, as well as what they think the future holds for incident management at HashiCorp.

»Why it Matters

Get an inside look at how a company built its own cloud offerings of existing open core projects. The hope is to provide insights and ideas for other engineers who want to take the same steps and build a similar incident response program at their own company.
