{"id":2060,"date":"2020-03-12T21:49:45","date_gmt":"2020-03-12T21:49:45","guid":{"rendered":"https:\/\/uptime.com\/blog\/?p=2060"},"modified":"2025-01-08T15:21:29","modified_gmt":"2025-01-08T15:21:29","slug":"how-to-improve-downtime-response","status":"publish","type":"post","link":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response","title":{"rendered":"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime"},"content":{"rendered":"<p>Every one of us reading this blog has seen a fire spring up and quietly walked away from the impending chaos. And everyone one of us has managed to live this long because we understand when to react to a fire.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-2062\" src=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/kYqDeN.gif\" alt=\"\" width=\"320\" height=\"240\" \/><\/p>\n<p>A real fire affects our Service Level Objectives (SLO), and affects the user base. You need to figure out where it is, what started it, and what your team will do about it, and you need to do that now. Then you need to examine how you responded to see whether any holes or roadblocks impede your progress.<\/p>\n<p>What are the key factors that have to align to get you out of bed with alarms and whistles? What wakes you up?<\/p>\n<p>We know incidents don\u2019t conform to our schedule, and that not every fire is worth bringing in the cavalry. So today, we\u2019re going to ask ourselves if this is really worth getting out of bed for. How do we not only prioritize alerts, but improve our response time?<\/p>\n<h2>Error Budgeting Your Bedtime<\/h2>\n<p>Vince Lombardi said \u201cPerfection is not attainable, but if we chase perfection we can catch excellence.\u201d SLOs built on unrealistic fulfillment are the first recipe we have for disaster. You are guaranteed to overpromise and under-deliver to a dwindling audience over time.<\/p>\n<h4>How to Error Budget<\/h4>\n<p>Good SLOs use something called an \u201cerror budget\u201d to ensure a manageable user experience. Secondarily, good SLOs make sure that the devops team isn\u2019t quietly plotting to burn the place down when no one\u2019s looking.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-2065\" src=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/nines-1-570x253.png\" alt=\"\" width=\"570\" height=\"253\" srcset=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/nines-1-570x253.png 570w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/nines-1-510x227.png 510w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/nines-1.png 1485w\" sizes=\"auto, (max-width: 570px) 100vw, 570px\" \/><\/p>\n<p>Error budgeting is another way of referring to the amount of unplanned downtime your organization deems acceptable. Here\u2019s a table from Google that provides figures for <a href=\"https:\/\/landing.google.com\/sre\/sre-book\/chapters\/availability-table\/#appendix_table-of-nines\">the math behind the nines<\/a>, but you want as many nines as you can conceivably handle. Budgets will differ based on team and infrastructure size. One person managing an app with hundreds of users has very different concerns than a team managing a service with hundreds of thousands logged in concurrently.<\/p>\n<p>But error budgets serve another important function: they define when to stop what you\u2019re doing and fix something. An SLO may or may not be legally binding, but it does provide a framework for how to treat systems under stress.<\/p>\n<p><strong>Simply put: you can continue deployments under ABC conditions, but stop everything for analysis and reliability work under XYZ conditions.<\/strong><\/p>\n<p>The best SLOs are extremely specific about when and how the team needs to react to unplanned downtime. If the error budget is consumed too quickly, the team must be able to explain what happened and create an incident report that details what went wrong. That report will help further define your escalation policies.<\/p>\n<h4>Defining an Escalation Policy<\/h4>\n<p>So, then, the question becomes: what is unplanned downtime? Simple answer: anything not within your control that triggers downtime. But when we think about SLOs, the answer gets a bit muddled. Let\u2019s say we have a website that is OK 200, with a support chat function that is closer to :sadface: 400. Does that down chat factor into our SLO?<\/p>\n<p>Is it worth getting out of bed for?<\/p>\n<p>Yes, we should know it\u2019s down and we should have a plan to fix it. Do we need to act on that at 3 AM when we first receive the alert it\u2019s dead? Probably not.<\/p>\n<p>So, part of error budgeting involves wasting less time figuring out whether the fire is a big one. When we know more about which systems affect the customer experience, we can waste less time deciding whether an alert is worth acting on.<\/p>\n<p>You can take this a step further. Finding yourself getting out of bed just to reboot a service? Automate it! Recurring errors giving you the blues? Tell tier 1 how to fix them while you sleep so you can feel refreshed enough to do the root cause analysis tomorrow and figure out how to stop them.<\/p>\n<p>All of these methods help improve downtime response without excessive costs to your error budget.<\/p>\n<h2>Two Steps to Beating Your Error Budget<\/h2>\n<p>There are two steps you can take to improve your downtime response and ultimately beat the numbers you&#8217;ve set as your error budget.<\/p>\n<h4>Collect Data on Uptime and Performance<\/h4>\n<p>The first is to start collecting performance data now. Here\u2019s a handy link to <a href=\"https:\/\/uptime.com\/devices\/services?monitoring_service_type=RUM\">create a RUM check<\/a> if you\u2019ve not already done so. Use this performance indicator as your first line of defense, because the user experience is why we all show up in the morning.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-1927\" src=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2019\/11\/RUM-example-570x459.png\" alt=\"rum-screenshot\" width=\"570\" height=\"459\" srcset=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2019\/11\/RUM-example-570x459.png 570w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2019\/11\/RUM-example-510x411.png 510w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2019\/11\/RUM-example.png 1005w\" sizes=\"auto, (max-width: 570px) 100vw, 570px\" \/><\/p>\n<p><a href=\"https:\/\/uptime.com\/blog\/what-is-real-user-monitoring\">Real User Monitoring<\/a> tells us how our website is performing, and it\u2019s powered by real user data. When you review a RUM report you step into the user\u2019s shoes for a clear picture of how they perceive the website. It\u2019s also got a built-in threshold for performance, alerting you when the load time exceeds the value you define.<\/p>\n<p>Performance checks like RUM are the sensors on our star cruiser, diligently informing us of the dangers ahead.<\/p>\n<p>The more performance thresholds you can identify as meaningful, the more data you have for first response. You know when it\u2019s slow, when slowness becomes downtime, and the severity and length of that downtime.<\/p>\n<h4>Escalate Unplanned Downtime<\/h4>\n<p>The second step we can take to improve downtime response is to practice meaningful escalations.<\/p>\n<p>The kind of downtime you\u2019ve experienced is crucial in determining what you do with your alerting. If we circle back to our RUM check, our defined threshold helps determine when it\u2019s time to examine the problem. Until that threshold is met, it\u2019s nice to know performance is fast or slower but service levels must have an acceptable operating range. No need to blow the horn just yet.<\/p>\n<p>If our RUM check triggers an alert, and some of our <a href=\"https:\/\/uptime.com\/uptime-monitoring\">HTTP(S) checks<\/a> go down, we might need to get out of bed.<\/p>\n<p>We can think of <a href=\"https:\/\/support.uptime.com\/hc\/en-us\/articles\/360005117559-Creating-Responsive-Escalations-for-Downtime-Alerting\">escalation<\/a> in terms of when and where. When should we escalate to the next tier, and where will our message have the greatest impact. First, let\u2019s tackle the \u201cwhere\u201d portion of escalations.<\/p>\n<p>Where is easy: wherever your team lives. Do you live on Slack? Are you all married to some other provider? Do you exist on mobile, or have internal dashboards and ticketing? Where is easy to define when we think about how your team works.<\/p>\n<p>Now let\u2019s think about \u201cwhy\u201d, which is the really interesting part.<\/p>\n<p>Modern applications are full of complex and interlocking systems, with redundancy built into everything we do. If one server from a farm of 10,000 goes down, is it really worth getting out of bed for?<\/p>\n<p>You would know if you had more clarity on which one went down.<\/p>\n<p>An alert that has no actionable takeaway isn\u2019t very useful. You\u2019ve just been roused out of sleep to learn that you shouldn\u2019t or can\u2019t actually do anything. Alerts like these are over before you see them, or they have little to no impact on the end-user.<\/p>\n<p>On the other hand, too many alerts will drive anyone to levels of insanity that create DevOps horror stories that make the front page of Reddit.<\/p>\n<div id=\"attachment_2067\" style=\"width: 580px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-2067\" class=\"size-large wp-image-2067\" src=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/lewis-parsons-xIFIoCgx4pc-unsplash-570x381.jpg\" alt=\"\" width=\"570\" height=\"381\" srcset=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/lewis-parsons-xIFIoCgx4pc-unsplash-570x381.jpg 570w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/lewis-parsons-xIFIoCgx4pc-unsplash-510x341.jpg 510w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/lewis-parsons-xIFIoCgx4pc-unsplash-1536x1026.jpg 1536w, https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/lewis-parsons-xIFIoCgx4pc-unsplash-2048x1368.jpg 2048w\" sizes=\"auto, (max-width: 570px) 100vw, 570px\" \/><p id=\"caption-attachment-2067\" class=\"wp-caption-text\">Photo by Lewis Parsons on Unsplash<\/p><\/div>\n<p>Why we escalate becomes a question of the cost of downtime to your organization. We looked at some hypothetical <a href=\"https:\/\/uptime.com\/blog\/black-friday-2019\">downtime costs during Black Friday 2019<\/a>, if you\u2019re interested to see what these calculations might take into account.<\/p>\n<p><strong>Simply put: what\u2019s the end-user cost to something going down? Is there potential for a bounce or a lost sale? If so, how apocalyptic is that threat?<\/strong><\/p>\n<p>Here are some quick guidelines you can use to assist in escalating:<\/p>\n<ul>\n<li>Would this outage prevent the end-user from accessing a critical portion of the application?<\/li>\n<li>Would this outage affect the end-user performance?<\/li>\n<li>Can a human intervene with a meaningful action?<\/li>\n<li>When does an alert become \u201curgent\u201d for your organization?<\/li>\n<\/ul>\n<h2>Auditing for Ineffective Alerts<\/h2>\n<p>When alerts overload us, we miss the important ones. Just like you have an error budget, you also have an alert budget, or a measure of how many alerts you can send before they all just look like a blur. The alert budget is measured in staff size, on-call hours, and resources generally available to you.<\/p>\n<p>Measure the number of alerts your team is receiving in a given period (say monthly or weekly), then look at how many had an immediate action taken. The escalations you don\u2019t need were ones that failed to produce an action item without further analysis conducted during normal business hours.<\/p>\n<p>If you follow our advice above and collect performance and user data continuously, you\u2019ll have a better gauge of how your systems affect one another and what the user is seeing.<\/p>\n<h2>Error Budgeting the User Experience<\/h2>\n<p>A more perfect internet is one where we all experience near-zero lag and permanent connectivity. We\u2019re kind of a long way away from that dream, so really error budgeting is about protecting the user experience as best we can.<\/p>\n<p>There\u2019s a balance between meaningful, actionable alerts, sane DevOps, and the user. Your job is to manage that balance.<\/p>\n<p>With Uptime.com, you have a few options. You can set escalations in bulk, allowing you to target entire systems in a single click. You can also add notes and pull metrics and alert data to package as you need.<\/p>\n<p>We also have <a href=\"https:\/\/uptime.com\/integrations\">the most integrations<\/a> in our class, with <a href=\"https:\/\/uptime.com\/status-page\">status pages<\/a> that can be public or private to your organization. We give you the visibility you need, high level or granular, delivered direct to you.<\/p>\n<p>Error budgeting, visibility, and alert budgeting are all related concepts. Less noise, actionable alerting and better on-call utilization are all good end goals. Your employees and users will appreciate a stable experience.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Every one of us reading this blog has seen a fire spring up and quietly walked away from the impending chaos. And everyone &#8230;<\/p>\n","protected":false},"author":1,"featured_media":2066,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[41,9],"tags":[70,68,71,69,72],"ppma_author":[169],"class_list":["post-2060","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-outages","category-resources","tag-downtime-alerts","tag-downtime-response","tag-error-budget","tag-error-budgeting","tag-improve-downtime-response"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.2 (Yoast SEO v27.2) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>How to Improve Downtime Response: Error Budgeting and Unplanned Downtime<\/title>\n<meta name=\"description\" content=\"Learn how to use the principles of error budgeting to improve downtime response. Uptime.com offers responsive alerting to improve downtime response.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime\" \/>\n<meta property=\"og:description\" content=\"Learn how to use the principles of error budgeting to improve downtime response. Uptime.com offers responsive alerting to improve downtime response.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\" \/>\n<meta property=\"og:site_name\" content=\"Uptime.com\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/uptimemonitor\" \/>\n<meta property=\"article:published_time\" content=\"2020-03-12T21:49:45+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-01-08T15:21:29+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png\" \/>\n\t<meta property=\"og:image:width\" content=\"700\" \/>\n\t<meta property=\"og:image:height\" content=\"500\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Uptime.com\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@UptimeDotCom\" \/>\n<meta name=\"twitter:site\" content=\"@UptimeDotCom\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Uptime.com\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"9 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#article\",\"isPartOf\":{\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\"},\"author\":{\"name\":\"Uptime.com\",\"@id\":\"https:\/\/uptime.com\/blog\/#\/schema\/person\/69f0940686123645bc00cbb227cad781\"},\"headline\":\"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime\",\"datePublished\":\"2020-03-12T21:49:45+00:00\",\"dateModified\":\"2025-01-08T15:21:29+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\"},\"wordCount\":1651,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/uptime.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage\"},\"thumbnailUrl\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png\",\"keywords\":[\"downtime alerts\",\"downtime response\",\"error budget\",\"error budgeting\",\"improve downtime response\"],\"articleSection\":[\"Outages\",\"Resources\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\",\"url\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\",\"name\":\"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime\",\"isPartOf\":{\"@id\":\"https:\/\/uptime.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage\"},\"image\":{\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage\"},\"thumbnailUrl\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png\",\"datePublished\":\"2020-03-12T21:49:45+00:00\",\"dateModified\":\"2025-01-08T15:21:29+00:00\",\"description\":\"Learn how to use the principles of error budgeting to improve downtime response. Uptime.com offers responsive alerting to improve downtime response.\",\"breadcrumb\":{\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage\",\"url\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png\",\"contentUrl\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png\",\"width\":700,\"height\":500},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/uptime.com\/blog\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/uptime.com\/blog\/#website\",\"url\":\"https:\/\/uptime.com\/blog\/\",\"name\":\"Uptime.com\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/uptime.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/uptime.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/uptime.com\/blog\/#organization\",\"name\":\"Uptime.com\",\"url\":\"https:\/\/uptime.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/uptime.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2022\/10\/Uptime.com_Wordmark_Horizontal_Black_Green.png\",\"contentUrl\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2022\/10\/Uptime.com_Wordmark_Horizontal_Black_Green.png\",\"width\":512,\"height\":83,\"caption\":\"Uptime.com\"},\"image\":{\"@id\":\"https:\/\/uptime.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/uptimemonitor\",\"https:\/\/x.com\/UptimeDotCom\",\"https:\/\/www.linkedin.com\/company\/uptime-com\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/uptime.com\/blog\/#\/schema\/person\/69f0940686123645bc00cbb227cad781\",\"name\":\"Uptime.com\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg34cb5744a58a7a87dbf8f078ae83ffc5\",\"url\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg\",\"contentUrl\":\"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg\",\"caption\":\"Uptime.com\"},\"description\":\"Uptime.com provides unmatched global visibility and real-time alerts on website, server, and API performance. Monitor downtime, speed, reliability, user experience with customizable status pages, dashboards, and reports.\",\"url\":\"https:\/\/uptime.com\/blog\/author\/upadmin\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime","description":"Learn how to use the principles of error budgeting to improve downtime response. Uptime.com offers responsive alerting to improve downtime response.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response","og_locale":"en_US","og_type":"article","og_title":"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime","og_description":"Learn how to use the principles of error budgeting to improve downtime response. Uptime.com offers responsive alerting to improve downtime response.","og_url":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response","og_site_name":"Uptime.com","article_publisher":"https:\/\/www.facebook.com\/uptimemonitor","article_published_time":"2020-03-12T21:49:45+00:00","article_modified_time":"2025-01-08T15:21:29+00:00","og_image":[{"width":700,"height":500,"url":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png","type":"image\/png"}],"author":"Uptime.com","twitter_card":"summary_large_image","twitter_creator":"@UptimeDotCom","twitter_site":"@UptimeDotCom","twitter_misc":{"Written by":"Uptime.com","Est. reading time":"9 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#article","isPartOf":{"@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response"},"author":{"name":"Uptime.com","@id":"https:\/\/uptime.com\/blog\/#\/schema\/person\/69f0940686123645bc00cbb227cad781"},"headline":"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime","datePublished":"2020-03-12T21:49:45+00:00","dateModified":"2025-01-08T15:21:29+00:00","mainEntityOfPage":{"@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response"},"wordCount":1651,"commentCount":0,"publisher":{"@id":"https:\/\/uptime.com\/blog\/#organization"},"image":{"@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage"},"thumbnailUrl":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png","keywords":["downtime alerts","downtime response","error budget","error budgeting","improve downtime response"],"articleSection":["Outages","Resources"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#respond"]}]},{"@type":"WebPage","@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response","url":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response","name":"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime","isPartOf":{"@id":"https:\/\/uptime.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage"},"image":{"@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage"},"thumbnailUrl":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png","datePublished":"2020-03-12T21:49:45+00:00","dateModified":"2025-01-08T15:21:29+00:00","description":"Learn how to use the principles of error budgeting to improve downtime response. Uptime.com offers responsive alerting to improve downtime response.","breadcrumb":{"@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/uptime.com\/blog\/how-to-improve-downtime-response"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#primaryimage","url":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png","contentUrl":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2020\/03\/improve-downtime-response-cover.png","width":700,"height":500},{"@type":"BreadcrumbList","@id":"https:\/\/uptime.com\/blog\/how-to-improve-downtime-response#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/uptime.com\/blog"},{"@type":"ListItem","position":2,"name":"How to Improve Downtime Response: Error Budgeting and Unplanned Downtime"}]},{"@type":"WebSite","@id":"https:\/\/uptime.com\/blog\/#website","url":"https:\/\/uptime.com\/blog\/","name":"Uptime.com","description":"","publisher":{"@id":"https:\/\/uptime.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/uptime.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/uptime.com\/blog\/#organization","name":"Uptime.com","url":"https:\/\/uptime.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/uptime.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2022\/10\/Uptime.com_Wordmark_Horizontal_Black_Green.png","contentUrl":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2022\/10\/Uptime.com_Wordmark_Horizontal_Black_Green.png","width":512,"height":83,"caption":"Uptime.com"},"image":{"@id":"https:\/\/uptime.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/uptimemonitor","https:\/\/x.com\/UptimeDotCom","https:\/\/www.linkedin.com\/company\/uptime-com\/"]},{"@type":"Person","@id":"https:\/\/uptime.com\/blog\/#\/schema\/person\/69f0940686123645bc00cbb227cad781","name":"Uptime.com","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg34cb5744a58a7a87dbf8f078ae83ffc5","url":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg","contentUrl":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg","caption":"Uptime.com"},"description":"Uptime.com provides unmatched global visibility and real-time alerts on website, server, and API performance. Monitor downtime, speed, reliability, user experience with customizable status pages, dashboards, and reports.","url":"https:\/\/uptime.com\/blog\/author\/upadmin"}]}},"authors":[{"term_id":169,"user_id":1,"is_guest":0,"slug":"upadmin","display_name":"Uptime.com","avatar_url":{"url":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg","url2x":"https:\/\/uptime.com\/blog\/wp-content\/uploads\/2025\/01\/uptime_com_logo.jpeg"},"0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/posts\/2060","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/comments?post=2060"}],"version-history":[{"count":4,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/posts\/2060\/revisions"}],"predecessor-version":[{"id":6583,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/posts\/2060\/revisions\/6583"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/media\/2066"}],"wp:attachment":[{"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/media?parent=2060"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/categories?post=2060"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/tags?post=2060"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/uptime.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=2060"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}