Site Reliability Engineering: Scenario Planning

SRE    |    Intermediate
  • 21 videos | 1h 11m 11s
  • Includes Assessment
  • Earns a Badge
Scenario planning helps site reliability engineers strategically prepare for uncertainties that may disrupt or negatively affect services. In this course, you'll explore scenario planning use cases and the strategies utilized to prepare for disasters. You'll examine the functions of Disaster Recovery Testing (DiRT) and Customer Reliability Engineering teams, which help manage the impact of a disaster or disruption. Next, you'll identify disaster recovery testing events and recognize how to plan and design tests for DiRT. You'll move on to describe the production incident lifecycle and how to minimize production incidents. You'll identify unmanaged responses, how to rectify untrained responses, and the activities used to train response teams. Finally, you'll examine how to test people and how they self-organize and interact using various role-playing and test scenarios.


  • discover the key concepts covered in this course
  • define scenario planning and identify why it should be part of your strategic plan
  • describe how to use scenario planning and how to create scenarios
  • recognize considerations when scenario planning for a disaster
  • identify potential scenarios to test and prepare for, such as the loss of technical infrastructure or environmental issues
  • list common data-related disaster recovery scenarios to plan for
  • list common applications-related disaster recovery scenarios to plan for
  • provide an overview of disaster recovery testing events and how they can help identify vulnerabilities in critical systems
  • list what to test when designing tests for DiRT
  • recognize how to minimize the potential damage of disruptive DiRT tests
  • provide an overview of the DiRT technical team and the coordination team
  • list common components of a DiRT test plan and how creating a template is useful for future test plan proposals
  • outline the functions of a Customer Reliability Engineering team and their role in scenario planning
  • outline the production incident lifecycle and how to lay the foundations to shrink production incidents
  • provide an overview of unmanaged responses
  • describe how to rectify untrained responses
  • recognize hands-on activities used to train response teams
  • describe how DiRT exercises should also test how people organize themselves and interact with each other
  • provide an overview of the "Wheel of Misfortune" role-playing scenario
  • provide an overview of the Dungeon/Scenario Master and their role in running a test scenario
  • summarize the key concepts covered in this course


  • 1m 50s
  • 2m 4s
    In this video, you'll learn more about the concept of scenario planning and how to make it part of your organization's decision-making as you move forward. You'll learn that by identifying which predictions are most likely to occur, you can be better prepared if and when those situations actually develop.
  • 3.  Implementing Scenario Planning
    3m 9s
    In this video, you'll learn how to use scenario planning and how to create your own scenarios. You'll learn that formulating what you consider the four most likely scenarios for your organization is challenging, but most organizations can determine which are more likely than others. You'll also learn that focusing on two of the driving forces is key when developing scenarios and identifying their impact and implications.
  • 4.  Disaster Recovery Scenario Planning
    4m 45s
    In this video, you'll learn more about planning for disaster in your organization. You'll learn that when it comes to planning out scenarios for your organization, one of the most important considerations is planning for disaster, meaning anything that can cause an interruption to your business. You'll examine some considerations to help ensure you're able to recover your operations with minimal disruption.
  • 5.  Disaster Recovery Testing Scenarios
    5m 8s
    In this video, you'll learn more about planning out disaster recovery scenarios. You'll discover there can be many different situations that constitute a disaster. It's important to try to identify scenarios that are most likely to occur, but it's equally important to test your strategies for dealing with each of them. This video explores some common disaster scenarios and possible mitigations that can be tested.
  • 6.  Disaster Recovery Scenarios for Data
    4m 17s
    In this video, you'll learn more about the considerations needed when formulating disaster recovery scenarios that are specific to ensuring the ability to recover your data. You'll learn that the term data itself usually refers to anything stored in an unstructured manner, such as documents. Most environments also have structured data, such as relational databases. In either case, the first step is to implement backups of both your data and your databases.
  • 7.  Disaster Recovery Scenarios for Applications
    8m 59s
    In this video, you'll learn more about considerations when formulating a disaster recovery plan with respect to your applications. You'll learn there are common examples such as batch processes, ecommerce websites, and video streaming. These illustrate several different levels of priority when it comes to how urgent their recovery might be.
  • 8.  Disaster Recovery Testing
    3m 57s
    In this video, you'll learn more about Disaster Recovery Testing, or DiRT. You'll discover that while an organization may feel it is prepared for disasters, this can't really be known until disaster strikes. Instead of waiting for an actual disaster, simulations can be run to test how effective your strategies are. This video outlines the process of Disaster Recovery Testing and what it involves.
  • 9.  DiRT Testing Components
    2m 5s
    In this video, you'll learn more about specific examples of what to test for when designing disaster recovery tests. You'll learn these can help shed light on the possible scenarios that affect organizations. Initially, you might look at simpler cases involving service-specific testing, such as ensuring any given service has fault tolerance configured.
  • 10.  DiRT Testing Impact
    3m 7s
  • 11.  DiRT Team
    2m 5s
    In this video, you'll learn more about the two core teams that should be involved with disaster recovery testing. You'll learn about a technical team and a coordination team. The technical team is responsible for the initial design of all tests, as well as evaluating them to determine their effectiveness and the impact on the target systems. Once any given test is deemed ready for implementation, the technical team will also be responsible for monitoring during execution.
  • 12.  DiRT Test Plan Scenario Template
    3m 57s
    In this video, you'll learn more about disaster recovery test components and how to create a test plan proposal. You'll discover the proposal should begin by outlining the general information of the test, and that the same template should be used for all tests so the documentation is always consistent and easily understood by all parties involved. Watch this video to learn more about the specific components covered in this demo.
  • 13.  Customer Reliability Engineering Team
    4m 28s
    In this video, you'll learn more about the Customer Reliability Engineer, or CRE. Like the site reliability engineer, the CRE is an extension of user support, particularly with respect to the adoption of cloud services. Many organizations are still hesitant to adopt cloud services, primarily due to the loss of control over hardware and infrastructure, applications, and even data. The CRE exists to address this anxiety.
  • 14.  Production Incidents
    4m 41s
  • 15.  Unmanaged Responses
    2m 14s
  • 16.  Untrained Responses
  • 17.  Response Teams
    4m 8s
    In this video, you'll learn more about the methods that are generally the most effective when it comes to providing training for your incident response teams. You'll learn that there is value in book learning, video learning, or classroom learning, but none of these are as effective as hands-on experience where failure is concerned. It's a matter of pattern recognition here for the trainers.
  • 18.  Human Learning
    2m 21s
  • 19.  Wheel of Misfortune
    2m 18s
    In this video, you'll learn more about a training method known as the Wheel of Misfortune. This is a role-playing scenario that uses simulated emergencies to test the responses of your teams or those who are in training. You'll learn that there's no specific response strategy in place because everything is done in a low-risk environment.
  • 20.  Scenario Master
    2m 17s
    In this video, you'll learn more about the role of the Scenario Master, sometimes also referred to as the Dungeon Master, when using the Wheel of Misfortune or other scenario testing methods for training your disaster response teams. You'll learn that there are certain requirements when it comes to filling the Dungeon Master role. If you think of the scenario as something similar to a play, the Dungeon Master can be thought of as the director.
  • 21.  Course Summary
    1m 23s
    In this video, you'll summarize what you've learned in the course. You've discovered the use of scenario planning and strategies to minimize and manage disaster or disruption impact. You also learned about the benefits of disaster recovery testing and Customer Reliability Engineering teams. You explored scenario planning, why and how to use it, and how to create your own scenarios.


Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.


