SRE Testing Tasks: Software Reliability & Testing

SRE    |    Intermediate
  • 18 Videos | 1h 30m 4s
  • Includes Assessment
  • Earns a Badge
Site reliability engineers (SREs) can use various testing techniques to ensure software operations are as failure-free as possible for a specified time in a specified environment. In this course, you'll explore multiple testing techniques, their purposes, and the tasks involved in their execution. You'll start by examining traditional software testing approaches, such as unit tests, integration tests, and system tests. Next, you'll investigate the components and use cases of various reliability metrics applied to SRE testing, including mean time to failure (MTTF), mean time to recover (MTTR), and mean time between failures (MTBF). Lastly, you'll outline several software testing approaches, such as stress, configuration, integration, acceptance, production, and canary testing, among others. You'll identify when, how, and by whom each of these testing types is carried out.


  • discover the key concepts covered in this course
  • outline what's involved in reliability testing and describe testing techniques, such as unit, integration, system, production, stress, and rollouts entangle tests
  • list standard factors that can influence software reliability
  • describe why SREs might carry out reliability testing
  • name and describe some common SRE metrics
  • describe the features and benefits of the mean time to failure (MTTF) metric and outline how to use it in SRE work
  • define the mean time to respond (MTTR) metric and describe why it might be used in SRE
  • define the mean time to resolve (MTTR) metric and outline when and how to use it for SRE work
  • define the mean time between failures (MTBF) metric and outline when and how to use it for SRE work
  • describe what's involved in software unit testing for SRE work, including when it's performed, who performs it, and the tasks involved
  • define integration testing as it applies to SRE, list three associated method types, and outline how to perform an integration test, detailing the tasks involved
  • outline what's involved in system testing in SRE, when it is performed, and who performs it
  • outline what's involved in acceptance testing for SRE, when it's typically performed, and who performs it
  • outline what's involved in production testing for SRE and recognize its purpose
  • outline how to carry out configuration testing in SRE work and name the prerequisites and objectives of this type of testing
  • describe how and when to perform a stress test for SRE work
  • define a canary test and outline what's involved in carrying out these types of tests in SRE work
  • summarize the key concepts covered in this course
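
The reliability metrics named in the objectives above can be illustrated with a small worked example. The sketch below is not taken from the course. It uses the commonly cited textbook definitions (MTTR as average downtime per incident, MTBF as average uptime between failures of a repairable system, MTTF as average lifetime of non-repairable units). The Incident type, the function names, and the sample figures are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Incident:
    """Hypothetical incident record; hours are measured from the start of observation."""
    start_hour: float     # when the failure began
    resolved_hour: float  # when service was restored


def total_downtime(incidents: List[Incident]) -> float:
    # Sum of the outage durations across all incidents.
    return sum(i.resolved_hour - i.start_hour for i in incidents)


def mean_time_to_recover(incidents: List[Incident]) -> float:
    # MTTR (recover/resolve reading): average downtime per incident.
    return total_downtime(incidents) / len(incidents)


def mean_time_between_failures(incidents: List[Incident], observation_hours: float) -> float:
    # MTBF: total uptime in the observation window divided by the number of failures.
    uptime = observation_hours - total_downtime(incidents)
    return uptime / len(incidents)


def mean_time_to_failure(lifetimes_hours: List[float]) -> float:
    # MTTF: average operating time until failure for units that are replaced, not repaired.
    return sum(lifetimes_hours) / len(lifetimes_hours)


# Made-up sample data: three outages in a 720-hour (30-day) window.
incidents = [Incident(100, 102), Incident(400, 401), Incident(650, 653)]
observation_hours = 720.0
# Made-up lifetimes of three non-repairable components, in hours.
lifetimes = [1000.0, 1400.0, 1200.0]

mttr = mean_time_to_recover(incidents)                           # 6 h downtime / 3 incidents
mtbf = mean_time_between_failures(incidents, observation_hours)  # 714 h uptime / 3 failures
mttf = mean_time_to_failure(lifetimes)                           # 3600 h / 3 units
print(f"MTTR={mttr} h, MTBF={mtbf} h, MTTF={mttf} h")
```

Note that "mean time to respond" is a different reading of MTTR: it would measure the time from alert to the start of remediation, which requires a timestamp this sketch does not record.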


  • Course Overview (1m 32s)
  • Reliability Testing Techniques (6m 50s)
  • Influential Software Reliability Factors (4m 29s)
  • SRE Reliability Testing Use Cases (3m 43s)
  • Standard SRE Metrics (3m 2s)
  • The Mean Time to Failure Metric and SRE (4m 6s)
  • The Mean Time to Respond Metric and SRE (4m 25s)
  • The Mean Time to Resolve Metric and SRE (4m 17s)
  • The Mean Time Between Failures Metric and SRE (3m 41s)
  • SRE and Software Unit Testing (7m 34s)
  • SRE and Integration Testing (7m 48s)
  • SRE and System Testing (3m 57s)
  • SRE and Acceptance Testing (4m 27s)
  • SRE and Production Testing
  • SRE and Configuration Testing (7m 1s)
  • SRE and Stress Testing (4m 15s)