SRE Metric Management: Software Reliability Monitoring and Reporting
SRE
| Intermediate
- 17 Videos | 1h 17m 47s
- Includes Assessment
- Earns a Badge
Once SRE metrics have been identified, site reliability engineers (SREs) must know how to perform fault analysis on a system, classify defects, and monitor and report data. In this course, you'll explore the tools and best practices for carrying out these procedures. You'll begin by identifying various fault analysis methods and tools. You'll then classify software defects and bugs with a focus on severity and priority. Next, you'll investigate strategies for monitoring APIs and explore some tools used for this task. You'll then examine in detail several tools for collecting, analyzing, and reporting metric data using a customizable dashboard, including those that comprise the ELK Stack - Elasticsearch, Logstash, and Kibana. Furthermore, you'll explore the data collection tool Beats and the beneficial use cases for Elasticsearch notifications.
WHAT YOU WILL LEARN
-
discover the key concepts covered in this courseoutline various methods for analyzing the effects of faults in a systemoutline how to use fault tree analysis to determine the cause of faults in a systemname the tools that can be used to perform fault tree analysisoutline how to classify software defectsdescribe the various types of software bugs and recognize why they occurdifferentiate between the severity and priority of software bugsoutline best practice when defining API monitoring strategiesstate the key characteristics of API monitoring strategies
-
list API monitoring tools and their strengths and weaknessesidentify the components of the ELK Stack and how they work together for data reportingdescribe the features and benefits of Elasticsearch for storing log datadescribe the features and benefits of Kibana for viewing datadescribe the features and benefits of Beats for data collectiondescribe the features and benefits of Logstash for data processingoutline how to use Elasticsearch notifications to notify staff when API services have issuessummarize the key concepts covered in this course
IN THIS COURSE
-
1.Course Overview1m 26sUP NEXT
-
2.System Fault Analysis5m 32s
-
3.Fault Tree Analysis6m 11s
-
4.Fault Tree Analysis Tools2m 38s
-
5.Software Defect Classification4m 25s
-
6.Software Bug Types4m 50s
-
7.Software Bug Severity vs. Priority4m 37s
-
8.Cloud API Monitoring6m 30s
-
9.API Monitoring Strategies5m 43s
-
10.API Monitoring Tools4m 54s
-
11.ELK Stack Reporting4m 33s
-
12.Elasticsearch for Log Data Management5m 10s
-
13.Kibana for Data Visualization3m 59s
-
14.Beats for Data Collection5m 43s
-
15.Logstash for Data Processing5m 33s
-
16.Using Elasticsearch Notifications5m 1s
-
17.Course Summary1m 2s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.