Final Exam: Big Data Infrastructures

Big Data 2021    |    Beginner
  • 1 Video | 10m 32s
  • Includes Assessment
  • Earns a Badge
Final Exam: Big Data Infrastructures will test your knowledge and application of the topics presented throughout the Big Data Infrastructures track of the Skillsoft Aspire Data for Leaders and Decision Makers Journey.


  • define the big 7 characteristics of Big Data
  • define the role of the data processing layer and specify how information captured in the previous layer is processed
  • describe graph database use cases and specify why the relationships between data are as important as the data itself in a graph database
  • describe Spark and how it offers open-source, scalable, massively parallel, in-memory solutions for analytics applications
  • describe the challenges in current data analytics models and system designs, such as scalability, consistency, reliability, efficiency, and maintainability
  • describe the concept of Big Data and the history behind it
  • describe the difference between horizontal and vertical scaling
  • describe the role of NoSQL databases in the horizontal distribution of large structured and unstructured data
  • describe the subcomponents of Hadoop, such as MapReduce and HDFS
  • describe what horizontal scaling is and specify how it eliminates the need for adding more memory to existing machines by using clusters (also known as sharding)
  • identify the sources that are capable of generating Big Data
  • list the main characteristics of Spark, such as loading behavior, file formats, parallelism, caching, and data skew
  • name and describe the features of storage systems such as HDFS, S3 and object stores, Elasticsearch and Apache Solr, Kudu, and CockroachDB
  • name and describe the four types of Big Data analytics (i.e., prescriptive, predictive, diagnostic, and descriptive)
  • name and describe the role of the main layers of Big Data analytics, from the bottom to the top
  • name the most important performance optimization techniques, such as file format selection, level of parallelism, and API selection
  • recognize the need for Big Data
  • specify the shortcomings of distributed systems and why these shortcomings make Big Data even more important
  • specify use cases, benefits, and challenges of popular key-value data stores
  • specify when to use a NoSQL database and when to use a SQL database


  • Big Data Infrastructures


Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.