Raw Data to Insights: Data Ingestion & Statistical Analysis

Data Science
  • 10 Videos | 57m 30s
  • Includes Assessment
  • Earns a Badge
Likes 28 Likes 28
Explore how statistical analysis can turn raw data into insights, and then examine how to use the data to improve business intelligence, in this 10-video course. Learn how to scrutinize and perform analytics on the collected data. The course explores several approaches for identifying values and insights from data by using various standard and intuitive principles, including data exploration and data ingestion, along with the practical implementation by using R. First, you will learn how to detect outliers by using R, and how to compare simple linear regression models, with and without outliers, to improve the quality of the data. Because today's data are available in diversified formats, with large volume and high velocity, this course next demonstrates how to use a variety of technologies: Apache Kafka, Apache NiFi, Apache Sqoop, and Wavefront (a program for simulating two-dimensional acoustic systems) to ingest data. Finally, you will learn how these tools can help users in data extraction, scalability, integration support, and security.

WHAT YOU WILL LEARN

  • describe how we can use statistical analysis to add value to data
    recorgnize the concept of data correction along with the various essential approaches of implementing data correction which includes data detection localization, imputation and correction
    demonstrate how we can facilitate outlier detection using R
    describe the layered architecture of data from the perspective of data ingestion, prcoessing, and visualization
    list and compare the various essential data ingestion tools that we can use to ingest data
  • set up Kafka and Apache NiFi to ingest data
    demonstrate the steps involved in ingesting data from databases to Hadoop clusters using Sqoop
    demonstrate how we can ingest data using WaveFront
    detect outliers using R and ingest data using Apache NiFi and WaveFront

IN THIS COURSE

  • Playable
    1. 
    Course Overview
    1m 35s
    UP NEXT
  • Playable
    2. 
    Statistical Analysis
    7m 38s
  • Locked
    3. 
    Data Correction
    6m 34s
  • Locked
    4. 
    Outlier Detection
    5m 6s
  • Locked
    5. 
    Data Architecture Pattern
    4m 52s
  • Locked
    6. 
    Data Ingestion Tools
    4m 30s
  • Locked
    7. 
    Kafka and Apache NiFi
    10m 32s
  • Locked
    8. 
    Apache Sqoop Ingest
    5m 9s
  • Locked
    9. 
    Ingest Using WaveFront
    3m 10s
  • Locked
    10. 
    Exercise: Detecting Outliers and Ingesting Data
    4m 24s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform

Digital badges are yours to keep, forever.

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Likes 15 Likes 15  
Likes 214 Likes 214