Final Exam: Data Scientist

Intermediate
  • 1 Video | 32s
  • Includes Assessment
  • Earns a Badge
Final Exam: Data Scientist will test your knowledge and application of the topics presented throughout the Data Scientist track of the Skillsoft Aspire Data Analyst to Data Scientist Journey.

WHAT YOU WILL LEARN

  • load data from databases using R
  • implement Dask arrays in order to manage NumPy APIs
  • list Dask task scheduling and big data collection features
  • demonstrate the steps involved in ingesting data from databases to Hadoop clusters using Sqoop
  • list and compare the various essential data ingestion tools that we can use to ingest data
  • demonstrate how we can ingest data using WaveFront
  • describe the various essential distributed data management frameworks used to handle big data
  • define the concept of storyboarding along with the prominent storyboarding templates that we can use to implement storyboarding
  • compare the different types of Recommendation Engines and how they can be used to solve different recommendation problems
  • identify different cloud data sources available
  • create an R function that finds similar users and finds products they liked which would be good to recommend to the user
  • build heat maps and scatter plots using R
  • describe the Gestalt principles of visual perception
  • use Pandas ML to explore a dataset where the samples are not evenly distributed across the target classes
  • describe how regression works by finding the best fit straight line to model the relationships in your data
  • describe the process involved in learning a relationship between input and output during the training phase of machine learning
  • use Pandas and Seaborn to visualize the correlated fields in a dataset
  • combine the use of oversampling and PCA in building a classification model
  • recognize how to enable data-driven decision making
  • can be leveraged to extract value from big data
  • implement Python Luigi in order to set up data pipelines
  • organize your dashboard by adding objects and adjusting the layout
  • identify libraries that can be used in Python to implement data visualization
  • share your dashboard with others
  • implement point and interval estimation using R
  • create an HTTP server using hapi.js
  • compare the differences between descriptive and inferential statistical analysis
  • list libraries that can be used in Python to implement data visualization
  • describe the concept of serverless computing and its benefits
  • describe what truncated data is and how to remove it using Azure Automation
  • recognize the impact of implementing containerization on cloud hosting environments
  • demonstrate how to craft visual data using Tableau
  • recognize the problems associated with a model that is overfitted to training data and how to mitigate the issue
  • use the scikit-learn library to build and train a LinearSVC classification model and then evaluate its performance using the available model evaluation functions
  • work with vectors and metrics using Python and R
  • recall cloud migration models from the perspective of architectural preferences
  • demonstrate how to create a stacked bar plot
  • describe the aspects of data quality
  • demonstrate how to implement different types of bar charts using PowerBI
  • create histograms, scatter plots, and box plots using Python libraries
  • define a port
  • recognize the data pipeline building capabilities provided by Kafka, Spark, and PySpark
  • build and customize graphs using ggplot2 in R
  • add extensions to your dashboard such as Tableau Extensions API
  • implement data exploration using plots in R
  • recall the various essential decluttering steps and approaches that we can implement to eliminate clutter
  • build backup and restore mechanisms in the cloud
  • describe blockchain
  • recognize the impact of implementing Kubernetes and Docker in the cloud
  • demonstrate how to implement data exploration using R
  • implement correlograms and build area charts using R
  • use R to import, filter, and massage data into data sets
  • Linear regression
  • use modules in your API using node.js
  • describe how the four Vs should be balanced in order to implement a successful big data strategy
  • install and prepare R for data exploration
  • specify volume in big data analytics and its role in the principle of the four Vs
  • integrate Spark and Tableau to manage data pipelines
  • implement missing values and outliers using Python
  • identify the process and approaches involved in storytelling with data

IN THIS COURSE

  • 1. Data Scientist | 33s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.