Data Pipeline: Using Frameworks for Advanced Data Management
Data Pipeline
| Intermediate
- 10 Videos | 32m 1s
- Includes Assessment
- Earns a Badge
Discover how to implement data pipelines using Python Luigi, integrate Spark and Tableau to manage data pipelines, use Dask arrays, and build data pipeline visualization with Python in this 10-video course. Begin by learning about features of Celery and Luigi that can be used to set up data pipelines, then how to implement Python Luigi to set up data pipelines. Next, turn to working with Dask library, after listing the essential features provided by Dask from the perspective of task scheduling and big data collections. Learn about implementation of Dask arrays to manage NumPy application programming interfaces (APIs). Explore frameworks that can be used to implement data exploration and visualization in data pipelines. Integrate Spark and Tableau to manage data pipelines. Move on to streaming data visualization with Python, using Python to build visualizations for streaming data. Then learn about the data pipeline building capabilities provided by Kafka, Spark, and PySpark. The concluding exercise involves setting up Luigi to implement data pipelines, Spark and Tableau integration, and building pipelines with Python.
WHAT YOU WILL LEARN
-
recognize the features of Celery and Luigi that can be used to set up data pipelinesimplement Python Luigi in order to set up data pipelineslist Dask task scheduling and big data collection featuresimplement Dask arrays in order to manage NumPy APIslist frameworks that can be used to implement data exploration and visualization in data pipelines
-
integrate Spark and Tableau to manage data pipelinesuse Python to build visualizations for streaming datarecognize the data pipeline building capabilities provided by Kafka, Spark, and PySparkset up Luigi to implement data pipelines, integrate Spark and Tableau for data pipeline management, and build visualizations for data pipelines using Python
IN THIS COURSE
-
1.Course Overview1m 34sUP NEXT
-
2.Celery and Luigi3m 45s
-
3.Data Pipeline with Python Luigi3m 38s
-
4.Working with Dask Library3m 11s
-
5.Dask Arrays3m 59s
-
6.Data Exploration and Visualization Frameworks3m 46s
-
7.Spark and Tableau2m 26s
-
8.Streaming Data Visualization with Python2m 51s
-
9.Data Pipeline Open Source Tools3m 45s
-
10.Exercise: Implement Data Pipelines with Luigi3m 7s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion of this course, which can be shared on any social network or business platform
Digital badges are yours to keep, forever.