Dataproc Operations
Google Cloud 2018 | Intermediate
- 10 videos | 52m 1s
- Includes Assessment
- Earns a Badge
Dataproc offers several ways to run big data workloads. Examine Dataproc implementations with Spark and Hadoop using Cloud Shell, and get an introduction to BigQuery, PySpark, and the Spark REPL package.
WHAT YOU WILL LEARN
- describe the various Spark and Hadoop processes that can be performed with Dataproc
- recognize the benefits of separating storage and compute services using Cloud Dataproc
- recall the process of monitoring and logging Dataproc jobs
- demonstrate the process of using an SSH tunnel to connect to the master and worker nodes in a cluster
- define the Spark REPL package and how it's used in Linux
- describe the compute and storage processes and the benefits of their separation and the virtualized distribution of Hadoop
- define BigQuery and its benefits for large-scale analytics
- describe the MapReduce programming model
- demonstrate the process of submitting multiple jobs with Dataproc
- recognize the various Dataproc and Cloud Shell job operations and implementations
IN THIS COURSE
- 1. Spark and Hadoop Processes | 3m 5s
- 2. Benefits of Cloud Dataproc | 2m 34s
- 3. Job Monitoring and Logging | 3m 59s
- 4. SSH into Master and Worker Nodes | 6m 36s
- 5. Spark REPL | 3m 20s
- 6. Separation of Compute and Storage | 4m 6s
- 7. BigQuery Features and Capabilities | 3m 53s
- 8. MapReduce with Big Data | 6m 11s
- 9. Job Submission with Cloud Shell | 7m 39s
- 10. Exercise: Dataproc and Cloud Shell Implementations | 10m 38s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft offers you the opportunity to earn a digital badge upon successful completion of some of our courses. Badges can be shared on any social network or business platform.
Digital badges are yours to keep, forever.