In this lab, you will learn how to use Apache Spark on Cloud Dataproc to distribute a computationally intensive image processing task across a cluster of machines.
Syllabus
- GSP010
- Overview
- Setup
- Introduction
- Task 1. Create a development machine in Compute Engine
- Task 2. Install software
- Task 3. Create a Cloud Storage bucket and collect images
- Task 4. Create a Cloud Dataproc cluster
- Task 5. Submit your job to Cloud Dataproc
- Task 6. Test your understanding
- Congratulations!