In this hands-on lab you will explore using Google Cloud Kubernetes Engine and Kubeflow TFJob to scale out TensorFlow distributed training.
Overview
Syllabus
- GSP775
- Overview
- Setup and requirements
- Lab tasks
- Task 1. Creating a GKE cluster
- Task 2. Deploying
- Task 3. Creating a Cloud Storage bucket
- Task 4. Preparing TFJob
- Task 5. Submitting the TFJob
- Task 6. Monitoring the TFJob
- Congratulations