Handling Streaming Data with GCP Dataflow

via Pluralsight

Overview

Dataflow is a serverless, fully managed service on Google Cloud Platform (GCP) for batch and stream processing.

Dataflow lets developers process and transform data using intuitive APIs. It is built on the Apache Beam programming model, which unifies batch and stream processing of data. In this course, Handling Streaming Data with GCP Dataflow, you will discover that GCP provides a wide range of connectors to integrate the Dataflow service with other GCP services, such as the Pub/Sub messaging service and the BigQuery data warehouse. First, you will see how to integrate your Dataflow pipelines with other services, using them as a source of streaming data or as a sink for your final results. Next, you will stream live Twitter feeds to the Pub/Sub messaging service and implement a pipeline to read and process these Twitter messages. Finally, you will implement pipelines with a side input, as well as branching pipelines that write your final results to multiple sinks. When you are finished with this course, you will have the skills and knowledge to design complex Dataflow pipelines, integrate these pipelines with other Google services, and test and run them on Google Cloud Platform.

Taught by

Janani Ravi
