Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Amazon Web Services

Exploring Google Ngrams with Amazon EMR and Hive

Amazon Web Services and Amazon via AWS Skill Builder

Overview

Languages Available: Español (Latinoamérica) | Español (España) | Français | Bahasa Indonesia | Italiano | 日本語 | 한국어 | Português (Brasil) | 中文(简体)

This lab demonstrates how to launch an Amazon Elastic MapReduce (EMR) cluster for Big Data processing and use Hive with SQL-style queries to analyze data. You will create a Hadoop cluster using Amazon EMR which will allow to run interactive Hive queries against data stored in Amazon S3. You will use Hive to normalize the data in a more useful way, and you will run queries to analyze the data.


Level

Advanced


Duration

1 Hours 15 Minutes


Course Objectives

In this course, you will learn how to:

  • Create an Amazon EMR cluster running Hive
  • Use Hive statements to create tables from Google Ngram input data stored in Amazon S3
  • Run Hive queries to drill-down and analyze data


Intended Audience

This course is intended for:

  • Architects
  • Data Engineers


Prerequisites

We recommend that attendees of this course have the following prerequisites:

  • None


Course Outline

  • Task 1: Launch an Amazon EMR cluster
  • Task 2: Connect to Your Cluster
  • Task 3: Analyze Data

Reviews

Start your review of Exploring Google Ngrams with Amazon EMR and Hive

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.