Hadoop for Data Science Tips, Tricks, & Techniques

via LinkedIn Learning

Overview

Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.

Syllabus

Introduction
  • Welcome
  • What you should know
  • Exercise files
  • Environment setup
1. Working with Files
  • Organize files in HDFS
  • Upload files to HDFS
  • Move files in HDFS
  • Remove files in HDFS
2. Connecting to Hadoop
  • Explore Hive through Beeline
  • Access Hive from Python
  • Create aggregates in Hive
  • Select partitions in Hive
3. Complex Data Structures in Hive
  • Map data in Hive
  • Arrays in Hive
  • Structs in Hive
  • Create flat tables for Impala
  • Deconstruct Impala queries
Conclusion
  • Next steps

Taught by

Ben Sullins
