Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

LinkedIn Learning

Data Wrangling in R (2017)

via LinkedIn Learning

Overview

Learn about the principles of tidy data, and discover how to create and manipulate data tibbles—transforming them from source data into tidy formats.

Syllabus

Introduction
  • Welcome
  • What you need to know
  • Using the exercise files
1. Tidy Data
  • What is tidy data?
  • Variables, observations, and values
  • Common data problems
  • Using the tidyverse
2. Working with Tibbles
  • Building and printing tibbles
  • Subsetting tibbles
  • Filtering tibbles
3. Importing Data into R
  • What are CSV files?
  • Importing CSV files into R
  • What are TSV files?
  • Importing TSV files into R
  • Importing delimited files into R
  • Importing fixed-width files into R
  • Importing Excel files into R
  • Reading data from databases and the web
4. Data Transformation
  • Wide vs. long datasets
  • Making wide datasets long with gather()
  • Making long datasets wide with spread()
  • Converting data types in R
  • Working with dates and times in R
5. Data Cleaning
  • Detecting outliers
  • Missing and special values in R
  • Breaking apart columns with separate()
  • Combining columns with unite()
  • Manipulating strings in R with stringr
6. Data Wrangling Case Study: Coal Consumption
  • Understanding the coal dataset
  • Reading in the coal dataset
  • Converting the coal dataset from long to wide
  • Segmenting the coal dataset
  • Visualizing the coal dataset
7. Data Wrangling Case Study: Water Quality
  • Understanding the water quality dataset
  • Reading in the water quality dataset
  • Filtering the water quality dataset
  • Water quality data types
  • Correcting data entry errors
  • Identifying and removing outliers
  • Converting temperature from Fahrenheit to Celsius
  • Widening the water quality dataset
8. Data Wrangling Case Study: Social Security Disability Claims
  • Understanding the Social Security Disability dataset
  • Importing the Social Security Disability dataset
  • Making the Social Security Disability dataset long
  • Formatting dates in the Social Security Disability dataset
  • Handling fiscal years in the Social Security Disability dataset
  • Widening the Social Security Disability dataset
  • Visualizing the Social Security Disability dataset
Conclusion
  • Next steps

Taught by

Mike Chapple

Reviews

Start your review of Data Wrangling in R (2017)

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.