Understanding and Visualizing Data with Python

University of Michigan via Coursera

Details

Provider

Coursera
Pricing

Free Online Course (Audit)
Languages

English
Certificate

Paid Certificate Available
Duration & workload

19 hours 37 minutes
Sessions

On-Demand
Level

Beginner
Subtitles

Arabic, French, Portuguese, Italian, German, Russian, English, Spanish, Korean, Thai, Indonesian, Kazakh, Hindi, Swedish, Greek, Chinese, Ukrainian, Japanese, Polish, Dutch, Turkish, Hungarian, Bengali, Pashto, Urdu, Azerbaijani, Farsi

Found in

Part of

Statistics with Python

Overview

In this course, learners will be introduced to the field of statistics, including where data come from, study design, data management, and exploring and visualizing data. Learners will identify different types of data, and learn how to visualize, analyze, and interpret summaries for both univariate and multivariate data. Learners will also be introduced to the differences between probability and non-probability sampling from larger populations, the idea of how sample estimates vary, and how inferences can be made about larger populations based on probability sampling. At the end of each week, learners will apply the statistical concepts they’ve learned using Python within the course environment. During these lab-based sessions, learners will discover the different uses of Python as a tool, including the Numpy, Pandas, Statsmodels, Matplotlib, and Seaborn libraries. Tutorial videos are provided to walk learners through the creation of visualizations and data management, all within Python. This course utilizes the Jupyter Notebook environment within Coursera.

Syllabus

WEEK 1 - INTRODUCTION TO DATA

In the first week of the course, we will review a course outline and discover the various concepts and objectives to be mastered in the weeks to come. You will get an introduction to the field of statistics and explore a variety of perspectives the field has to offer. We will identify numerous types of data that exist and observe where they can be found in everyday life. You will delve into basic Python functionality, along with an introduction to Jupyter Notebook. All of the course information on grading, prerequisites, and expectations is on the course syllabus, and you can find more information on our Course Resources page.

WEEK 2 - UNIVARIATE DATA

In the second week of this course, we will be looking at graphical and numerical interpretations for one variable (univariate data). In particular, we will be creating and analyzing histograms, box plots, and numerical summaries as a basis for analyzing quantitative data, and bar charts and pie charts for categorical data. A few key interpretations will be made about our numerical summaries such as mean, IQR, and standard deviation. An assessment is included at the end of the week concerning numerical summaries and interpretations of these summaries.
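
As a rough illustration of the numerical summaries named above (mean, IQR, standard deviation) and of the bin counts behind a histogram, here is a minimal sketch using NumPy. It is not taken from the course materials; the data values and the helper name numerical_summary are made up for this example.

```python
import numpy as np

# Hypothetical sample of a quantitative variable (made-up values, not course data).
values = np.array([2.0, 4.0, 4.0, 5.0, 7.0, 9.0, 11.0])


def numerical_summary(x):
    """Return the mean, sample standard deviation, quartiles, and IQR of x."""
    q1, median, q3 = np.percentile(x, [25, 50, 75])
    return {
        "mean": float(np.mean(x)),
        "std": float(np.std(x, ddof=1)),  # ddof=1 gives the sample standard deviation
        "q1": float(q1),
        "median": float(median),
        "q3": float(q3),
        "iqr": float(q3 - q1),  # interquartile range: spread of the middle 50%
    }


summary = numerical_summary(values)

# The bin counts are the bar heights a histogram of these values would show.
counts, bin_edges = np.histogram(values, bins=3)

print(summary)
print(counts, bin_edges)
```

The quartiles here use NumPy's default linear interpolation; other software may use a different quartile convention and report slightly different values.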

WEEK 3 - MULTIVARIATE DATA

In the third week of this course on looking at data, we’ll introduce key ideas for examining research questions that require looking at more than one variable. In particular, we will consider both numerically and visually how different variables interact, how summaries can appear deceiving if you don’t properly account for interactions, and differences between quantitative and categorical variables. This week’s assignment will consist of a writing assignment along with reviewing those of your peers.
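
One way a summary can appear deceiving is when a pooled statistic hides what happens inside each category. The sketch below is an assumption-laden toy example, not course material: the two groups and their values are invented so that x and y are negatively related within each group but positively related when the groups are combined.

```python
import numpy as np

# Hypothetical data: two groups (a categorical variable), each measured on x and y.
groups = {
    "A": (np.array([1.0, 2.0, 3.0]), np.array([3.0, 2.0, 1.0])),
    "B": (np.array([6.0, 7.0, 8.0]), np.array([8.0, 7.0, 6.0])),
}


def pearson_r(x, y):
    """Pearson correlation between two equal-length arrays."""
    return float(np.corrcoef(x, y)[0, 1])


# Correlation computed separately within each group.
within_r = {name: pearson_r(x, y) for name, (x, y) in groups.items()}

# Correlation computed after pooling the groups and ignoring the category.
all_x = np.concatenate([x for x, _ in groups.values()])
all_y = np.concatenate([y for _, y in groups.values()])
pooled_r = pearson_r(all_x, all_y)

print("within groups:", within_r)
print("pooled:", pooled_r)
```

Within each group the correlation is negative, while the pooled correlation is positive, so the sign of the summary depends on whether the categorical variable is taken into account.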

WEEK 4 - POPULATIONS AND SAMPLES

In this week, you’ll spend more time thinking about where data come from. The highest-quality statistical analyses of data will always incorporate information about the process used to generate the data, or features of the data collection design. You’ll be exposed to important concepts related to sampling from larger populations, including probability and non-probability sampling, and how we can make inferences about larger populations based on well-designed samples. You’ll also learn about the concept of a sampling distribution, and how estimation of the variance of that distribution plays a critical role in making statements about populations. Finally, you’ll learn about the importance of reading the documentation for a given data set; a key step in looking at data is also looking at the available documentation for that data set, which describes how the data were generated.
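
The idea of a sampling distribution can be sketched with a small simulation. This is a hedged illustration rather than the course's own lab: the population is synthetic, and the sample size, number of replicates, and seed are arbitrary choices made for this example.

```python
import numpy as np

rng = np.random.default_rng(seed=0)  # fixed seed so the sketch is reproducible

# Hypothetical population of 10,000 values (synthetic, not course data).
population = rng.normal(loc=50.0, scale=10.0, size=10_000)

n = 50             # size of each simple random sample
replicates = 2000  # number of repeated samples to draw

# Draw many probability samples and record the mean of each one.
sample_means = np.array([
    rng.choice(population, size=n, replace=False).mean()
    for _ in range(replicates)
])

# Spread of the sample means: an empirical view of the sampling distribution.
empirical_se = sample_means.std(ddof=1)

# Approximate standard error from the population spread. This ignores the
# finite population correction, which is small because n is far below 10,000.
theoretical_se = population.std() / np.sqrt(n)

# In practice only one sample is available, so the standard error is
# estimated from that single sample.
one_sample = rng.choice(population, size=n, replace=False)
estimated_se = one_sample.std(ddof=1) / np.sqrt(n)

print(empirical_se, theoretical_se, estimated_se)
```

The three quantities should be close to one another, which is why an estimate of the variance of the sampling distribution from a single well-designed sample can support statements about the population.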

Taught by

Brenda Gunderson, Brady T. West and Kerby Shedden

Reviews

4.7 rating, based on 3 Class Central reviews

4.7 rating at Coursera based on 2594 ratings

Ronny De Winter @RonnyDeWinter

I used this course as a kind of refresher and didn't view all the video lectures. Most of the material was not new to me. The Python notebooks illustrate the concepts well. I particularly liked the peer-reviewed exercise: a study design for a statistical analysis of a pizza restaurant and its competitor.

Kai

This "Understanding and Visualising Data with Python" training offers:

1. lecture videos teaching you concepts

2. graded quizzes

3. a graded assignment where you have to create a survey design

4. Jupyter notebooks with exercises for you to explore statistical concepts in Python

5. walkthrough videos on Jupyter notebook exercises if you need some help to unblock yourself or when you want to understand why certain things were done

The training was a little lengthy but well worth the time. At times, because concepts can be explained in long sentences, you may need to rewind and revisit certain parts of the videos to get the full meaning of what has been explained.

Overall, this training refreshed my understanding of:

1. basic statistical concepts - statistical measures, population, sampling

2. using numpy, matplotlib, seaborn, scipy packages in Jupyter notebooks (which was good because I currently don't code in Python at work)

This training also explained practical ideas such as:

1. stratifying, clustering, why these concepts are important when sampling

2. issues with certain sampling approaches

3. useful ways to turn a non-probability sample into a probability sample, so that the analysis/claims you present would be grounded in a more solid basis.

Points 2 and 3 in the list above were not covered in school or in the statistics texts I read in the past. So, like me, you may get the chance to learn something new to apply to your work.

Anonymous

This course is practical, detailed, and well structured. The video lessons are presented with slides that include many examples and are easy to follow and understand. They also provide a third-party tool for practice (Jupyter Notebook) with clear descriptions. Although you will encounter many different lecturers during the course, it's not a big problem. The course is well worth enrolling in.
