This introductory course is for SAS software users who perform statistical analyses using SAS/STAT software. The focus is on t tests, ANOVA, and linear regression, and includes a brief introduction to logistic regression.
Course Overview (Review from Introduction to Statistics: Hypothesis Testing)
In this module you learn about the course and the data you analyze in this course. Then you set up the data you need to do the practices in the course.
Model Building and Effect Selection
In this module you explore several tools for model selection. These tools help limit the number of candidate models so that you can choose an appropriate model that's based on your expertise and research priorities.
Model Post-Fitting for Inference
In this module you learn to verify the assumptions of the model and diagnose problems that you encounter in linear regression. You learn to examine residuals, identify outliers that are numerically distant from the bulk of the data, and identify influential observations that unduly affect the regression model. Finally, you learn to diagnose collinearity to avoid inflated standard errors and parameter instability in the model.
Model Building for Scoring and Prediction
In this module you learn how to transition from inferential statistics to predictive modeling. Instead of using p-values, you learn about assessing models using honest assessment. After you choose the best performing model, you learn about ways to deploy the model to predict new data.
Categorical Data Analysis
In this module you look for associations between predictors and a binary response using hypothesis tests. Then you build a logistic regression model and learn about how to characterize the relationship between the response and predictors. Finally, you learn how to use logistic regression to build a model, or classifier, to predict unknown cases.