In the third course of the Practical Data Science Specialization, you will learn a series of performance-improvement and cost-reduction techniques to automatically tune model accuracy, compare prediction performance, and generate new training data with human intelligence. After tuning your text classifier using Amazon SageMaker Hyper-parameter Tuning (HPT), you will deploy two model candidates into an A/B test to compare their real-time prediction performance and automatically scale the winning model using Amazon SageMaker Hosting. Lastly, you will set up a human-in-the-loop pipeline to fix misclassified predictions and generate new training data using Amazon Augmented AI and Amazon SageMaker Ground Truth.
Practical data science is geared towards handling massive datasets that do not fit in your local hardware and could originate from multiple sources. One of the biggest benefits of developing and running data science projects in the cloud is the agility and elasticity that the cloud offers to scale up and out at a minimum cost.
The Practical Data Science Specialization helps you develop the practical skills to effectively deploy your data science projects and overcome challenges at each step of the ML workflow using Amazon SageMaker. This Specialization is designed for data-focused developers, scientists, and analysts familiar with the Python and SQL programming languages and want to learn how to build, train, and deploy scalable, end-to-end ML pipelines - both automated and human-in-the-loop - in the AWS cloud.
Week 1: Advanced model training, tuning and evaluation
Train, tune, and evaluate models using data-parallel and model-parallel strategies and automatic model tuning.
Week 2: Advanced model deployment and monitoring
Deploy models with A/B testing, monitor model performance, and detect drift from baseline metrics.
Week 3: Data labeling and human-in-the-loop pipelines
Label data at scale using private human workforces and build human-in-the-loop pipelines.
Antje Barth, Shelbee Eigenbrode, Sireesha Muppala and Chris Fregly
In this final course of the Practical data science specialisation, you learn advanced techniques for model training, tuning, evaluation, deployment, monitoring, data labelling and human-in-the-loop pipelines, both from a conceptual perspective as with practical exercises on Amazon SageMaker and related tools.
An indispensable course for any data scientist who wants to take advantage of the AWS tools to be more effective and efficient in his job.