![](https://ccweb.imgix.net/https%3A%2F%2Fwww.classcentral.com%2Fimages%2Ficon-black-friday.png?auto=format&ixlib=php-4.1.0&s=fe56b83c82babb2f8fce47a2aed2f85d)
Limited-Time Offer: Up to 75% Off Coursera Plus!
7000+ certificate courses from Google, Microsoft, IBM, and many more.- Module 1: Learn about SRE, an engineering discipline that helps you sustainably achieve the appropriate level of reliability in your systems, services, and products.
In this module you will:
- Gain a basic understanding of Site Reliability Engineering (SRE)
- Learn how to get started with this valuable operations practice
- Module 2: Respond to incidents and activities in your infrastructure through alerting capabilities in Azure Monitor.
In this module, you'll:
- Configure alerts on events in your Azure resources based on metrics, log events, and activity log events.
- Learn how to use action groups in response to an alert, and how to use alert processing rules to override action groups when necessary.
- Module 3: Learn about how to capture trace output from your Azure web apps. View a live log stream and download logs files for offline analysis.
In this module, you will:
- Enable application logging on an Azure Web App
- View live application logging activity with the log streaming service
- Retrieve application log files from an application with Kudu or the Azure CLI
- Module 4: Learn how to manage site reliability.
After completing this module, you'll be able to:
- Describe how site reliability engineering (SRE) empowers software developers to own the ongoing daily operation of their applications in production.
- Describe how Application Insights analyzes the performance of your web application and can warn you about potential problems.
- List the processes that you can implement to monitor site reliability.
- Build a "just culture" that balances safety and accountability.
- Module 5: Cloud Admin course from Dr. Majd Sakr at Carnegie Mellon University. Discover what cloud elasticity means and different ways to scale your cloud resources.
In this module you will:
- Describe common load patterns and how they drive the need to scale
- Enumerate the strategies and considerations in scaling cloud applications
- Discuss the advantages of auto-scaling and the mechanisms used to achieve it
- Describe the importance of load balancing in cloud applications and enumerate various methods to achieve it
- List the primary benefits of serverless computing and explain the concept of serverless functions
This content is provided in partnership with Dr. Majd Sakr and Carnegie Mellon University.
- Module 6: Carnegie Mellon University's Cloud Developer course. Learn how developers write programs that run on the cloud, including how to deploy, be fault-tolerant, load balance, scale, and deal with latency.
In this module, you will:
- Evaluate different considerations when programming applications that run on clouds
- Evaluate different considerations when deploying applications on clouds
- Compare and contrast proactive and reactive measures for fault tolerance in cloud applications
- Describe the importance of load balancing in cloud applications and enumerate various methods to achieve it
- Enumerate the strategies and considerations in scaling cloud applications
- Motivate the case for minimizing tail latency and discuss the various strategies to reduce tail latency
- Describe the strategies to optimize total operational cost of using cloud services
In partnership with Dr. Majd Sakr and Carnegie Mellon University.
- Module 7: Learn how to troubleshoot inbound network connectivity for Azure Load Balancer.
In this module, you will:
- Identify common Azure Load Balancer inbound connectivity issues.
- Identify steps to resolve issues when virtual machines aren't responding to health probe.
- Module 8: Learn how to monitor the health of your Azure VMs by using Azure Metrics Explorer and metric alerts.
In this module, you will:
- Identify metrics and diagnostic data that you can collect for virtual machines
- Configure monitoring for a virtual machine
- Use monitoring data to diagnose problems