After a course session ends, it will be archived.
Learn how to monitor, troubleshoot, and improve your infrastructure and application performance. Guided by the principles of Site Reliability Engineering (SRE), this course features a combination of lectures, demos, hands-on labs, and real-world case studies. In this course, you'll gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, and profiling CPU and memory usage.
To get the most out of this course, participants should have:
- Google Cloud Fundamentals: Core Infrastructure or equivalent experience
- Basic scripting or coding familiarity
- Proficiency with command-line tools and Linux operating system environments
1. Introduction
2. Introduction to Monitoring in Google Cloud
3. Avoiding Customer Pain
4. Alerting Policies
5. Monitoring Critical Systems
6. Configuring Google Cloud Services for Observability
7. Advanced Logging and Analysis
8. Monitoring Network Security and Audit Logs
9. Managing Incidents
10. Investigating Application Performance Issues
11. Optimizing the Costs of Monitoring
12. Course Resources