Integrated monitoring, logging, and trace managed services for applications and systems running on Google Cloud and beyond.

Google Cloud’s approach to logging, monitoring and observability is comprehensive and unified, making it easier for IT teams to keep track of their application’s performance. With customizable dashboards, comparison graphs, and logs indexed for query in real time, Google Cloud makes it easy to observe key metrics in near real time.

This ability helps engineers diagnose issues quickly as they arise and more effectively manage the performance of applications at scale.

Learn the techniques for monitoring, troubleshooting, and improving infrastructure and application performance in Google Cloud. Guided by the principles of Site Reliability Engineering (SRE).

Overview

The Logging, Monitoring and Observability in Google Cloud training course teaches participants techniques for monitoring, troubleshooting, and improving infrastructure and application performance in Google Cloud.

Learn how to monitor, troubleshoot, and improve your infrastructure and application performance. Guided by the principles of Site Reliability Engineering (SRE), this official Google Cloud course features a combination of lectures, demos, hands-on labs, and real-world case studies. In this course, you’ll gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, and profiling CPU and memory usage.

Skills Covered

This course teaches participants the following skills:

  • Plan and implement a well-architected logging and monitoring infrastructure
  • Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Create effective monitoring dashboards and alerts
  • Monitor, troubleshoot, and improve Google Cloud infrastructure
  • Analyze and export Google Cloud audit logs
  • Find production code defects, identify bottlenecks, and improve performance
  • Optimize monitoring costs

Prerequisites

To get the most out of this course, participants should have:

Target Audience

This class is intended for the following participants:

  • Cloud architects, administrators, and SysOps personnel
  • Cloud developers and DevOps personnel

Course Curriculum

Module 1: Introduction to Google Cloud Monitoring Tools

  • Understand the purpose and capabilities of Google Cloud operations-focused components: Logging, Monitoring, Error Reporting, and Service Monitoring
  • Understand the purpose and capabilities of Google Cloud application performance management focused components: Debugger, Trace, and Profiler

Module 2: Avoiding Customer Pain

  • Construct a monitoring base on the four golden signals: latency, traffic, errors, and saturation
  • Measure customer pain with SLIs
  • Define critical performance measures
  • Create and use SLOs and SLAs
  • Achieve developer and operation harmony with error budgets

Module 3: Alerting Policies

  • Develop alerting strategies
  • Define alerting policies
  • Add notification channels
  • Identify types of alerts and common uses for each
  • Construct and alert on resource groups
  • Manage alerting policies programmatically

Module 4: Monitoring Critical Systems

  • Choose best practice monitoring project architectures
  • Differentiate Cloud IAM roles for monitoring
  • Use the default dashboards appropriately
  • Build custom dashboards to show resource consumption and application load
  • Define uptime checks to track aliveness and latency

Module 5: Configuring Google Cloud Services for Observability

  • Integrate logging and monitoring agents into Compute Engine VMs and images
  • Enable and use Kubernetes Monitoring
  • Extend and clarify Kubernetes monitoring with Prometheus
  • Expose custom metrics through code and with the help of OpenCensus

Module 6: Advanced Logging and Analysis

  • Identify and choose among resource tagging approaches
  • Define log sinks (inclusion filters) and exclusion filters
  • Create metrics based on logs
  • Define custom metrics
  • Use Error Reporting to link application errors to Logging
  • Export logs to BigQuery

Module 7: Monitoring Network Security and Audit Logs

  • Collect and analyze VPC Flow logs and Firewall Rules logs.
  • Enable and monitor Packet Mirroring.
  • Explain the capabilities of Network Intelligence Center.
  • Use Admin Activity audit logs to track changes to the configuration or metadata of resources.
  • Use Data Access audit logs to track accesses or changes to user-provided resource data.
  • Use System Event audit logs to track GCP administrative actions.

Module 8: Managing Incidents

  • Define incident management roles and communication channels
  • Mitigate incident impact
  • Troubleshoot root causes
  • Resolve incidents
  • Document incidents in a post-mortem process

Module 9: Monitoring Network Security and Audit Logs

  • Collect and analyze VPC Flow logs and Firewall Rules logs.
  • Enable and monitor Packet Mirroring.
  • Explain the capabilities of Network Intelligence Center.
  • Use Admin Activity audit logs to track changes to the configuration or metadata of resources.
  • Use Data Access audit logs to track accesses or changes to user-provided resource data.
  • Use System Event audit logs to track GCP administrative actions.

Module 10: Optimizing Stackdriver Costs

  • Understand Stackdriver billing
  • Analyze Stackdriver resource utilization
  • Implement best practices for Stackdriver cost control

Dates & Locations

Let’s make it work for you

Can’t find a date that fits? Need to train your whole team? Looking for a discount?
Speak to one of our learning experts today.

July 9, 2026 - July 10, 2026

Location: Kuala Lumpur
Modal: ILT
Availability: TBC
PROMO

July 9, 2026 - July 10, 2026

Location: Online
Modal: VILT
Availability: TBC
PROMO

September 10, 2026 - September 11, 2026

Location: Kuala Lumpur
Modal: ILT
Availability: TBC
PROMO

September 10, 2026 - September 11, 2026

Location: Online
Modal: VILT
Availability: TBC
PROMO

November 5, 2026 - November 6, 2026

Location: Kuala Lumpur
Modal: ILT
Availability: TBC

November 5, 2026 - November 6, 2026

Location: Online
Modal: VILT
Availability: TBC
Trainocate exam and cert

Exam & Certification

This course is not inclusive of examination but its part of the following Google Cloud certification pathways:

 

Training & Certification Guide

Frequently Asked Questions

Speak to a Training Consultant

All courses are HRD Claimable.
Get in touch with our team via the form or WhatsApp us on +6011-5119 6631

Preferred mode of training
Checkboxes