From data warehouse to autonomous data and AI platform.

BigQuery is the autonomous data to AI platform, automating the entire data life cycle, from ingestion to AI-driven insights, so you can go from data to AI to action faster.

  • Why get trained: Learn how to design BigQuery storage and schemas, optimise SQL queries, manage data ingestion and use BigQuery ML, BI Engine and Storage APIs.
  • Why it matters: Well-designed BigQuery environments improve query performance, reduce costs and help teams deliver faster, more scalable analytics on Google Cloud.
  • Who should attend: Data analysts, data scientists, data engineers and developers who need advanced BigQuery skills to optimise large-scale data warehouse environments.

Build high-performance Google Cloud data warehouses with Trainocate Malaysia and strengthen your BigQuery expertise. HRD Corp Claimable.

Overview

In this course, you learn about the internals of BigQuery and best practices for designing, optimizing, and administering your data warehouse. Through a combination of lectures, demos, and labs, you learn about BigQuery architecture and how to design optimal storage and schemas for data ingestion and changes.

Next, you learn techniques to improve read performance, optimize queries, manage workloads, and use logging and monitoring tools. You also learn about the different pricing models.

Finally, you learn various methods to secure data, automate workloads, and build machine learning models with BigQuery ML.

Skills Covered

  • Describe BigQuery architecture fundamentals.
  • Implement storage and schema design patterns to improve performance.
  • Use DML and schedule data transfers to ingest data.
  • Apply best practices to improve read efficiency and optimize query performance.
  • Manage capacity and automate workloads.
  • Understand patterns versus anti-patterns to optimize queries and improve read performance.
  • Use logging and monitoring tools to understand and optimize usage patterns.
  • Apply security best practices to govern data and resources.
  • Build and deploy several categories of machine learning models with BigQuery ML.

Prerequisites

Have attended Big Data and Machine Learning Fundamentals course.

Target Audience

Data analysts, data scientists, data engineers, and developers who perform work on a scale that requires advanced BigQuery internals knowledge to optimize performance.

Course Curriculum

Module 1: BigQuery Architecture Fundamentals

  • Introduction
  • BigQuery Core Infrastructure
  • BigQuery Storage
  • BigQuery Query Processing
  • BigQuery Data Shuffling
  • Explain the benefits of columnar storage.
  • Understand how BigQuery processes data.
  • Explore the basics of BigQuery’s shuffling service to improve query efficiency
  • Labs and demos

Module 2: Storage and Schema Optimizations

  • BigQuery Storage
  • Partitioning and Clustering
  • Nested and Repeated Fields
  • ARRAY and STRUCT syntax
  • Best Practices
  • Compare the performance of different schemas (snowflake, denormalized, and nested and repeated fields).
  • Partition and cluster data for better performance.
  • Improve schema design using nested and repeated fields.
  • Describe additional best practices such as table and partition expiration
  • Labs and demos

Module 3: Ingesting Data

  • Data Ingestion Options
  • Batch Ingestion
  • Streaming Ingestion
  • Legacy Streaming API
  • BigQuery Storage Write API
  • Query Materialization
  • Query External Data Sources
  • Data Transfer Service
  • Ingest batch and streaming data.
  • Query external data sources.
  • Schedule data transfers.
  • Understand how to use Storage Write API.
  • Labs and demos

Module 4: Changing Data

  • Managing Change in Data Warehouses
  • Handling Slowly Changing Dimensions (SCD)
  • DML statements
  • DML Best Practices and Common Issues
  • Write DML statements.
  • Address common DML performance problems and bottlenecks.
  • Identify slowly changing dimensions (SCD) in your data and make updates.

Module 5: Improving Read Performance

  • BigQuery’s Cache
  • Materialized Views
  • BI Engine
  • High Throughput Reads
  • BigQuery Storage Read API
  • Explore BigQuery’s cache.
  • Create materialized views.
  • Work with BI Engine to accelerate your SQL queries.
  • Use the Storage Read API for fast access to BigQuery-managed storage.
  • Explain the caveats of using external data sources.
  • Labs and demos

Module 6: Optimizing and Troubleshooting Queries

  • Simple Query Execution
  • SELECTs and Aggregation
  • JOINs and Skewed JOINs
  • Filtering and Ordering
  • Best Practices for Functions
  • Interpret BigQuery execution details and the query plan.
  • Optimize query performance by using suggested methods for SQL statements and clauses.
  • Demonstrate best practices for functions in business use cases.
  • Labs and demos

Module 7: Workload Management and Pricing

  • BigQuery Slots
  • Pricing Models and Estimates
  • Slot Reservations
  • Controlling Costs
  • Define a BigQuery slot.
  • Explain pricing models and pricing estimations (BigQuery UI, bq dry_run, jobs API).
  • Understand slot reservations, commitments, and assignments.
  • Identify best practices to control costs.
  • Labs and demos

Module 8: Logging and Monitoring

  • Cloud Monitoring
  • BigQuery Admin Panel
  • Cloud Audit Logs
  • INFORMATION_SCHEMA
  • Query Path and Common Errors
  • Use Cloud Monitoring to view BigQuery metrics.
  • Explore the BigQuery admin panel.
  • Use Cloud Audit logs.
  • Work with INFORMATION_SCHEMA tables to get insights for your BigQuery entities.
  • Labs and demos

Module 9: Security in BigQuery

  • Secure Resources with IAM
  • Authorized Views
  • Secure Data with Classification
  • Encryption
  • Data Discovery and Governance
  • Explore data discovery using Data Catalog.
  • Discuss data governance using DLP API and Data Catalog.
  • Create IAM policies (e.g., authorized views) to secure resources.
  • Secure data with classifications (e.g., row-level policies).
  • Understand how BigQuery uses encryption.
  • Labs and demos

Module 10: Automating Workloads

  • Scheduling Queries
  • Scripting
  • Stored Procedures
  • Integration with Big Data Products
  • Schedule queries.
  • Use scripting and stored procedures to build custom transformations.
  • Describe how to integrate BigQuery workloads with other Google Cloud big data products.
  • Demos

Module 11: Machine Learning in BigQuery

  • Introduction to BigQuery ML
  • How to Make Predictions with BigQuery ML
  • How to Build and Deploy a Recommendation System with BigQuery ML
  • How to Build and Deploy a Demand Forecasting Solution with BigQuery ML
  • Time-Series Models with BigQuery ML
  • BigQuery ML Explainability
  • Describe some of the different applications of BigQuery ML.
  • Build and deploy several categories of machine learning models with BigQuery ML.
  • Use AutoML Tables to solve high-value business problems.
  • Labs and demos

Dates & Locations

Let’s make it work for you

Can’t find a date that fits? Need to train your whole team? Looking for a discount?
Speak to one of our learning experts today.

August 5, 2026 - August 7, 2026

Location: Kuala Lumpur
Modal: ILT
Availability: TBC

August 5, 2026 - August 7, 2026

Location: Online
Modal: VILT
Availability: TBC

October 7, 2026 - October 9, 2026

Location: Kuala Lumpur
Modal: ILT
Availability: TBC

October 7, 2026 - October 9, 2026

Location: Online
Modal: VILT
Availability: TBC

December 9, 2026 - December 11, 2026

Location: Kuala Lumpur
Modal: ILT
Availability: TBC

December 9, 2026 - December 11, 2026

Location: Online
Modal: VILT
Availability: TBC
Trainocate exam and cert

Exam & Certification

No associated certification. This course along with courses listed below are part of the Google Data Analytics learning path:

 

Training & Certification Guide

Frequently Asked Questions

Speak to a Training Consultant

All courses are HRD Claimable.
Get in touch with our team via the form or WhatsApp us on +6011-5119 6631

Preferred mode of training
Checkboxes