Designing reliable cloud solutions requires more than infrastructure knowledge—it demands systems thinking, performance trade-offs, and cost-aware decision-making. This course equips participants with design tools and architectural patterns to build scalable and secure systems on Google Cloud. Through lectures and labs, they will address networking, storage, security, and resilience in cloud design.
Learning Outcomes:
Define and measure application requirements using KPIs, SLOs, and SLIs.
Architect secure, resilient, and scalable cloud-native applications.
Apply design principles to networking, storage, and compute layers.
Plan for availability, disaster recovery, and cost optimisation.
Key Topics:
Application decomposition and service-level objectives
Microservices, storage planning, and hybrid networking
Resiliency strategies, fault tolerance, and business continuity
Security architecture, encryption, and access management
Capacity planning, monitoring, and incident response
Module 1: Defining the Service
- Design in this class.
- State and solution.
- Measurement.
- Gathering requirements, SLOs, SLAs, and SLIs (key performance indicators).
Module 2: Business-logic layer design
- Microservices architecture.
- GCP 12-factor support.
- Mapping compute needs to Google Cloud Platform processing services.
- Compute system provisioning.
Module 3: Data layer design
- Classifying and characterizing data.
- Data ingest and data migration.
- Identification of storage needs and mapping to Google Cloud Platform storage systems.
Module 4: Network layer design
- Network edge configuration.
- Network configuration for data transfer within the service, including load balancing and network location.
- Network integration with other environments, including on premises and multi-cloud.
Module 5: Design for resiliency, scalability, and disaster recovery
- Failure due to loss of resources.
- Failure due to overload.
- Strategies for coping with failure.
- Business continuity and disaster recovery, including restore strategy and data lifecycle management.
- Scalable and resilient design.
Module 6: Design for security
- Google Cloud Platform security.
- Network access control and firewalls.
- Protections against denial of service.
- Resource sharing and isolation.
- Data encryption and key management.
- Identity access and auditing.
Module 7: Capacity planning and cost optimization
- Capacity planning.
- Pricing.
Module 8: Deployment, monitoring and alerting, and incident response
- Deployment.
- Monitoring and alerting.
- Incident response.