DevOps Site Reliability Engineering (SRE) Foundation

The SRE (Site Reliability Engineering) Foundation? course is an introduction to the principles & practices that enable an organization to reliably and economically scale critical services. Introducing a site-reliability dimension requires organizational re-alignment, a new focus on engineering & automation, and the adoption of a range of new working paradigms. The course highlights the evolution of SRE and its future direction, and equips participants with the practices, methods, and tools to engage people across the organization involved in reliability and stability evidenced through the use of real-life scenarios and case stories. Upon completion of the course, participants will have tangible takeaways to leverage when back in the office such as understanding, setting and tracking Service Level Objectives (SLO’s). The course was developed by leveraging key SRE sources, engaging with thought-leaders in the SRE space and working with organizations embracing SRE to extract real-life best practices and has been designed to teach the key principles & practices necessary for starting SRE adoption.
Course Details

Price:

$1,495.00

Days:

2

Location:

Virtual

Course Overview

The SRE (Site Reliability Engineering) Foundation? course is an introduction to the principles & practices that enable an organization to reliably and economically scale critical services. Introducing a site-reliability dimension requires organizational re-alignment, a new focus on engineering & automation, and the adoption of a range of new working paradigms. The course highlights the evolution of SRE and its future direction, and equips participants with the practices, methods, and tools to engage people across the organization involved in reliability and stability evidenced through the use of real-life scenarios and case stories. Upon completion of the course, participants will have tangible takeaways to leverage when back in the office such as understanding, setting and tracking Service Level Objectives (SLO’s). The course was developed by leveraging key SRE sources, engaging with thought-leaders in the SRE space and working with organizations embracing SRE to extract real-life best practices and has been designed to teach the key principles & practices necessary for starting SRE adoption.

The history of SRE and its emergence at Google/li> The inter-relationship of SRE with DevOps and other popular frameworks/li> The underlying principles behind SRE/li> Service Level Objectives (SLO’s) and their user focus/li> Service Level Indicators (SLI’s) and the modern monitoring landscape/li> Error budgets and the associated error budget policies/li> Toil and its effect on an organization’s productivity/li> Some practical steps that can help to eliminate toil/li> Observability as something to indicate the health of a service/li> SRE tools, automation techniques and the importance of security/li> Anti-fragility, our approach to failure and failure testing/li> The organizational impact that introducing SRE brings

There are no required prerequisites for the DevOps Institute’s Site Reliability Engineering (SRE) Foundation certification, but the following are recommended: Understanding of common DevOps terminology and concepts, Related work experience, IT experience, and Working knowledge in the DevOps field.

Use Azure Resource Manager

  • Understand Azure Resource Manager benefits
  • Understand Azure resource terminology
  • Set up resource groups
  • Set up Azure Resource Manager locks
  • Reorganize Azure resources
  • Remove resources and resource groups
  • Identify resource limits

Get Started with Azure Cloud Shell, Bash, and PowerShell

  • Introduction to Azure Cloud Shell
  • Understand how Azure Cloud Shell works
  • When should you use Azure Cloud Shell?
  • Introduction to Bash
  • Bash fundamentals
  • Bash commands and operators
  • Introduction to PowerShell
  • Locate commands

Deploy Resources with Azure Templates

  • Understand Azure Resource Manager template advantages
  • Examine the Azure Resource Manager template schema
  • Examine the Azure Resource Manager template parameters
  • Consider Bicep templates
  • Understand QuickStart templates

Manage Identity with Microsoft Entra ID

  • Examine Microsoft Entra ID
  • Distinguish between Microsoft Entra ID and Active Directory Domain Services
  • Examine Microsoft Entra ID as a directory service for cloud apps
  • Distinguish between Microsoft Entra ID P1 and P2 plans
  • Examine Microsoft Entra Domain Services

Manage Users and Groups

  • Set up user accounts
  • Administer user accounts
  • Set up bulk user accounts
  • Set up group accounts
  • Set up administrative units

Manage Subscriptions and Cost Controls

  • Identify Azure regions
  • Deploy Azure subscriptions
  • Get started with an Azure subscription
  • Identify Azure subscription usage
  • Deploy Microsoft Cost Management
  • Apply and manage resource tagging
  • Apply and manage cost savings

Enforce Governance with Azure Policy

  • Set up management groups
  • Deploy Azure policies
  • Set up Azure policies
  • Set up policy definitions
  • Set up an initiative definition
  • Scope the initiative definition
  • Identify compliance

Control Access with Azure RBAC

  • Deploy role-based access control
  • Set up a role definition
  • Set up a role assignment
  • Distinguish between Azure roles and Microsoft Entra roles
  • Apply and manage role-based access control
  • Understand fundamental Azure RBAC roles

Manage Identities in Microsoft Entra ID

  • What are user accounts in Microsoft Entra ID?
  • Administer app and resource access using Microsoft Entra groups
  • Collaborate using guest accounts and Microsoft Entra B2B

Enable Password Self-Service

  • Introduction to self-service password reset in Microsoft Entra ID
  • Deploy Microsoft Entra self-service password reset

Design and Configure Virtual Networks

  • Plan virtual networks
  • Set up subnets
  • Set up virtual networks
  • Plan IP addressing
  • Set up public IP addressing
  • Associate public IP addresses
  • Allocate or assign private IP addresses

Secure Network Traffic with NSGs and Peering

  • Deploy network security groups
  • Identify network security group rules
  • Identify network security group effective rules
  • Set up network security group rules
  • Deploy application security groups

Integrate Networks with Peering and Routing

  • Identify Azure Virtual Network peering use cases
  • Identify gateway transit and connectivity options
  • Set up virtual network peering
  • Extend peering with user-defined routes and service chaining
  • Understand system routes
  • Identify user-defined routes
  • Identify service endpoint uses and supported services
  • Identify private link use cases

Distribute and Balance Traffic

  • Identify Azure Load Balancer use cases
  • Deploy a public load balancer
  • Deploy an internal load balancer
  • Identify load balancer SKUs
  • Create backend pools, health probes, and load balancing rules
  • Implement Azure Application Gateway
  • Determine Application Gateway routing
  • Configure Application Gateway components

Host and Route Domains

  • What is Azure DNS?
  • Configure Azure DNS to host your domain
  • Dynamically resolve resource names with alias records

Manage Storage Accounts and Services

  • Implement Azure Storage
  • Explore storage services and account types
  • Choose replication strategies
  • Secure storage endpoints and access

Use Azure Blob Storage

  • Implement Azure Blob Storage
  • Create blob containers
  • Assign blob access tiers
  • Add blob lifecycle management rules
  • Understand blob object replication
  • Upload blobs
  • Review Blob Storage pricing

Secure and Access Azure Storage

  • Review Azure Storage security strategies
  • Create shared access signatures (SAS)
  • Identify URI and SAS parameters
  • Understand Azure Storage encryption
  • Create customer-managed keys
  • Apply Azure Storage security best practices

Work with File Storage and Sync

  • Compare storage options for file shares and blobs
  • Manage Azure file shares and snapshots
  • Enable soft delete
  • Use Azure Storage Explorer
  • Deploy Azure File Sync

Provision Storage and Manage Data Access

  • Decide how many storage accounts you need
  • Choose account settings and creation tools
  • Use SAS to delegate storage access
  • Use stored access policies

Upload and Manage Data with Azure Storage Explorer

  • Connect Storage Explorer to a storage account
  • Connect to Azure Data Lake Storage

Deploy and Manage Virtual Machines

  • Understand cloud service responsibilities
  • Plan VMs (size, storage, networking)
  • Create and connect to VMs in the Azure portal

Ensure VM Availability

  • Plan for maintenance and downtime
  • Create availability sets and zones
  • Understand update and fault domains
  • Compare vertical and horizontal scaling
  • Implement and autoscale VM scale sets

Deploy and Manage Azure App Services

  • Implement and scale App Service plans
  • Understand pricing and autoscale features
  • Create and manage App Services
  • Deploy apps, configure slots, and enable CI/CD
  • Secure apps and use custom domains
  • Back up and restore apps
  • Use Application Insights for monitoring

Work with Azure Containers

  • Compare containers to VMs
  • Review and deploy Azure Container Instances
  • Implement container groups
  • Explore Azure Container Apps

Manage Infrastructure with Azure CLI

  • What is the Azure CLI?

Create and Configure Windows VMs

  • Create a Windows VM in Azure
  • Use RDP to connect
  • Configure network settings

Host Web Apps in Azure App Service

  • Create a web app in the portal
  • Prepare and deploy code to App Service

Back Up and Restore with Azure

  • What is Azure Backup?
  • How Azure Backup works
  • When to use Azure Backup
  • Protect VM data with snapshots
  • Set up Recovery Services vault
  • Back up and restore VMs
  • Enable soft delete

Implement Azure Site Recovery

Monitor and Analyze with Azure Monitor

  • Describe key capabilities and components
  • Define metrics, logs, and monitoring tiers
  • Understand activity log events and how to query them

Analyze Logs with Log Analytics

  • Determine use cases for Log Analytics
  • Create a workspace
  • Write and structure KQL queries

Use Network Watcher

  • Explore Network Watcher features
  • Review IP flow and next-hop diagnostics
  • Visualize network topology

Respond to Incidents with Azure Monitor Alerts

  • Understand metric, log, and activity log alerts
  • Use action groups and alert rules

Query and Monitor Azure Infrastructure

  • Analyze infrastructure with Monitor logs
  • Query logs to extract key information
  • Monitor VM metrics and event logs with VM Insights
Class Dates & Times
Filters Sort results
Reset Apply
02/11/2026 - 02/12/2026
Virtual
09:00:00 to 17:00:00 EST
Enroll Now
$1,495.00
06/09/2026 - 06/10/2026
Virtual
09:00:00 to 17:00:00 EST
Enroll Now
$1,495.00
— Questions?

Information Request

— Empower Change

Invest in Skills & Equality

Support Diversity, Equity, and Inclusion with Every Purchase.

Great Horizons is a North Carolina Certified HUB Vendor and WOSB. By becoming a patron of our organization, you are not only supporting a historically underutilized business, but a woman-owned small business as well.