Discover more about this course below.
Provider: GEL
Accredited by: PeopleCert
Exam vouchers included
Course duration: 30+ hours
Access Period: 12 months
Tutor support
Quizzes and exam practice
Works on mobile devices
We comprehensively address the DevOps Institute's SRE℠ curriculum, educating individuals on all aspects of site reliability engineering and its role in enhancing businesses' ability to deliver and expand top-tier services. Beginning with an overview of SRE principles and methodologies, the program delves into their implementation and the complete enhancement of your workflow.
SRE Foundation (SREF) Module 1: SRE Principles & Practices
Learning Objectives
This module provides an introduction to site reliability engineering (SRE) as a field, highlighting its distinctions from DevOps. Delve into the core principles and methodologies of SRE in this comprehensive overview.
SRE Foundation (SREF) Module 2: Service Level Objectives & Error Budgets
Learning Objectives
This module explores service level objectives (SLOs), service levels, error budgets, and policies governing error budgets.
SRE Foundation (SREF) Module 3: Reducing Toil
Learning Objectives
This module introduces the concept of 'toil', discusses its implications as a challenge, and explores effective strategies for its management.
SRE Foundation (SREF) Module 4: Monitoring & Service Level Indicators
Learning Objectives
This module centers on service level indicators (SLIs), emphasising observability and monitoring practices.
SRE Foundation (SREF) Module 5: SRE Tools & Automation
Learning Objectives
This module examines the concept of 'automation' as defined by both SRE and DevOps. It delves into various categories of automation and their organisational structure, in addition to highlighting popular automation tools.
SRE Foundation (SREF) Module 6: Anti-Fragility & Learning from Failure
Learning Objectives
This module explores the SRE principle of deriving insights from failures and its correlation with anti-fragility and chaos engineering practices.
SRE Foundation (SREF) Module 7: Organisational Impact of SRE
Learning Objectives
This module investigates the organisational management of SRE. It discusses the initial implementation of SRE, the reasons behind the widespread adoption of SRE by businesses, strategies for integrating SRE, effective incident response practices, and the importance of blameless post-mortems. Additionally, it explores the scalability of SRE implementation.
SRE Foundation (SREF) Module 8: SRE, Other Frameworks, Trends
Learning Objectives
This module delves into the integration of SRE with prominent frameworks such as IT4IT, Agile, and ITIL 4. It also explores the evolution of SRE and its future trajectory.
SRE Foundation (SREF) Practice Exams
Learning Objectives
This module includes two mock exams designed to familiarize candidates with the environment of the Site Reliability Engineering (SRE) Foundation exam.
An Introduction to SRE Practitioner (SREP)
Learning Objectives
This module presents students with an overview of the course, highlighting its goals, objectives, study schedule, and layout. Participants will be guided through the course outline and offered supplementary resources such as a glossary, additional reading materials, diagrams, and links to access crucial SRE publications. Common queries about SRE Practitioner are addressed, followed by a quick assessment to evaluate retention of the SRE Foundation syllabus content.
SRE Practitioner (SREP) Module 1: SRE Antipatterns
Learning Objectives
This module delves into SRE anti-patterns and explores how these counterproductive behaviours can have adverse effects on a pipeline.
SRE Practitioner (SREP) Module 2: Service Levels and Error Budgets
Learning Objectives
This module explores system boundaries and illustrates the process of defining system capabilities, as well as establishing suitable service level indicators (SLIs) and service level objectives (SLOs). Additionally, it covers measuring the baseline and delves into multi-service architecture, including the calculation and utilisation of error budgets.
SRE Practitioner (SREP) Module 3: Building Secure and Reliable Systems
Learning Objectives
This module outlines the responsibilities of a site reliability engineer in system design, emphasising key factors related to evolving landscapes and security needs. It further explores modern methodologies, technologies, and resources for system design, including design patterns that empower SRE professionals to construct secure, robust, dependable, and scalable systems.
SRE Practitioner (SREP) Module 4: Full-stack Observability
Learning Objectives
This module centers on the essential components of comprehensive stack observability and the role of instrumentation in enhancing the observability of SRE systems.
SRE Practitioner (SREP) Module 5: Review: Modules 1-4
Learning Objectives
This interactive module is crafted to assist learners in assessing their understanding of the concepts and terminology discussed in modules one to four. It includes a memory challenge and a concept validation tool.
SRE Practitioner (SREP) Module 6: Platform SRE and AIOps
Learning Objectives
This module explores the advantages of adopting a platform-centric approach in the development and management of platforms as products. It further delves into the utilisation of artificial intelligence for enhancing IT operations and the strategies for AI implementation.
SRE Practitioner (SREP) Module 7: SRE and Incident Management
Learning Objectives
This module explores the essential components of incident management within the incident command framework. It also discusses the application of the Observe, Orient, Decide, Act (OODA) loop in integrating technology, procedures, and assets for effective incident responses.
SRE Practitioner (SREP) Module 8: Chaos Engineering
Learning Objectives
This module explores the concept of 'chaos engineering', which involves conducting experiments on a distributed system to enhance trust in its resilience and adaptability during challenging circumstances. It also provides insights on organising game day drills to practice chaos engineering and debunks prevalent misconceptions surrounding the topic.
SRE Practitioner (SREP) Module 9: Implementing SRE Practices
Learning Objectives
This module delves into the significance of Site Reliability Engineering (SRE) in enhancing operational efficiency and embracing DevOps principles to the fullest. It further explores the strategies and frameworks employed to deploy and operationalise SRE practices.
SRE Practitioner (SREP) Module 10: Review: Modules 6-9
Learning Objectives
This module serves as a reflective tool to assist students in reinforcing their comprehension of the concepts and terminology discussed in modules six to nine. It includes a memory game and concept assessment tool.
SRE Practitioner (SREP) Practice Exams
Learning Objectives
This module contains two practice exams designed to acquaint candidates with the requirements of the Site Reliability Engineering (SRE) Practitioner examination.
This SRE training suite is tailored to fully equip students for the official SRE Foundation and SRE Practitioner exams. It includes offering authorised practice exams to enable students to assess themselves and acclimate to exam conditions.
The course provides simulated exams to aid students in readiness for the actual assessment, along with complimentary exam vouchers. (Terms and conditions apply)
Before scheduling your exam, it is advisable to ensure that your device meets the technical prerequisites. For further details and guidance, please refer to the PeopleCert website.
Requests for exam vouchers are typically processed within 2 working days, but please allow up to 5 days. Students are required to request their exam voucher within the course access period, which commences from the date of purchase. For additional information, please visit the GEL Support & FAQs page.
SRE Foundation (SREF) exam
• This exam comprises 40 multiple-choice questions
• Candidates have 60 minutes to finish the exam
• It is an open-book exam, allowing the use of provided materials only
• To pass, candidates need to achieve a minimum score of 65%: at least 26 out of 40 questions must be answered correctly
• The exam can be taken either online or in person under invigilation
SRE Practitioner (SREP) exam
• This exam comprises 40 multiple-choice questions
• Candidates have 90 minutes to complete the exam
• It is an open-book exam, allowing the use of provided materials only
• To pass, candidates need to achieve a minimum score of 65%: at least 26 out of 40 questions must be answered correctly
• The exam can be taken either online or in person with supervision
What is SRE?
'Site Reliability Engineering (SRE)' involves the ongoing evaluation of a new product's 'reliability' during development. This practice empowers developers to gain insights and cater to the requirements of operations teams effectively.
How does SRE work?
The components of SRE include:
• Defining a 'Service Level Agreement (SLA)' to determine the required reliability for end-users
• Setting up an 'Error Budget' to allocate resources for error resolution before halting production
• Collaboration between site reliability engineers and development teams to manage workloads effectively
• Proactive identification and resolution of issues by site reliability engineers during development
• Developers stepping in for Operations tasks when needed
• Implementation of automation by site reliability engineers to enhance efficiency and reliability
What is a site reliability engineer?
A 'site reliability engineer' is a specialist in automation and coding tasked with identifying and resolving issues across Development and Operations.
How can SRE benefit businesses?
An SRE team enhances not just the reliability but also the efficiency and scalability of a DevOps pipeline. By leveraging SRE practices, Development and Operations teams can redirect their focus to enhancing services in other areas, elevating the standard of releases. The integration of SRE fosters improved communication, transparency, and collaboration within existing DevOps cultures.
Moreover, site reliability engineers excel in addressing and articulating organisational concerns, extracting valuable metrics that can benefit other departments significantly.
Does SRE complement DevOps?
DevOps and SRE complement each other seamlessly. Their synergy stems from a shared focus on automation, cross-team cooperation, and effective communication, enhancing efficiency and reliability in IT workflows. Notably, the SRE Practitioner certification originates from the DevOps Institute, underscoring their interconnectedness.
Do I need to study site reliability engineering?
This course does not have any mandatory requirements for enrolment. Nonetheless, having prior familiarity with SRE and DevOps concepts can be advantageous for a better understanding of the course material.
Why is SRE necessary?
Google pioneered the concept of SRE. Its primary objective is to formalise the collaboration between Development and Operations teams, guaranteeing the creation of code with efficiency, reliability, and operational considerations. This approach is especially beneficial in enterprises where IT departments and teams have become isolated from each other.
Who can benefit from studying SRE?
SRE is well-suited for companies that depend on code development and deployment. It thrives in DevOps settings and is favoured by DevOps professionals and leaders. With the increasing demand for SRE, individuals with expertise in this area will likely encounter smoother career progression opportunities.
The Site Reliability Engineering (SRE)℠ Foundation & Practitioner course is provided by GEL, an ATO of PeopleCert.
SRE℠ is a registered trademark of PeopleCert. Used under licence from PeopleCert. All rights reserved.
We strive to offer a great range of affordable and accessible high-quality courses
We're a family run business and customer service is important to us
We provide accredited courses to further your career and to set you up for success