SITE RELIABILITY ENGINEERING FOUNDATION (NON-FUNDED)

SITE RELIABILITY ENGINEERING FOUNDATION (NON-FUNDED)

Course Duration

16.0 hr(s)

Who Should Attend

  • Individuals starting or leading a move towards increased reliability
  • Individuals interested in modern IT leadership and organizational change approaches
  • DevOps Practitioners, Scrum Masters, System Integrators and Software Engineers
  • Business Managers & Stakeholders
  • ​​​​​​IT and Business Management Developers

Course Overview

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The goal is to create ultra-scalable and highly reliable distributed software systems.
This course is an introduction to the principles & practices that enable an organization to reliably and economically scale critical services. Introducing a site-reliability dimension requires organizational re-alignment, a new focus on engineering & automation, and the adoption of a range of new working paradigms.

Course Schedule

Next available schedule

Course Objectives

Upon completing this course, participants will achieve an understanding of:

  • The history of SRE and its emergence at Google
  • The inter-relationship of SRE with DevOps and other popular frameworks
  • The underlying principles behind SRE
  • Service Level Objectives (SLO’s) and their user focus
  • Service Level Indicators (SLI’s) and the modern monitoring landscape
  • Error budgets and the associated error budget policies
  • Toil and its effect on an organization’s productivity
  • Some practical steps that can help to eliminate toil
  • Observability as something to indicate the health of a service
  • SRE tools, automation techniques and the importance of security
  • Anti-fragility, our approach to failure and failure testing
  • The organizational impact that introducing SRE bring

Pre-requisites

  • Knowledge, Skills & Experience

Prior knowledge of DevOps, which can be achieved by attending the DevOps Foundation course. It’s also recommended that participants have prior working experience or knowledge in IT software development or IT industry operations.

  • Hardware & Software
    • This course will be conducted as a Virtual Live Class (VLC) via Zoom platform.
    • Participants must own a Zoom account and have a laptop or a desktop with “Zoom Client for Meetings” installed. Download from zoom.us/download.

System Requirement

Must-have:

Please ensure that your computer or laptop meets the following requirements.

  • Operating system: Windows 10 or MacOS (64 bit or above)
  • Processor/CPU: 1.8 GHz, 2-core Intel Core i3 or higher
  • Minimum 20 GB hard disk space
  • Minimum 8 GB RAM
  • Webcam (camera must be turned on for the duration of the class)
  • Microphone
  • Internet Connection: Wired or Wireless broadband
  • Latest version of Zoom software to be installed on computer or laptop prior to the class.

Good-to-have:

  • Wired internet connection

Wired internet will provide you with stable and reliable connection.

  • Dual monitors

Using a dual monitor setup will undoubtedly improve your training experience, enabling you to simultaneously participate in hands-on exercises and maintain engagement with your instructor.

Not recommended:

Using tablets are not recommended due to their smaller screen size, which could cause eye strain and discomfort over the course of the program's duration.

Course Outline

Module 1: SRE Principles & Practices

  • What is Site Reliability Engineering?
  • SRE & DevOps: What is the Difference?
  • SRE Principles & Practices

Module 2: Service Level Objectives & Error Budgets

  • Service Level Objectives (SLO’s)
  • Error Budgets & Error Budget Policies

Module 3: Reducing Toil

  • What is Toil?
  • Why is Toil Bad?
  • Doing Something About Toil

Module 4: Monitoring & Service Level Indicators

  • Service Level Indicators (SLI’s)
  • Monitoring & Observability

Module 5: SRE Tools & Automation

  • Automation Focus
  • Hierarchy of Automation Types
  • Secure Automation
  • Automation Tools

Module 6: Anti-Fragility & Learning from Failure

  • Why Learn from Failure
  • Benefits of Anti-Fragility
  • Shifting the Organizational Balance

Module 7: Organizational Impact of SRE

  • Why Organizations Embrace SRE
  • Patterns for SRE Adoption
  • Sustainable Incident Response
  • Blameless Post-Mortems
  • SRE & Scale

Module 8: SRE, Other Frameworks, Trends

  • SRE & Other Frameworks
  • SRE Evolution
  • Additional Sources of Information

Certificate Obtained and Conferred by

  • Certificate of Completion from NTUC LearningHub

Upon meeting at least 75% attendance and passing the assessment(s), participants will receive a Certificate of Completion from NTUC LearningHub.

  • External Certification Exam

After completing this course and passing the official “DevOps Site Reliability Engineering Foundation” certification exam, candidates will receive a Certified Site Reliability Engineering Foundation certification from DevOps Institute. The certification is governed and maintained by DevOps Institute.

You will be awarded with the following digital badge(s) upon passing the official DevOps Site Reliability Engineering Foundation exam:

Site Reliability Engineering (SRE) Foundation℠ — DevOps Institute

Additional Details

Medium of Instruction: English
Trainer to trainee ratio: 1:20

Mode of Delivery:
Virtual Live Class (VLC) via Zoom
Courseware: Learner’s Resources on DevOps Institute Portal

Price

Course Fee

 

Individual Sponsored

Company Sponsored

Before GST

After GST

Before GST

After GST

Full Course Fee

$ 1,400.00

$ 1,526.00

$1,400.00

$ 1,526.00

Terms & Conditions apply. NTUC LearningHub reserves the right to make changes or improvements to any of the products described in this document without prior notice.

Prices are subject to other LHUB miscellaneous fees.

What Others Also Enrolled In