SRE 101: Getting Started

What is SRE?

Site Reliability Engineering (SRE) is a practice for managing the reliability of systems. Google developed SRE in the early 2000s, when Ben Treynor Sloss started the first SRE team, coined the name, and set the tone for the industry.

SRE Incident Response 101

What is On Call?

In the engineering world, being “on call” means you must be reachable if an incident or issue arises. The designated engineer or group of engineers may be on call during the workday or after regular business hours.