Implementing the Four Essential Metrics for Accelerating DevOps

Every efficient engineering teams aim to improve the way they build their software products and services. Through Accelerate, three researchers wanted to understand what works in a scientific way.

Dive into this article with us to discover how the Four Key DevOps Metrics can improve the productivity and well-being of your engineers.

Dr. Nicole Forsgren, Jez Humble, and Gene Kim published the results of their years of research developed upon 23 000 datasets from international companies. Startups, enterprises, for-profit, and not-for-profit are all part of this panel.

In Accelerate: The Science of Lean Software and DevOps: Building and Scaling High Performing Technology Organizations, you can learn what sets apart best-performing engineering teams from the rest of the software world. This article aims to give you a high-level view of the importance of the Four Key DevOps Metrics and how to track them.

Summary of the Four Key DevOps Metrics

Is your engineering team using worldwide best practices?

Accelerate metrics and definition	Best level	High level	Medium level	Low level
Deployment Frequency: Number of deployments in a given duration of time. How often does your organization deploy code to production or release it to end users?	On-demand (multiple deploys per day)	Between once per day and once per week	Between once per week and once per month	Between once per month and once every six months
Cycle Time or Change Lead Time: Time to implement, test, and deliver code for a feature (measured from the first commit to deployment). How long does it take to go from code committed to code successfully running in production?	Less than one day	Between one day and one week	Between one week and one month	Between one month and six months
Mean Time to Recovery: How long does it take to restore service when an incident that impacts users occurs?	Less than one hour	Less than one day	Less than one day	Between one week and one month
Change Failure Rate: What percentages of changes to production result in degraded service and subsequently require remediation?	0-15%	0-15%	16-45%	46-60%

Why are Accelerate metrics important for engineering leaders?

You should not think of these metrics as the ultimate objective of your engineering team but rather as good measures to follow over time to improve your processes. As underlined by Dr. Forsgren in her talk, The Key to High Performance: What the Data Says, rigorous implementation of these Key Metrics has shown that engineering teams in the best category are twice more likely to succeed with their commercial goals (profitability, market share, number of customers) and their non-commercials goals (quality of product or service and customer satisfaction). Dr. Forsgren concluded that teams performing well under these DevOps metrics reach a 50M higher market cap growth over three years.

Also, according to the Google State of DevOps 2018, there is a strong correlation between engineering well-being and successful implementation of these metrics. The report underlines that engineers from the Elite performer group "are 1.8 times more likely to recommend their team as a great place to work".

What are the four Accelerate metrics and their best practices?

The Four Key Metrics are as follows:

Deployment frequency

How often does your organization successfully deploy to production?

Deployment Frequency should be the easiest metric for you to collect. In opposition to the other metrics, it only needs one source of input. But be careful. We're talking here about frequency and not volume. You can't show daily deployment volume or grave an average number of deployments per week.

The only other setting to consider is what is a successful deployment for your organization. Do you include any kind of deployment? Deployments that will affect at least 10% of your user base? 50%? This is up to you to decide.

Best-performing teams are considered to "deploy daily,” meaning they successfully release in production almost every day. The four different kinds of behaviors underlined in Accelerate are as follows; which category described you best?

On-demand (multiple deploys per day),
Between once per day and once per week,
Between once per week and once per month,
Between once per month and once every six months.

Cycle Time or Change Lead Time

How long does it take for your organization to release a client request?

There are two schools of thought concerning Cycle Time, as one might be easier to track within your engineering team. You will require two pieces of data:

the date when a customer requested a new feature or when the first commit of your engineer concerning this feature happened,
the date when the deployment was successfully released.

Best-performing engineering teams can release in production a new feature on the same day the work starts. Which statement is the closest to your organization? We can successfully release a feature started today:

in less than one day,
between one day and one week,
between one week and one month,
between one month and six months.

Change Failure Rate

How many percent of your deployments are the source of a new failure in production?

To follow your Change Failure Rate, you need to find the deployment that created each new incident in production. If most of your deployments create new incidents, you should try to release more often and with smaller changes to your codebase.

Best-performing team deployments should have less than 15% to create new incidents in production. The four categories of Change Failure Rate are as follows. Which one is yours?

0-15%,
16-30%,
16-45%,
46-60%.

Mean Time to Recovery

How long does it take to restore service when an incident that impacts users occurs?

To compute your mean time to recovery, you have to save the following

the date when an incident was created in production (not spotted, so we're talking about the date of the deployment),
the date of the deployment that resolved the incident.

Best-performing teams are able to resolve the incident in less than an hour. Which category best describes your organization? We can resolve incidents:

in less than one hour,
in less than one day,
in less than one day,
between one week and one month.

Conclusion of the Accelerate metrics

As it is considered best practice for engineering teams today to follow these Four Key DevOps Metrics, we highly recommend you start following their evolutions in your organization.

Also published here.

Implementing the Four Essential Metrics for Accelerating DevOps

Too Long; Didn't Read

Summary of the Four Key DevOps Metrics

Why are Accelerate metrics important for engineering leaders?