Blog

Blog

Ebook

6.29.2022

Development Velocity (And How To Balance Reliability)

Development velocity is a measurement of how much work a software team can complete, based on similar work completed in previous iterations.

Blog

Ebook

6.9.2022

Incident vs. Problem [Understanding the Differences]

Curious about incidents vs. problems? We explain the differences and how to handle each one.

Blog

Ebook

6.7.2022

Incident Priority Matrix (Understanding Impact and Urgency)

An incident priority matrix helps set priority levels for your incidents based on four levels of impact. Here's how to determine an incident's urgency.

Blog

Ebook

6.2.2022

Software Engineers vs Site Reliability Engineering Explained

We discuss what software engineers and site reliability engineers are, how they differ, and why both matter in the software development process.

Blog

Ebook

5.31.2022

DevOps Team Structure | Roles & Responsibilities

We explain how a DevOps team is structured, the roles and responsibilities within the team, and the balance between an individual contributor and the needs of the team.

Blog

Ebook

5.26.2022

What Is DevOps Automation & What Are The Benefits?

Looking into DevOps automation? We explain how automation can improve your process, how to prioritize which tasks to automate, best practices, and how to avoid common mistakes.

Blog

Ebook

5.10.2022

DevOps Pipeline | Best Practices, Tips, & Techniques

Looking into DevOps pipelines? We explain what a DevOps pipeline is, how to build one, and the best practices for building one for your team.

Blog

Ebook

5.4.2022

The Reverse Red Herring

Our VP of Engineering relates a story where a seemingly innocuous clue turns out to be key: a reverse red herring!

Blog

Ebook

5.3.2022

CI/CD Pipeline | What It Is & How It Works

Wondering about CI/CD pipelines? We explain what the CI/CD pipeline is, the steps involved, and best practices along the way.

Blog

Ebook

4.28.2022

Post-Incident Review | Why It’s Important & How It’s Done

A post-incident review is an evaluation of the incident response process. The goal is to have clear actions to improve the process and prevent further incidents.

Blog

9.28.2022

On-Call Schedules - Best Practices in 2022 (With Examples)

On-call rotation scheduling can feel like a jigsaw puzzle. Here are examples of today’s practices to simplify the task while preventing employee fatigue.

Blog

9.12.2022

Blameless Expands Microsoft Partnership to Deliver Faster, More Intuitive Incident Response Collaboration

The integration between Blameless and Microsoft Teams is significant for our customers, because it enhances their main line of communication during the most pressing moments of incident response. Directly from Microsoft Teams, an on-call engineer initiates an incident, notifies stakeholders, and orchestrates rapid response, all while automatically collecting each event or “touch” that adds value to the retrospective (postmortem) for learning.

Blog

8.31.2022

Software Metrics Every SRE Team Should Measure

Software metrics give important insight into the performance of your product, but which ones matter most to SRE teams? How do you decide which metrics to track?

Blog

8.24.2022

What is an SRE job description?

Whether you’re building an SRE team or looking for a job as an SRE, understanding the SRE job description is important. How would you define an SRE job?

Blog

8.17.2022

Chaos Engineering: What Is It & How Does It Work?

Distributed software systems have many points of failure. Can the process of chaos engineering help identify problems and gauge resiliency?

Blog

8.3.2022

Introducing Our Newest Integration with ServiceNow

Blameless released a new integration with ServiceNow’s incident management ticketing solution. If you are a DevOps team moving towards SRE, this is worth a look.

Blog

7.14.2022

Promoted to SRE Advocate: A Dream Turned Reality

Big news, everyone! We’re excited to promote our own Matt Davis to SRE Advocate. Hear what Matt has to say about his journey into this role and what it means to him.

Blog

7.12.2022

7 Ways Tagging Incidents Can Teach You About System Health

One of the most powerful ways to prepare for future incidents is to study and learn from patterns in past incidents. Learn how incident tagging can help.

Blog

7.6.2022

SRE Roles and Responsibilities Defined

SRE is a practice that creates a bridge between operations and development. We discuss the roles and responsibilities of a site reliability engineer.
