What Makes Difficult Incidents So Difficult?

Hamed Silatani

April 17, 2025

Taggs:

IN THIS ARTICLE

Incident Timeline

Ready to make incident response your competitive advantage?

See how Uptime Labs builds provable, scalable incident response capability across your organisation.

Book a demo

Explore the platform

I’m using the term “difficult incident” carefully here — because, sure, that begs the question: what makes an incident easy or hard?

For this post, lets agree that a difficult one usually involves multiple teams, unclear ownership, and no obvious explanation.

When Conversations Stall

If you’ve been on support duty, you’ve probably seen incident chats that look something like this:

Engineer 1: “Anyone from Networks online? Can’t connect ClientData service to its database.”

Network engineer: “I need source and destination IP addresses and port numbers.”

Engineer 1: “We only have service and dependency names in the config. I don’t know the IPs.”

Now we’re stuck waiting — the conversation drags, and precious time is lost.

When Conversations Flow

Compare that to this version:

Engineer 1: “Anyone from Networks online? ClientData service on HOSTXXX can’t connect to its DB at 192.168.x.x:1521. Also seeing SYN_SENT in netstat.”

Network engineer: “Thanks. SYN_SENT usually points to a firewall issue.”

This second exchange moves quickly and gets closer to resolution faster.

It’s All About Mental Models

The difference? The application engineer understands a bit about networks, and the network engineer understands enough about applications to engage meaningfully. Their mental models overlap — and that overlap is key.

But as we introduce more abstraction to make it easier to ship and run code, this shared understanding becomes rarer. You don’t have to know how something works… until you’re knee-deep in an incident.

What Happens When We Have Different Understandings of The Same Thing?

There’s actually a term for this: "Fundamental common ground breakdown."

It’s what happens when people working the same incident make conflicting assumptions about what’s been said — and it often leads to delays or full-blown disasters.

A Personal Example

I was new in a job at a financial trading firm when I spotted high memory usage and rising connections on a load balancer. I suggested a rolling restart of instances to avoid trouble. Platform engineer agreed, and we kicked it off.

Except... it led to a full outage.

Turns out, the load balancer took a while to mark instances as “up.” I assumed “rolling restart” meant a fast, safe handoff between instances. They didn’t clarify. Boom. Outage.

That’s a mental model mismatch — and it cost us.

How to Build Overlap on Purpose

Expanding your knowledge beyond your immediate area — even just a little — can prevent incidents or help resolve them faster. Think application developers understanding databases. Database engineers understanding network and operating systems.

We can leave that overlap to chance… or we can intentionally build it.

Rotating engineers across teams or running incident drills are great ways to build those shared models. While we can't ensure identical mental models, we can foster overlap among team members' mental models.

Hamed Silatani

Hamed is the co-founder and CEO of Uptime Labs. He has 20 years of experience in engineering leadership, reliability engineering and IT operations. Having spent the majority of his career at the sharp end of incident response in financial services, he's looking to help all companies master the unexpected.

What Makes Difficult Incidents So Difficult?

Ready to make incident response your competitive advantage?

When Conversations Stall

When Conversations Flow

It’s All About Mental Models

What Happens When We Have Different Understandings of The Same Thing?

A Personal Example

How to Build Overlap on Purpose

Hamed Silatani

Measure the Learning, Not Just the Downtime

Why Teamwork Makes (Or Breaks) Your Incident Response

Unpacking the ‘People Side’ of Incident Response

Ready to make incident response your competitive advantage?

What Makes Difficult Incidents So Difficult?

Ready to make incident response your competitive advantage?

When Conversations Stall

When Conversations Flow

It’s All About Mental Models

What Happens When We Have Different Understandings of The Same Thing?

A Personal Example

How to Build Overlap on Purpose

Hamed Silatani

Related content

Measure the Learning, Not Just the Downtime

Why Teamwork Makes (Or Breaks) Your Incident Response

Unpacking the ‘People Side’ of Incident Response

Ready to make incident response your competitive advantage?