Flow & Measurement

Your Team Isn't Slow. Your System Is.

1 Mar 2026·8 min

Your team looks productive. Every developer is allocated. Somehow nothing ships on time.

The instinct is to look for the problem in the people. Someone isn't trying hard enough. The estimates are off. The planning is weak. But if the team is clearly working — if the board shows movement, if the standups confirm activity, if every engineer is genuinely occupied — then the problem is somewhere else.

It is in the system.

The busy-team paradox

Full allocation looks like efficiency. Every hour accounted for. No obvious slack. Management can see that resources are being used. From the outside, this is exactly what a well-run team looks like.

What it actually means is that the system has no room to absorb the natural variation of software work. Estimates are wrong. Dependencies arrive late. Reviews take longer than expected. A bug surfaces in testing. In a system with slack, these variations are absorbed and forgotten. In a system running at 95% utilisation, every one of them creates a queue.

That queue is invisible on the utilisation report. It shows up in cycle time — and by then, the damage has already been done.

Why queues form at high utilisation

Think about a car park at 95% capacity. Every space is technically available somewhere, but finding one means circling, waiting, queuing at the entrance. The infrastructure hasn't changed. The experience for anyone trying to get through has transformed completely.

Software teams work the same way. When every person is fully loaded, every handoff creates a wait. A developer finishes a feature and moves it to code review. The reviewer is at capacity — the work sits for three days. After review, it needs testing. The tester is at capacity too. Another two days in a queue. The work is not stuck because people are slow. It is stuck because there is nowhere for it to go.

The problem is not effort. It is the absence of slack.

The curve nobody shows you

The relationship between utilisation and wait time is not linear. It is exponential at the high end, and the numbers are stark.

A task requiring one day of hands-on effort might spend four days in the system when the team is running at 80% utilisation. At 90%, that same task takes roughly eight days. At 95%, it approaches twenty days — from one day of actual work.

Moving from 80% to 90% doesn't add a little more wait time. It roughly doubles it. Moving from 90% to 95% doubles it again. The last five percent of utilisation is not the most productive. It is the most expensive.

Insight

The last 5% of utilisation is not the most productive. It is the most expensive — and the cost is invisible on every standard dashboard.

Queue length at the 95% utilisation level is 22× larger than at 70%. Every percentage point of “efficiency” near full capacity carries a disproportionate cost.

This is why teams can work harder and deliver slower at the same time. They are not failing at effort. They are failing at the queueing mathematics that nobody showed them.

The relationship between utilisation and cycle time is exponential, not linear. Moving from 90% to 95% roughly doubles wait time again.

Context switching compounds it

High utilisation rarely means one person working on one thing. It usually means three or four people each working on five or six things simultaneously.

A developer working on five items is not five times as productive as one working on a single item. They spend a significant portion of every day remembering where they left off, re-establishing context, resolving collisions between parallel tracks. The utilisation report says 95%. The amount of focused work actually happening is considerably lower.

High utilisation and high work in progress reinforce each other. The system fills up with items that are technically active but practically waiting, and the people nominally assigned to them are spending their cognitive budget on context switches rather than completion.

What this looks like in the data

When cycle time data is plotted against utilisation over time, the pattern emerges clearly. During periods of high allocation, average cycle time stretches — not because work got harder, but because queues formed. When WIP is managed down and utilisation drops to 70 or 75 percent, cycle time contracts. The team appears to have got faster. They did not get faster. The system got less congested.

The mistake is treating this as a paradox. It is not. It is the system behaving exactly as queueing theory predicts. The data is telling you where the problem is. The standard dashboard just isn't listening.

The management instinct that makes it worse

When delivery slows, the instinct is to add pressure or add work. Start more initiatives in parallel. Increase oversight. Ask for daily status updates.

Each of these responses raises the load on a system that is already congested. The utilisation number goes up. The cycle time goes up with it. The items in progress multiply. Each item waits longer. The team, already at capacity, takes on more context switches to service the new demand.

The leaders who make this call are not being reckless. The instinct that busier teams should deliver more is deeply held and entirely wrong. Nobody showed them the curve.

What to change

Slack is not waste. It is the buffer that lets a system absorb variability without grinding to a halt.

A team at 70% utilisation with focused, limited WIP will consistently outperform a team at 95% with twelve things in flight. Not because they're trying harder, but because work actually moves instead of queuing. Reviews happen within hours instead of days. Testing is not a batch event at the end of the sprint. Items finish cleanly rather than aging for weeks.

The counterintuitive move — the one that produces the result — is to remove work from the system rather than add it.

The constraint is not the team's effort. It is the system's capacity to move work from started to done without it getting stuck in the queues your utilisation report refuses to show.

This requires a policy change, not a culture change. Limit the number of items in progress. Finish before starting. Treat idle capacity as a feature rather than a failure.

If every developer on your team is fully allocated, find out which items have been waiting more than ten days for someone to look at them. That list is where the delivery problem actually lives.