The bottleneck is never the work. It's the waiting.
The takeaway
the bottleneck is never the work — it's the waiting
What’s in this article
Pick one task that always crawls through your business. Measure how long the work itself takes. Then measure how long it sat there, untouched, waiting for someone to act. The second number almost always dwarfs the first, and almost no one tracks it.
Forty minutes of work, six days of waiting
I once watched a single customer request take six days to close. I was curious, so I pulled it apart. The actual work inside it, the typing and the deciding, was maybe forty minutes. Everything else was waiting.
It landed in an inbox on a Monday and sat overnight. Tuesday it got read and forwarded to the one person who knew the answer. That person was in meetings, so it waited again. Wednesday it needed a quick approval, which meant waiting for someone who was traveling. By Friday the loop finally closed.
Nobody was lazy. Nobody dropped the ball. Every person in that chain would have told you they were busy and responsive, and they'd have been telling the truth. The request just kept aging in line, invisible, because waiting doesn't show up on anyone's task list. You can't see it in a calendar. It generates no notification. It is the negative space of your operation, and it is most of what your customer actually experiences.
Run the experiment yourself this week. Take the proposal, the invoice, the reply to a client you want to keep. Write down two numbers: time spent working, time spent waiting. Once you've seen the gap, you can't unsee it.
Why queues behave the way they do
There's a well-studied body of work on this, most of it borrowed from manufacturing and from how products get built. Don Reinertsen spent years documenting how queues drive delay in development work, and the core finding travels far beyond software.
The mechanism is simple and a little brutal. The closer any resource gets to full capacity, the longer things wait in front of it, and not in a straight line. It curves upward fast. A person who is busy half the time barely creates a queue. Push that same person to ninety percent loaded and the wait time in front of them explodes. This is queueing theory, and it's why a road at ninety percent capacity still moves while one at a hundred percent locks into a jam. The cars didn't get slower. The system did.
Most businesses run their best people at something close to a hundred percent. The reasoning feels responsible: idle time looks like waste, so you fill it. But a person with zero slack has no capacity to absorb the next thing that arrives. So it waits. And the work behind it waits. The very efficiency you optimized for is what manufactures the delay.
That's the uncomfortable part. The waiting isn't a sign of a broken team. It's the predictable output of a system loaded too high, doing exactly what loaded systems do.
Speeding people up leaves the delay untouched
Here's where most of us reach for the wrong tool. We feel the slowness, decide the problem is effort, and go to work on the forty minutes.
We push the team. We ask for more urgency. We praise the people who hustle and stay late. We buy a faster tool that shaves a few minutes off the task itself. And the six days barely move, because we optimized the smallest part of the timeline and left the largest part completely alone.
Think about what actually happened in that six-day request. If the person doing the forty minutes had somehow done it in twenty, the request still would have closed on Friday. The work was never the constraint. The handoffs were. Cutting the work in half saves you twenty minutes against a five-day pile of waiting.
This is why hiring often disappoints. A founder feels overwhelmed, adds a person, and the business doesn't get faster. The new hire creates another handoff, another inbox, another place for work to stop and wait for attention. You added capacity to the wrong layer. The line got longer, not shorter.
Effort is visible, so we manage it. Waiting is invisible, so we ignore it. That single bias explains most of why busy companies stay slow.
Find the line, then attack the wait
Start by mapping one important flow, end to end. Not what should happen. What actually happens to a request from the moment it arrives to the moment it's resolved. Mark every point where the work stops and waits for a human to pick it up. Those stopping points are your real targets.
Then go after the waiting directly, in roughly this order.
Kill the approvals that don't protect anything. Most approval steps exist because something went wrong once, years ago. Set a dollar threshold or a clear rule and let people act without asking. Every approval you remove deletes a whole queue.
Remove single points of knowledge. When one person is the only one who knows the answer, everything backs up behind their availability. Write the answer down. Now three people can move it instead of one.
Deliberately build in slack. Don't load your key people to the brim. A team running at seventy percent feels almost lazy on a quiet day and stays fast on a busy one. That spare capacity is what absorbs the surprise.
Make the queue visible. A shared board where you can see what's waiting and for how long changes behavior on its own. People act on the thing that's been sitting for three days once they can actually see it sitting.
But some waiting is real
A fair pushback: not every pause is waste. Some waiting is the system protecting itself.
You genuinely want a second set of eyes on a contract worth six figures. You want a beat before a refund that doesn't sit right. Batching ten small things into one focused block is often smarter than handling each the second it lands. Slack itself is intentional waiting, the buffer that keeps you fast. The goal was never zero delay. Zero delay usually means you've stripped out the judgment too.
The distinction is whether the wait is doing work. Waiting that catches a real mistake, or lets someone do their best thinking instead of reacting, earns its place. Waiting that exists because a request fell into a gap, because nobody owned it, because the one person was out, because an approval is pure ritual, that earns nothing. It's a tax your customer pays for your structure.
So don't chase a frictionless machine. Chase the difference between waiting that protects something and waiting that protects nothing. Defend the first kind on purpose. Hunt the second kind like it's costing you customers, because it is.
You're managing a flow, not a to-do list
Most founders manage their business as a pile of tasks. The deeper move is to manage it as a flow, a thing with currents and dams and places where the water pools.
This reframe changes what you pay attention to. You stop asking only "is everyone working hard" and start asking "where does work go to sit." You stop celebrating heroics, the person who pulled the late night to rescue the deal, and start asking why the deal needed rescuing at all. Heroics are usually a sign that the flow is broken and a human is paying for it with their evening.
It also reframes what to automate. The point of handing work to systems isn't to make people redundant. It's to delete the waiting. A request that used to sit overnight gets routed, drafted, and surfaced to the right person while it's still warm. The forty minutes might still belong to a human. The six days don't have to belong to anyone.
That's the work worth doing. Stop watching effort. Start watching the gaps between the effort. The gaps are where your business is actually slow, and they're the part almost nobody is looking at. We build that operating layer at marsa.ai/business.
Explore /business →
Frequently asked questions
What's the difference between work time and wait time?
Work time is the hands-on effort a task requires: writing the proposal, processing the invoice, drafting the reply. Wait time is everything else, the hours and days a task sits untouched between steps, waiting for someone to read it, approve it, or know the answer. In most businesses, wait time is several times larger than work time, but only work time shows up on task lists, so it's the only part anyone manages.
How do I actually measure the waiting in my business?
Pick one recurring task and trace a single real instance from arrival to resolution. Note the timestamp at every step: when it arrived, when it was first touched, when it moved to the next person, when each person acted. The gaps between a step finishing and the next one starting are your wait time. Do this for three or four instances and the pattern becomes obvious. You don't need software for this. A notepad and one honest trace will show you more than a dashboard.
Why does running my best people at full capacity make things slower?
Because queues grow non-linearly as any resource approaches full load. A person who is half-busy barely creates a backlog; the same person at ninety percent loaded creates a long one, and at a hundred percent the line keeps growing because there's never a free moment to clear it. It's the same reason a highway jams at full capacity even though no individual car slowed down. Someone with no slack can't absorb the next arrival, so it waits, and everything behind it waits too.
Won't hiring more people fix the bottleneck?
Often not, and sometimes it makes things worse. If the delay comes from handoffs and waiting, adding a person adds another handoff, another inbox, another place work can stop. You've added capacity to a layer that wasn't the constraint. Hiring helps when the work itself genuinely exceeds available hands. It doesn't help when the real problem is approvals, single points of knowledge, and work sitting in gaps. Map the flow before you add to it.
How is this different from just being more productive?
Productivity advice almost always targets the work: do it faster, focus better, batch your tasks. That's optimizing the forty minutes. This is about the days. You can double your team's personal productivity and a request that takes six days to close will still take roughly six days, because the constraint was never effort. It was the waiting between efforts. Different problem, different fix.
Is the goal to eliminate all waiting?
No, and trying to would hurt you. Some waiting protects the business: a second review on a large contract, a deliberate pause before an irreversible decision, batching small items into a focused block, and the intentional slack that keeps your team fast under pressure. That waiting is doing real work. The waiting to eliminate is the kind that protects nothing, where a task fell into a gap, nobody owned it, or an approval is pure ritual. Defend the useful pauses; hunt the useless ones.