Does AI improve software delivery performance?

Not automatically. The 2024 DORA report found rising AI adoption was associated with lower delivery throughput and stability, not higher. AI speeds up writing code, but delivery performance depends on how that code gets reviewed, batched, and shipped, which AI doesn't fix and can quietly make worse.

What did the 2024 DORA report find about AI?

With about 76% of respondents using AI for part of their work, DORA estimated that a 25% increase in AI adoption was associated with a 1.5% decrease in delivery throughput and a 7.2% decrease in delivery stability. More adoption tracked with worse delivery outcomes, the opposite of the usual pitch.

Why does AI adoption lower delivery stability?

Because it inflates batch size. DORA has shown for years that large change sets are riskier. AI makes producing more code nearly free, so pull requests get bigger and harder to review, and big batches fail more often in production. The problem is the size of the change, not the origin of the code.

How do you adopt AI without hurting delivery?

Keep the batches small and the gates real. Cap PR size, require review the AI can't talk its way past, and watch change-failure rate, not lines shipped. The teams that stay stable with AI are the ones that kept their delivery discipline instead of letting volume erase it.

Does AI improve software delivery? What DORA 2024 found

tsukumo

Does AI improve software delivery? What DORA 2024 found · tsukumo

Same AI, two delivery cultures

Under AI	Disciplined team	Rubber-stamp team
Batch size	capped, splits stay reviewable	balloons, one giant diff
Code review	a human holds the change in their head	skims a wall, approves on trust
What gets watched	change-failure rate, time-to-restore	lines and percent written by AI
Net effect on stability	holds or improves	drifts down, as DORA measured

AI makes it easy to ship more. DORA's data says that's the problem.

What the DORA 2024 data actually says#

Why faster code makes delivery worse#

This isn't an argument against AI#

What actually protects delivery#

What to do on Monday#

How we think about it#

How we run a 9-agent growth team on wrai.th (and what broke)

AI 'reasoning' has a cliff. Apple went and found the edge.

Your multi-agent system isn't failing on the model. Berkeley counted where.

Want this running on your team?