Term FF-005
Decision quality preservation
AI accelerates decisions. Most teams stop checking whether the decisions are still right.
Decision quality preservation is the practice of tracking whether technical decisions made with AI assistance hold up over time. Measured by decision rework rate, incident pattern shift, and senior engineer review time after AI adoption.
What it is
The problem with faster decisions
AI coding assistants produce code faster than humans. That is their primary value proposition and it is well documented. What is less documented is the effect of that speed on the quality of the underlying technical decisions the code represents.
When code is written slowly, architecture decisions and implementation choices are made deliberately. A developer who spends three days implementing a feature has three days to reconsider the approach. A developer who generates the same feature in three hours with an AI assistant may not pause to evaluate the implementation pattern with the same care.
The issue compounds over time. If a team adopts AI tools and their decision rework rate increases, the productivity gains from faster initial generation are partially or fully offset by the cost of reworking incorrect decisions. That offset does not appear in throughput metrics. It appears in rework, in incident root causes, and in the growing proportion of senior time spent reviewing and fixing rather than building.
Decision quality preservation is the metric that makes this offset visible. It does not argue against AI adoption. It argues for maintaining deliberate evaluation practices alongside AI-assisted generation. As an AI metric in the Foundations Framework, it is the complement to throughput quality coupling: where throughput quality coupling tracks the ratio at the delivery level, decision quality preservation tracks it at the architectural decision level.
How it is measured
Three signals that surface decision quality over time
Decision rework rate
The percentage of architecture and implementation decisions that are reversed or substantially revised within 90 days of being made. A rising rework rate after AI adoption suggests that decision velocity has outpaced decision quality. Measured by tracking Architecture Decision Records and change management logs.
Incident pattern shift
The change in the distribution of incident root causes after AI adoption. If design and logic errors increase as a proportion of total incidents while infrastructure incidents remain stable, decision quality has likely declined. Measured by classifying incidents by root cause category and tracking the distribution over time.
Senior engineer review time
The change in the proportion of senior engineer time spent on code and architecture review versus original creation after AI adoption. If senior engineers shift toward spending more time cleaning up AI-generated decisions, decision quality preservation is failing. Measured through time-tracking correlations and Git contribution analysis.
Why it matters
Productivity theater versus sustainable throughput
The risk of measuring AI productivity by velocity alone is significant. A team that ships twice as many features per week while doubling its rework rate is not twice as productive. It is generating debt at twice the speed.
This failure mode is particularly costly for Series A to Series C companies. At this stage, engineering teams are typically 20 to 80 people. Senior engineers are a scarce resource. When AI adoption shifts senior time from creation to review and rework cleanup, the compound effect on throughput is the opposite of what the adoption was supposed to produce. The team gets slower as the senior engineers become quality bottlenecks rather than force multipliers.
The Foundations Framework's Principle 03 — measurement must outpace the tools — applies directly here. If a team adopts AI tools without instrumenting decision quality, they are adopting a tool that will influence the pace and pattern of their decisions without any way to observe whether those decisions are getting better or worse.
Decision quality preservation is the metric that prevents organizations from declaring AI adoption a success while accumulating the technical debt that will surface in the next incident spike. It is the measurement that keeps productivity claims honest.
In practice
What declining decision quality looks like before the incident spike arrives
A 55-person engineering org adopts AI coding assistants in January. By March, their deployment frequency is up 45 percent. The engineering manager presents the number as evidence of productivity improvement. What the QBR slide does not show: the architecture team's sprint retrospectives now consistently include ADR revisions that were not planned. Three ADRs written in February have already been substantially revised. The two senior engineers on the data platform team are spending 60 percent of their time reviewing AI-generated code rather than the 30 percent they spent reviewing human-generated code six months earlier.
None of these signals are catastrophic on their own. Together, they indicate that decision quality preservation is failing: the team is making decisions faster, but a higher percentage of those decisions are wrong on first pass. The rework cost is being absorbed by the most expensive people on the team.
The incident spike arrives in June, when a schema migration decision made with AI assistance in April causes a two-hour production outage. The root cause traces back to a trade-off that was not evaluated because the implementation was generated quickly and reviewed without the depth that would have caught it. The rework cost of the outage is 10x the cost of the additional review time that would have prevented it.
How Clouditive uses it
The fourth AI signal in every Foundations engagement
Decision quality preservation is the fourth of four AI metrics Clouditive instruments on every Foundations engagement. It is the metric that is most commonly absent in organizations when Clouditive arrives. Teams instrument throughput, sometimes quality, rarely rework, and almost never the senior review time shift.
During the Foundations Assessment, the team interviews senior engineers specifically about whether their daily work distribution has shifted since AI adoption. The questions are direct: what percentage of your time is now spent reviewing AI-generated code versus writing original code? Has that changed in the last six months? Are you finding yourself reversing implementation decisions that a junior or mid-level engineer made with AI assistance?
The answers to those questions, combined with incident root cause data and ADR revision history, produce the decision quality preservation baseline that every subsequent engagement is measured against.
See all four AI signals
The AI metrics page covers all four signals: throughput quality coupling, cognitive offload, AI agent observability, and decision quality preservation.
Read the AI metrics frameworkCommon questions
Decision quality preservation: answers to specific questions
Q: How do you instrument decision rework rate if the team does not write ADRs?
Start by introducing lightweight ADRs at the point where AI adoption is being assessed. You do not need a retrospective library of every past decision. The baseline measurement covers the 90-day window before and after AI adoption. Track: how many significant implementation decisions were reversed or substantially revised in that window? Decisions that get reversed leave traces in pull request comments, sprint retrospective notes, and incident post-mortems. Those records are often available even without a formal ADR practice.
Q: Is the goal of decision quality preservation to slow AI adoption?
No. The goal is to make AI adoption sustainable. Teams that adopt AI tools without measuring decision quality often experience a productivity spike followed by a quality crisis. That cycle creates the counter-narrative that AI tools do not deliver on their promise. Decision quality preservation keeps the productivity gains by catching regressions early, before they compound into architectural debt that requires expensive rewrites.
Q: How does decision quality preservation interact with the AI mirror effect?
The AI mirror effect predicts the direction of AI adoption outcomes based on platform strength. Decision quality preservation is one of the mechanisms through which that outcome is produced. Strong platforms with mature review practices and deliberate architecture processes preserve decision quality when AI accelerates throughput. Weak platforms with informal review and implicit architecture conventions see decision quality degrade. Measuring DQP gives you a leading indicator of which direction your mirror is pointing, before the incident pattern shift makes it visible in retrospect.
Instrument decision quality on your platform
The Foundations Assessment establishes the decision quality baseline before it becomes a liability.
Four to six weeks. Maturity radar. DORA baseline. AI readiness score. 90 day roadmap.