The scenarioAmazon's fulfilment network processes billions of items annually, and even a fractional defect rate translates into millions of mis-shipped, damaged, or delayed orders. By the mid-2010s, Amazon's operations teams had built what they internally called "Correction of Errors" (COE) processes — structured post-incident analyses that bear strong resemblance to Pareto-driven root cause work. The challenge wasn't identifying that errors existed; it was deciding which errors to fix first across a network of over 100 fulfilment centres, each with its own local defect profile.
How the tool appliedEach fulfilment centre ran weekly Pareto analyses on its defect data, categorising errors by type (wrong item, damaged in transit, late shipment, missing item, incorrect quantity) and sorting by customer impact — measured not in defect count but in "concessions" (refunds, replacements, and credits issued). This metric choice was deliberate. A wrong-item error that the customer keeps and receives a replacement for costs Amazon roughly twice what a late-shipment error costs. Sorting by concession dollars rather than defect count consistently surfaced different priorities than the operations teams expected. At one facility, "incorrect quantity" errors were the most frequent defect type but only the fourth-most-costly. "Wrong item" errors were less frequent but far more expensive per incident because they triggered both a replacement shipment and a return logistics chain.
What it surfacedThe network-level aggregation revealed a pattern invisible at the individual facility level: across all centres, a single defect type — items stored in the wrong bin location ("stow errors") — was the upstream cause of three of the top five defect categories. Wrong-item picks, incorrect quantities, and a subset of late shipments all traced back to stow errors that propagated through the pick-pack-ship chain. The Pareto chart at the defect level showed five problems. The Pareto chart at the root-cause level showed one.
The non-obvious factorAmazon's distinctive contribution wasn't the Pareto analysis itself — it was the iteration speed. Centres re-ran the analysis weekly, not quarterly. Each week's chart reflected the previous week's interventions. When a centre fixed its stow-error process and the wrong-item defect rate dropped, the Pareto chart immediately reshuffled, surfacing the next-highest-impact category for attention. This created a ratchet effect: each week's vital few got smaller, and the cumulative improvement compounded. Colin Bryar and Bill Carr describe this rhythm in — the relentless, data-driven cycle of measure, rank, fix, re-measure. The tool is simple. The discipline of running it every week, acting on it every week, and never treating any single analysis as the final word — that's what made it work at Amazon's scale.