·Business & Strategy
Section 1
The Core Idea
In 1968, Sackman, Erikson, and Grant published a study in Communications of the ACM measuring programmer performance on identical tasks. The best performers finished 10 times faster than the worst. Their code ran up to 11 times more efficiently. The sample was small, the methodology debated for decades. But the directional finding — that productivity among knowledge workers varies by an order of magnitude, not a modest percentage — has been replicated so consistently that it now operates as an architectural principle for how the best technology companies are built.
The 10x label is frequently reduced to "some people code faster." That misses the mechanism entirely. A 10x engineer's advantage is not typing speed. It is the quality of the decisions that precede the typing. Selecting the right abstraction on day one eliminates entire categories of bugs that a less capable engineer would spend months fixing. Designing a system that accommodates future requirements without rework saves not one person's time but the accumulated time of every engineer who touches that codebase for years afterward. The multiplier operates on decisions, and decisions compound. A well-chosen architecture is a gift that keeps paying; a poorly chosen one is a tax that never stops collecting.
Steve Jobs articulated this as an organizational philosophy in his 1995 interview with Robert Cringely: "In software, the difference between the average and the best is fifty to one. Maybe a hundred to one." Jobs was not engaging in motivational hyperbole. He was describing a measurement that changed how he hired, fired, and structured every team at Apple. His conclusion was operational: a small team of A-plus players will run circles around a giant team of B and C players — not because they work harder, but because the quality of their judgment, their taste in solutions, and their intolerance for mediocre approaches produces output that scales non-linearly with headcount.
The evidence arrives in increasingly compressed form. When Facebook acquired Instagram in April 2012 for $1 billion, the company had 13 employees serving 30 million users. WhatsApp operated with 55 engineers supporting 450 million users when Facebook acquired it for $19 billion in February 2014. Midjourney — a company generating hundreds of millions in annual revenue — operated with roughly 40 employees as of mid-2024. In each case, the output of a small group of exceptional individuals exceeded what organizations 50 to 200 times their size produced in the same domain. The variable was not budget or hours logged. It was the concentration of capability per seat.
Netflix operationalized this through what
Reed Hastings calls talent density. The company's culture memo — downloaded over 20 million times — states the principle bluntly: "One outstanding employee gets more done and costs less than two adequate employees." Netflix pays at or above the top of market for every role and terminates adequate performers to maintain the average. The logic is uncomfortable and arithmetically sound: coordination costs scale quadratically with headcount (n × (n-1) / 2 communication channels), while individual output scales linearly at best. A team of 5 exceptional engineers has 10 communication channels. A team of 20 average engineers has 190. The second team ships slower despite quadrupling the payroll.
Stripe's "increase the average" hiring philosophy follows the same logic from a different angle. Patrick Collison has said that every hire should raise the average quality of the team they join — meaning the bar rises with every addition. The practical implication: saying no to a good candidate because they don't raise the average is the hardest and most valuable discipline in scaling. Stripe built a payments infrastructure serving millions of businesses with an engineering team a fraction of the size competitors deployed. The product's elegance — developers could integrate Stripe in seven lines of code — was not the output of a large team working adequately. It was the output of a small team working at the frontier of what was possible.
The gap compounds because of how knowledge work differs from physical labor. A warehouse picker who is twice as productive as a colleague moves twice the boxes. A software architect who selects the right database schema on day one does not merely save engineering time — they eliminate data migration projects, reduce query complexity for every subsequent feature, and create a foundation that supports products the original designer never imagined. The 10x engineer's output is not 10 widgets instead of 1. It is a system that generates value for years versus a system that requires constant repair. The multiplier acts on the decision, and the decision's consequences compound through every person and process downstream.
The model extends beyond software. In venture capital, a small number of partners generate the overwhelming majority of returns. In scientific research, Lotka's Law — first observed in 1926 — found that the number of authors making n contributions is roughly 1/n² of those making one. In sales, the top performer consistently closes three to five times the revenue of the median rep, not because they make more calls but because they qualify better, sequence better, and close with higher conversion. The distribution of output in knowledge-intensive work is not bell-curved. It is power-law shaped. Organizations that staff by headcount rather than by capability density are systematically underperforming against competitors who understand the shape of the curve.