In the spring of 2016, Mondelez International offered $23 billion to buy The Hershey Company. The bid represented a roughly 10% premium to Hershey's already-elevated market capitalization and would have merged Oreo, Cadbury, and Chips Ahoy with Reese's, Kisses, and Twizzlers into a transatlantic confectionery colossus — the kind of deal investment bankers build entire careers hoping to close. Hershey's board rejected it. Not because the price was wrong, though they said it was, but because the Hershey
Trust Company — the entity
Milton Hershey created in 1905 to fund a school for orphaned boys, and which still controls approximately 80% of the voting power of the company's stock — would not let the company be sold. The Trust's fiduciary obligation ran not to shareholders seeking a liquidity event but to disadvantaged children seeking an education. The most consequential corporate governance mechanism in American confectionery is, and has been for over a century, an orphanage.
This is the paradox at the center of Hershey: a publicly traded company whose controlling shareholder is a philanthropic trust, whose products are sold in ninety countries but whose identity is indivisible from a single small town in central Pennsylvania, whose competitive moat was built in the early twentieth century on a formula for milk chocolate that European chocolatiers consider barely palatable — and whose stock has compounded at an annualized rate of roughly 11% over four decades. Hershey is not a startup story, not a disruption narrative, not a platform play. It is something rarer and, in certain respects, more instructive: a study in how brand permanence, structural governance, and relentless incrementalism can generate extraordinary long-term returns in a commodity-adjacent industry where the raw material — cocoa — is subject to wild price swings, consumer preferences shift with every wellness trend, and the competition includes two of the most formidable private companies on earth.
By the Numbers
Hershey at a Glance
$11.2BNet sales, FY2024
~20%Operating margin (historical average)
#1U.S. confectionery market position
~80%Voting power held by Hershey Trust
100+Brands in portfolio
$1.6BNet income, FY2022
130+Years of continuous operation
~19,000Employees worldwide
The Candy Man's Theorem
Milton Snavely Hershey was born on September 13, 1857, in Derry Township, Pennsylvania, to a Mennonite family fractured by his father Henry's serial entrepreneurial failures. Henry was a dreamer — charming, literate, perpetually chasing the next scheme — while Fanny Hershey was austere, practical, and deeply religious. The marriage dissolved under the weight of Henry's wandering. Milton, who never progressed past fourth grade, was apprenticed at age fourteen to a confectioner in Lancaster, Pennsylvania. He absorbed the trade the way certain people absorb language: totally, instinctively, without needing to understand the grammar.
What followed was a decade of failure. Hershey started a candy business in Philadelphia in 1876. It went bankrupt. He tried again in Denver, learning to make caramels with fresh milk — a technique that would prove decisive — then moved to New York. That venture failed too. He returned to Lancaster in 1886, humiliated and nearly broke, and started the Lancaster Caramel Company with borrowed money. This time it worked. The fresh-milk caramels were superior. By the early 1890s, the business was generating over $1 million in annual sales.
But Hershey had seen something at the 1893 World's Columbian Exposition in Chicago that rewired his ambition. A German manufacturer was demonstrating chocolate-making machinery. Caramels, Hershey reportedly told associates, were a fad. Chocolate was permanent. In 1900, he sold the Lancaster Caramel Company for $1 million — a staggering sum, equivalent to roughly $37 million today — and kept only the chocolate-manufacturing equipment. He was betting everything on a single product category, and he was forty-three years old.
Michael D'Antonio's biography
Hershey: Milton S. Hershey's Extraordinary Life of Wealth, Empire, and Utopian Dreams captures the almost reckless clarity of this pivot. Hershey didn't just want to make chocolate. He wanted to democratize it. In 1900, chocolate was a luxury good — handmade, expensive, consumed by the wealthy. Hershey's insight, which he shared with
Henry Ford's insight about automobiles, was that the real money was in making a luxury product affordable through industrial-scale production. He would make milk chocolate — richer, sweeter, more accessible than the dark European varieties — and sell it for a nickel.
Give them quality. That's the best kind of advertising in the world.
— Milton Hershey, as quoted in various biographical accounts
Building a Town to Build a [Brand](/mental-models/brand)
The factory Hershey constructed was not in a city. It was in the middle of Derry Township's dairy country — the same rural landscape where he'd been born. His logic was characteristically concrete: milk chocolate required enormous quantities of fresh milk. Pennsylvania dairy country provided it. He built not only a factory but an entire town: paved streets named Chocolate Avenue and Cocoa Avenue, worker housing, a trolley system, a department store, a bank, churches, a community center, an amusement park, a zoo. Hershey, Pennsylvania — "The Sweetest Place on Earth" — was a company town in the fullest sense, designed and financed by one man's conviction that the environment in which people worked determined the quality of what they produced.
This was not mere philanthropy, though it contained genuine idealism. It was a vertically integrated brand strategy executed before the concept had a name. The town was the marketing. Hershey famously refused to advertise nationally — a stance the company maintained for nearly seventy years, from its founding until 1970. While competitors like Mars spent aggressively on print and radio, Hershey relied on the wrapper itself (tossed on the ground, it was a free billboard), word of mouth, and the magnetic pull of the town, which drew tourists who left as brand evangelists. The Hershey bar's brown-and-silver wrapper became, through sheer ubiquity, as recognizable as the Coca-Cola script.
The no-advertising policy was not, strictly speaking, rational. It was ideological — rooted in Milton Hershey's Mennonite conviction that quality spoke for itself and in his shrewd understanding that the story of Hershey (the generous chocolatier, the utopian town, the school for orphans) was a more powerful brand narrative than any advertisement could construct. He was, without using the term, building what we'd now call an earned-media moat.
The Formula That Shouldn't Work
The milk chocolate Hershey developed in the early 1900s has a distinctive flavor that Europeans — and a significant number of American food critics — find puzzling. It has a slight tang, a faintly sour, almost cheesy note that results from the way Hershey's process handles milk. The prevailing theory, widely accepted though Hershey has never publicly confirmed the precise details, is that the milk undergoes a controlled lipolysis — a partial breakdown of milk fats — before being combined with chocolate liquor and sugar. The resulting flavor compound, butyric acid, is the same molecule found in Parmesan cheese and, less appetizingly, vomit.
This sounds like a defect. It is, instead, the moat.
American consumers who grew up eating Hershey's chocolate — which is to say, nearly all American consumers — imprinted on this flavor profile the way ducklings imprint on the first moving object they see. The taste of a Hershey bar is the taste of chocolate, to Americans. It is the flavor of Halloween, of s'mores around a campfire, of the Reese's Peanut Butter Cup torn open in a movie theater. Competitors who have tried to enter the U.S. market with "superior" European-style chocolate have consistently discovered that American consumers do not want superior chocolate. They want their chocolate. The butyric acid note that Swiss chocolatiers find objectionable is, for 330 million Americans, the Proustian madeleine.
This is a textbook illustration of Hamilton Helmer's concept of "counter-positioning" mutating over time into "brand power." The initial process was likely a cost-driven manufacturing decision — a way to use less-than-perfectly-fresh milk in a pre-refrigeration era. But the flavor it produced became the standard against which all other chocolate in America was measured. When you define the category, you own the category. And when the category definition is literally baked into the taste buds of the national population through a century of consumption, the switching costs are not financial. They are neurological.
The Orphans' Dividend
In 1909, Milton and his wife Catherine — who were unable to have children — established the Hershey Industrial School (later renamed the Milton Hershey School) for orphaned boys. In 1918, three years after Catherine's death, Milton transferred his entire fortune — including his controlling stake in the Hershey Chocolate Company — to the Milton Hershey School Trust. The value of the transfer was approximately $60 million, equivalent to roughly $1.2 billion today. He was sixty-one years old. He had given away everything.
The structural implications of this act have echoed for more than a century. The Hershey Trust Company, which administers the school's endowment, holds all of Hershey's Class B common stock, which carries ten votes per share. This gives the Trust approximately 80% of the voting power of the company, despite owning a smaller percentage of total equity. The Trust's fiduciary duty runs to the students of the Milton Hershey School — currently about 2,100 children from low-income families who receive free education, housing, and comprehensive support from pre-kindergarten through twelfth grade, on a 10,000-acre campus in Hershey, Pennsylvania.
This governance structure is, simultaneously, Hershey's greatest defense and its most debated constraint. The Trust has repeatedly blocked takeover attempts — most dramatically the 2002 bid by Wrigley (backed by a group that reportedly included Nestlé and Cadbury) and the 2016 Mondelez approach. The Pennsylvania Attorney General has intervened in past attempts to sell, arguing that the Trust's control of Hershey is essential to its charitable mission. The result is a public company that cannot be acquired against the Trust's wishes, which effectively means: Hershey cannot be acquired. Period.
For long-term shareholders, this is a feature. The Trust's permanence as a controlling shareholder creates a time horizon that is genuinely intergenerational — it is not optimizing for quarterly earnings or a three-year private equity hold period but for the perpetual funding of a school. For activist investors or would-be acquirers, it is a fortress with no drawbridge.
How Milton Hershey's philanthropy became a governance mechanism
1909Milton and Catherine Hershey establish the Hershey Industrial School for orphaned boys.
1918Milton transfers his entire Hershey Chocolate Company stake to the school trust, valued at ~$60 million.
1927Hershey Chocolate Company goes public on the NYSE. The Trust retains controlling interest.
2002Hershey Trust explores sale to Wrigley consortium. Pennsylvania AG and public outcry block the deal.
2016Mondelez bids $23 billion. Trust rejects, maintaining independence.
2025Trust still controls ~80% of voting power. Milton Hershey School enrolls ~2,100 students.
The Depression, the War, and the Wrapper
The 1930s tested every American enterprise. Hershey, remarkably, never laid off a worker during the Great Depression. Milton instead redirected employees to construction projects — building the Hotel Hershey, a community center, a sports arena, new facilities for the school. He was, in effect, running a Keynesian stimulus program in miniature, a decade before the concept had entered mainstream economic thought. The loyalty this generated was immense — and it was tested almost immediately.
In April 1937, workers at the Hershey factory staged a sit-down strike, inspired by the wave of labor actions sweeping American industry. The strikers, affiliated with the CIO (Congress of Industrial Organizations), occupied the plant for six days. The resolution was violent: local dairy farmers and non-striking workers, fearful of losing their livelihoods if the factory closed, physically ejected the strikers. Milton Hershey, who viewed the strike as a personal betrayal, was devastated. The episode revealed the fragility of even the most paternalistic corporate culture — the town that had been built on generosity could not, in the end, accommodate dissent.
World War II restored the company's narrative. The U.S. military contracted Hershey to produce a survival ration — the "Field Ration D" bar, engineered to withstand high temperatures and provide concentrated calories. It tasted terrible by design (to prevent soldiers from eating it as candy rather than saving it for emergencies), but Hershey produced over a billion of them. The Tropical Chocolate Bar followed — slightly more palatable, distributed across every theater of the war. Soldiers returned home with the Hershey name encoded in their wartime memories, which is about as powerful a brand association as any company could wish for. The government awarded Hershey the Army-Navy "E" Production Award five times.
Milton Hershey died on October 13, 1945, at age eighty-eight, having witnessed his creation survive depression, labor unrest, and global war. He left behind no biological heirs. He left behind a company, a town, a school, and a trust — an interlocking system designed to perpetuate itself indefinitely.
The Long Plateau and the Advertising Question
For the quarter century following Milton's death, the company was managed by loyalists who treated his methods as scripture. The no-advertising policy held. The product line expanded cautiously — Reese's Peanut Butter Cups (acquired when H.B. Reese, a former Hershey employee, died in 1963 and his family sold the business) became a cornerstone, but the core identity remained chocolate bars and Kisses sold without promotional support in a market that was rapidly professionalizing its marketing.
The reckoning arrived in the late 1960s. Mars — private, aggressive, and increasingly sophisticated — was eroding Hershey's market share with M&M's, Snickers, and Milky Way, backed by heavy television advertising. Health-consciousness was creeping into American food culture for the first time, and candy consumption per capita was declining. The company's sales were stagnating.
William Dearden — who had attended the Milton Hershey School as a boy after his mother's death, worked his way through the company, and eventually became CEO — made the heretical decision to advertise. In July 1970, Hershey placed ads in 114 Sunday newspapers. Two months later, the company's first television and radio commercials aired. The move was itself a news event — the fact that Hershey was advertising at all generated coverage that amplified the campaign's reach. Dearden hired Ogilvy & Mather, one of the era's preeminent agencies.
The irony was exquisite: seventy years of not advertising had created such powerful brand recognition that the act of finally advertising became its own form of publicity. Milton Hershey's refusal to advertise had, paradoxically, made advertising maximally effective when it was finally deployed.
The Reese's Machine
If the Hershey bar is the company's soul, Reese's is its engine. The brand, built on the deceptively simple combination of chocolate and peanut butter, has become the single largest confectionery brand in the United States — larger than Snickers, larger than M&M's, larger than the Hershey bar itself. Reese's Peanut Butter Cups alone generate billions in annual retail sales.
H.B. Reese was a dairy farmer who went to work at the Hershey factory, then left to start his own candy company in the basement of his home in Hershey, Pennsylvania, in 1923. He used Hershey's chocolate coating for his products — a relationship that was simultaneously competitive and symbiotic. When Reese died in 1956, his sons continued the business until merging it with Hershey in 1963. The merger price, by contemporary standards, was modest. The return on that investment has been, by any measure, one of the great acquisitions in consumer products history.
What makes Reese's durable is the specificity of its flavor combination. Chocolate-and-peanut-butter is not a generic pairing that any manufacturer can replicate — it is, like Hershey's chocolate itself, a specific ratio, a specific texture, a specific experience that consumers have bonded with over decades. The brand has proven extensible (Reese's Pieces, Reese's Sticks, Reese's Miniatures, Reese's Big Cup, seasonal shapes) without diluting the core. Michele Buck, who became CEO in 2017, has called Reese's "the jewel in the crown" — and the financials support the metaphor.
Our consumers tell us that chocolate provides emotional well-being, and we've seen especially recently as stress levels register higher, that chocolate plays an even bigger role in their lives.
— Michele Buck, CEO of Hershey, Fortune interview, 2025
The First Woman and the Salty [Pivot](/mental-models/pivot)
Michele Buck grew up on a farm without indoor plumbing, put herself through school, and spent years at Frito-Lay and other PepsiCo divisions before joining Hershey in 2005. She became the company's first female CEO in March 2017, following a stint as chief operating officer, and immediately articulated a vision that would have been unthinkable to Milton Hershey: she wanted to make Hershey a snacking powerhouse, not just a candy company.
The logic was deceptively straightforward. The American snacking market — broadly defined to include salty snacks, better-for-you options, and on-the-go formats — was growing faster than confectionery alone. Hershey's distribution infrastructure, its relationships with mass retailers and convenience stores, and its consumer insights capabilities were transferable across snack categories. The constraint was that Hershey had no credible brands outside of confectionery. Buck's answer was M&A.
The acquisitions came in a deliberate sequence: Amplify Snack Brands (SkinnyPop popcorn) in 2018 for approximately $1.6 billion, Pirate's Booty in 2018, and Dot's Homestyle Pretzels in 2021. Each target shared specific characteristics Buck would later codify: at least $100 million in annual sales, high margins, ease of integration into Hershey's distribution system, and a brand identity that Hershey could build rather than create from scratch. "We are builders of brands, we are not necessarily creators," she told Fortune in 2025. It is an unusually honest self-assessment for a CEO — an acknowledgment that the company's core competency is distribution, marketing, and brand stewardship, not product invention.
The salty snacks segment reached approximately 10% of Hershey's $11.2 billion in 2024 revenue — meaningful but still a fraction of the confectionery business. Buck herself acknowledged that she wished the transformation had moved faster. "Early on, our M&A capability was not where it needed to be and we stubbed our toe on a couple of small acquisitions," she admitted. The candor is notable. Building M&A as an organizational competency — not just finding deals but developing the institutional muscle to evaluate, integrate, and grow acquired businesses — is among the most underappreciated challenges in corporate strategy.
🥨
The Snacking Acquisitions
Hershey's expansion beyond confectionery
| Acquisition | Year | Category | Rationale |
|---|
| Amplify Snack Brands (SkinnyPop) | 2018 | Better-for-you popcorn | Entry into $10B+ salty snacks market |
| Pirate's Booty | 2018 | Puffed snacks | Kids/family snacking occasions |
| Dot's Homestyle Pretzels | 2021 | Salty/savory pretzels | High-margin, high-growth pretzel category |
| ONE Brands (protein bars) | 2019 | Better-for-you bars | Health-conscious consumer segment |
The Cocoa Crisis and the Price of Sweetness
In 2024, cocoa prices hit fifty-year highs. The causes were structural and meteorological — poor harvests in West Africa (which produces roughly 70% of the world's cocoa), disease affecting cocoa trees, and the long-term underinvestment in farming infrastructure that had been building for decades. Cocoa futures that had traded around $2,500 per metric ton for most of the 2010s surged past $10,000 in early 2024 and spiked above $12,000 by year-end, a price level with no modern precedent.
For Hershey, which purchases enormous quantities of cocoa and cocoa derivatives, the impact was severe. Input costs surged. Margins compressed. The stock price, which had peaked near $275 in April 2023, fell more than 30% over the following eighteen months. Wall Street analysts, long accustomed to Hershey's steady, almost boring predictability, issued rare criticisms. The company raised prices — it had always been able to raise prices, given the brand power — but the magnitude of the cocoa shock tested the limits of consumer price elasticity in a way that sugar taxes and health trends never had.
Buck navigated the crisis with the same approach she'd brought to COVID-19, which she described as an exercise in empathetic leadership and operational triage. The company hedged cocoa purchases, adjusted pack sizes, and leaned on its salty snacks business — which was not cocoa-dependent — as a partial buffer. But the episode exposed a vulnerability that had always been latent in the business model: Hershey is, at its core, a value-added processor of an agricultural commodity whose supply is concentrated in politically unstable regions. The brand power that allows pricing is real. The commodity exposure that requires it is also real.
The MAHA Shadow
As Buck prepared to hand the CEO role to Kirk Tanner — a PepsiCo and Wendy's veteran who took over in August 2025 — a different kind of threat was gathering. The "Make America Healthy Again" movement, energized by political figures and amplified by social media, was pressuring food manufacturers to reformulate products, reduce sugar content, and confront the health implications of ultra-processed foods. For a company whose flagship products are chocolate bars and peanut butter cups, the rhetorical environment was hostile.
Buck, characteristically, refused to panic. "I don't see it as a hard right turn or like nothing we've ever seen before," she told Fortune in her exit interview. The candy industry had survived the health-food movement of the 1970s, the low-fat craze of the 1990s, and the organic revolution of the 2000s. Confectionery, Buck argued, occupied a different psychological space than everyday food — it was a treat, an indulgence, an emotional reward. Consumers who virtuously purchased kale and quinoa for dinner still wanted a Reese's Cup after.
The data largely supports this. U.S. confectionery sales have grown through every major health-consciousness cycle of the past fifty years. The category's resilience suggests that confectionery is not competing with healthy food — it is competing with other forms of small indulgence: a latte, a cocktail, a streaming subscription. The real competitive set is not carrots. It is dopamine.
The Anti-Acquisition
Hershey's governance structure — the Trust's voting control, the Pennsylvania AG's historical willingness to intervene, the deep emotional attachment between the Hershey brand and the community that bears its name — has created what might be the most takeover-proof company in American consumer products. This is not, as it might first appear, simply a story about defense. It is a story about what a company does when it cannot be bought.
Companies that can be acquired are disciplined by the threat of acquisition. If management underperforms, an acquirer will replace them. The market for corporate control, as economists call it, provides a check on complacency. Hershey lacks this check. The Trust's control means that no matter how badly the stock underperforms, no hostile bidder can take the company. This should, in theory, breed complacency.
It hasn't — at least not fatally. Why? The answer may lie in the unusual alignment of interests created by the Trust structure. The Trust needs Hershey's dividends to fund the school. The school needs the dividends to grow indefinitely. The Trust therefore has an incentive to ensure that Hershey generates strong, growing cash flows — not the rapid growth that venture-backed companies pursue, but the steady compounding that funds a perpetual endowment. The time horizon is not quarterly. It is not even decadal. It is genuinely perpetual. And this orientation, paradoxically, may be the most rational long-term shareholder structure available to a public company.
I'm very open to mergers and acquisitions. I see them playing a key role in our growth agenda going forward. We've an opportunity with mergers and acquisitions to go into spaces where our brands currently can't travel.
— Michele Buck, Fortune, 2017
130 Years of Wrapper on the Ground
Consider the arc. A farm boy with a fourth-grade education fails twice in the candy business, succeeds on the third attempt with milk caramels, sees a chocolate machine at a world's fair, sells everything, builds a factory in a cornfield surrounded by dairy cows, designs an entire town, creates a flavor of chocolate that is technically inferior by European standards but neurologically imprinted on the American palate, refuses to advertise for seven decades, gives his entire fortune to an orphanage that still controls the company, and — through this improbable sequence — produces a business that generates over $11 billion in annual revenue, maintains operating margins above 20%, and has compounded shareholder wealth for nearly a century.
The company Milton Hershey built has survived him by eighty years. It has survived two world wars, the Great Depression, a violent strike, hostile takeover attempts, the health-food movement, a global pandemic, and the most severe cocoa price shock in half a century. It has done this not through disruption or technological innovation but through the compounding of small advantages: a proprietary flavor, extraordinary brand recognition, relentless distribution, a governance structure that prevents short-termism, and a willingness to evolve — slowly, cautiously, but purposefully — from a chocolate company into a snacking company.
Kirk Tanner, the new CEO, inherits a machine. The question is not whether the machine works — it has worked for 130 years — but whether the same principles that built it can carry it into a world of GLP-1 drugs, shifting cocoa economics, and consumers who increasingly demand that corporations justify not just their products but their existence.
In Hershey, Pennsylvania, the streetlights are still shaped like Hershey's Kisses. The air, on certain days when the factory is running at full capacity, still smells of chocolate. And 2,100 children from low-income families are attending school — free of charge, fully funded — because a man who went bankrupt twice decided that the best use of his fortune was not a dynasty but a trust, and the best advertisement for his chocolate was a wrapper on the ground.