Posts
The Marshmallow Test Was Wrong (Kind Of): What the Latest Science Actually Tells Us About Kids and Delayed Gratification

The Marshmallow Test Was Wrong (Kind Of): What the Latest Science Actually Tells Us About Kids and Delayed Gratification

Jun 23, 2026

The famous marshmallow test has been misread for decades — and the real science is far more empowering for parents than the popular myth ever was.

If you’ve spent any time in parenting circles — or read a pop-psychology book in the last thirty years — you’ve almost certainly encountered the marshmallow test. A small child sits alone at a table. A researcher places a single marshmallow in front of them and says: wait fifteen minutes without eating it, and you’ll get two. Then the researcher leaves the room.

The follow-up research, published through the 1980s and 1990s, appeared to show that the children who waited grew up to get better SAT scores, earn higher incomes, and weigh less. The popular conclusion was electrifying and terrifying in equal measure: your child’s capacity for self-control, revealed in a single moment at age four, might shape the entire arc of their life.

Here’s the thing: the original study was not wrong. The popular takeaway from it was. And the correction — which comes from a richer, more nuanced body of research — is genuinely good news for parents. Because it turns out that delayed gratification is not a fixed personality trait your child either has or doesn’t have. It is a skill that gets built, day by day, through one of the most ordinary things families do together: keeping promises about money.

You Know the Test — But Do You Know the Whole Story?

Walter Mischel and his colleagues ran the original marshmallow experiments at Stanford’s Bing Nursery School in 1970 and 1972. The study published in the Journal of Personality and Social Psychology (Mischel, Ebbesen & Zeiss, 1972) enrolled roughly 90 children — ages 3.5 to 5.5 — who were almost all the sons and daughters of Stanford professors and graduate students. One marshmallow now, or two if you can wait fifteen minutes.

What Mischel found fascinating wasn’t just who waited, but how they waited. The children who succeeded didn’t stare heroically at the marshmallow, summoning willpower. They sang to themselves. They covered their eyes. They invented little games. They turned their chairs around. Thinking directly about the reward made waiting harder, not easier. Successful waiting was, at its heart, a set of learned mental strategies — not a display of raw inner strength.

Then came the follow-up studies. In 1990, Shoda, Mischel, and Peake reported that children who had waited longer showed higher SAT scores in adolescence. Subsequent research linked waiting time to better educational attainment, lower BMI thirty years later, and fewer behavioral problems. These findings rocketed the marshmallow test to cultural stardom. Parenting books, TED talks, and op-eds all drew the same conclusion: if your four-year-old grabs the marshmallow, watch out.

Mischel himself grew deeply uncomfortable with this reading. In a 2015 PBS interview he said: “The idea that your child is doomed if she chooses not to wait for her marshmallows is really a serious misinterpretation.” His 2014 book, The Marshmallow Test: Mastering Self-Control, was partly written to correct the myth he had inadvertently helped create: “Self-control is not a fixed trait. It’s more like a set of strategies that can be learned and improved.”

The scientist who ran the experiment was telling us not to overread it. Almost nobody listened — until 2018.

The Study That Rewrote the Conclusion

In 2018, Tyler Watts, Greg Duncan, and Haohan Qi published “Revisiting the Marshmallow Test” in Psychological Science (Vol. 29, No. 7). Their study used a large, socioeconomically diverse sample drawn from a national longitudinal dataset — roughly 900 children, about ten times the size of Mischel’s most-cited follow-ups. Critically, it included many children whose mothers had not attended college, a population almost entirely absent from Stanford’s Bing Nursery cohort.

The results were striking:

  1. Delay of gratification did predict academic achievement at age 15 — but the effect size was half of what the original studies had reported.
  2. Once the researchers statistically controlled for family background — income, maternal education level, cognitive ability, and home environment quality — the correlation between marshmallow-waiting and achievement outcomes nearly vanished, reduced to statistical nonsignificance.
  3. The link between waiting time and behavioral outcomes didn’t replicate at all.
  4. Perhaps most puzzling: all predictive power was concentrated in the difference between waiting fewer than twenty seconds versus any longer. Children who waited two minutes versus seven minutes showed no additional benefit. Delay was not a continuous willpower trait — something more categorical and situational was at work.

Watts summarized the practical takeaway plainly: “If you have two kids who have the same background environment, the same kind of parenting, the same ethnicity, the same gender, a similar home environment, similar early cognitive ability — then if one of them is able to delay gratification and the other one isn’t, does that matter? Our study says, ‘Eh, probably not.‘”

Developmental psychologist Pamela Davis-Kean of the University of Michigan put it another way, speaking to Vox: “It’s very hard to find psychological effects that are not explained by the socioeconomic status of families. Nothing changes a kid’s environment like money. Money buys good food, quiet neighborhoods, safe homes, less stressed and healthier parents, books, and time to spend with children.”

The original marshmallow correlation wasn’t measuring a stable inner trait. It was measuring the downstream effects of the stable, resource-rich, low-stress environment that both produced greater waiting ability and better life outcomes. The marshmallow performance was a symptom, not a cause.

The Rochester Twist: It Was Never About Willpower

The single most important study for parents came not from Stanford or a national longitudinal dataset, but from a lab at the University of Rochester. Celeste Kidd, Holly Palmeri, and Richard N. Aslin published their findings in Cognition (2013), based on research conducted in 2012.

Before giving 28 children the standard marshmallow test, they divided them into two groups with a small pre-test interaction. In the reliable group, the experimenter made a small promise — “I’ll go get you some better art supplies” — and came back with them. In the unreliable group, the experimenter made the same promise and returned empty-handed with an apology.

The marshmallow results were dramatic. Children in the reliable group waited an average of about twelve minutes. Children in the unreliable group waited an average of about three minutes — a fourfold difference, produced entirely by one broken promise minutes earlier.

Here’s why this matters so much: if raw willpower were what determined waiting, the two groups should have performed identically. Their innate self-control hadn’t changed. What changed was their rational assessment of whether the environment could be trusted to follow through on its promises.

A child who grabs the marshmallow may not lack self-control at all. They may have made the entirely logical decision that in their experience, waiting doesn’t reliably pay off. They may be right.

This reframes the question for parents completely. Delayed gratification is not a fixed trait your child has or lacks. It is a learned belief that waiting is safe and worthwhile — a belief assembled, piece by piece, through repeated experiences of promises kept and goals reached.

What This Means for the Money Habits Forming Right Now

Researchers at the University of Cambridge (Dr. David Whitebread and Dr. Sue Bingham, Habit Formation and Learning in Young Children) found that money habits — default patterns around saving and spending, emotional responses to money, and trust in financial systems — are largely formed by age 7 and persist into adulthood with remarkable stability unless deliberately disrupted.

The Consumer Financial Protection Bureau’s Building Blocks framework, highlighted in their Financial Literacy Annual Report (December 2025), identifies Executive Function as the strongest predictor of long-term financial outcomes. It develops through practice — specifically, through real choices involving real consequences. A four-year-old deciding between a sticker now and two stickers in three days is doing the same neurological work as a twenty-five-year-old choosing between spending a paycheck and building an emergency fund.

Thirty states now require a standalone personal-finance course for high school graduation — genuine progress. But the Cambridge research tells us habits form at 7, and state mandates reach kids at 16 or 17. That’s a roughly decade-long gap that families fill — or don’t.

According to T. Rowe Price’s most recent Parents, Kids & Money Survey, children whose parents actively discuss money decisions are three times more likely to develop healthy financial behaviors. Yet 66% of parents have at least some reluctance to discuss money with their 8-to-14-year-olds. The gap is real, and it starts earlier than most families realize.

The Trust Engine: How You Build Delayed Gratification at Home

The Rochester study gives us the mechanism. Here is how to activate it:

1. Keep small money promises. This is the single highest-leverage action available to you. Allowance that lands on the same day every week. A savings-goal reward that actually appears when the goal is reached. When you say “we’ll buy that when you’ve saved up enough,” follow through. Every kept promise is a brick in the neural foundation that makes waiting feel safe.

2. Make waiting visibly pay off. Abstract future rewards are hard for young brains to hold onto. Make them concrete: a thermometer chart on the refrigerator, a glass jar where coins are added weekly, a photo of the target item taped to the savings container. When your child can see the gap closing, waiting becomes a strategy rather than a sacrifice. See more goal-setting ideas at teaching kids to save with allowance and chores.

3. Give real choices with real (small) money. The CFPB’s Executive Function building block develops through practice — not lectures. Starting as early as age 5, give children genuine spending decisions. Fifty small choices by age 8 — each with a real consequence they can feel — does more for financial self-regulation than any amount of explaining.

4. Normalize mistakes without shame. A child who blows their allowance on something they regret isn’t failing — they’re learning. Resist the urge to rescue or lecture. Instead, ask: “What were you hoping for? What actually happened? What would you do differently next time?” For a full framework, see letting kids make money mistakes safely.

5. Keep routines predictable. Allowance on the same day, every week, without reminders needed. Consistency in the financial environment teaches children that systems are trustworthy. Inconsistency — missed paydays, promises that materialize sometimes and don’t other times — teaches the opposite, and the Rochester study shows us exactly what that does to waiting behavior.

6. Narrate your own financial choices. “I’m not buying that today because we’re saving for the camping trip.” “This one costs less, so I’m choosing it.” Children absorb modeled behavior far more readily than explained values. You are your child’s most-watched financial educator, and you don’t have to engineer a lesson — you just have to let them see what you’re already doing.

A Quick Framework: Save, Spend, Share

One of the most effective structures for building these habits is the three-bucket system. It’s simple enough for a five-year-old and robust enough to last into adolescence.

  • Save bucket: Money set aside toward a specific, named goal — with a visible tracker. The goal should be just out of reach but achievable within a few weeks (for young children) to a few months (for older ones). The tracker makes the abstract concrete.
  • Spend bucket: Money that is freely the child’s to use as they choose — including to make mistakes. This is where real financial learning happens. Real regret is the curriculum.
  • Share bucket: Money designated for giving. This bucket is quietly powerful: choosing to give up money now for the benefit of someone else is delayed gratification in one of its purest forms.

Here’s a rough guide to age-appropriate allowance amounts that make this framework workable:

AgeWeekly AllowanceKey Focus
4–5$2–$4Coins are real; binary choices; first saving jars
6–8$5–$10Save/Spend/Share buckets; real regret is the curriculum
9–11$10–$15Longer goals (4–6 week saves); comparison shopping
12–14$15–$25Digital money literacy; multi-month goals
15–17$25–$50Budget management; multi-month savings; meaningful giving

These are starting points, not rules — adjust for your family’s circumstances and your child’s demonstrated readiness.

The Bottom Line

The marshmallow test was never a verdict on your child. It was a window into how environments shape behavior — and that is genuinely great news, because environments are something parents help create.

The 2018 replication showed us that family background explained most of what the original study attributed to willpower. The Rochester Trust Study showed us why: children rationally assess whether their environment can be trusted to deliver on its promises, and they wait accordingly. The science points in one clear direction — toward the ordinary, unglamorous, deeply effective practice of keeping small money promises and making the benefits of waiting visible and real.

Every time you pay allowance on the same day you said you would, every time a savings-goal reward actually appears when the jar is full, every time you follow through on a small financial commitment, you are doing something more powerful than any willpower drill or behavioral intervention. You are teaching your child — at the neurological level — that waiting is safe.

Tools like Isembl can help you maintain the consistent schedules and visible goal tracking that make this work in busy family life — whether your household runs in English, Spanish, French, or any combination of the three. But the engine underneath is trust, and that’s something only you can build.

The child who grabs the marshmallow isn’t doomed. They’re giving you useful information: they need more evidence that waiting pays off in your household. That evidence is yours to provide, one kept promise at a time.

en