Derech Truth Labs  ·  Unapologetically Faithful. Searching with Evidence.
The Hollowed House  ·  Part 1 of 8

The Report Card: Where the Richest Nation in the World Actually Stands

We spend more per student than nearly every nation on earth. We have launched reform after reform, passed law after law, and declared victory after victory. The data says otherwise — and we owe it to the children to read it honestly.

Tier 1 — Verified
Tier 2 — Interpretation Required
Tier 3 — Unverified / Single-Source
Tier 4 — False or Misleading
Author's Note

Before We Begin

I am a pastor, not a policy analyst. A Board Certified Christian Counselor, not an education researcher. I hold no doctorate, no academic credential in educational theory, no institutional affiliation that would give a peer-review committee reason to take my calls. What I have is twenty years of sitting across from families — and a research methodology that refuses to accept the official account without examining the evidence behind it.

For twenty years I have sat across from parents and children in a counseling room. I have watched boys who loved learning become boys who hated school. I have sat with parents who couldn't understand why their daughter — bright, curious, eager in elementary school — arrived at high school without the reading skills she should have developed years earlier. I have counseled young people in genuine distress who were handed an identity framework by their school before anyone thought to ask what was actually wrong.

This paper is what I found when I decided to stop accepting the official account and go looking at the evidence myself. What follows is not a conservative political brief. It is not a nostalgic argument for a school system that never existed. It is an evidence-based examination — claim by claim, source by source — of what thirty years of data actually says about American public education.

My lens is declared openly: I believe the Bible is the authoritative Word of God. I believe children are image-bearers deserving of an honest account of reality and the tools to navigate it. I believe bearing false witness — even for a just cause — remains bearing false witness. I believe the fruit of the Spirit should be evident in how we engage even the most contentious topics. Knowing my lens, you can account for it. Proverbs 18:17 applies to me too: the one who states his case first seems right, until the other comes and examines him.

This paper was produced collaboratively with AI research tools. The judgment, direction, and conclusions are mine. The research breadth is AI-assisted. Every source cited is real and verifiable. The methodology — examine both sides, name what the evidence actually says, distinguish between verified fact and contested interpretation — is the same standard I would apply to any other claim.

"Buy truth and do not sell it — wisdom, instruction, and understanding."

— Proverbs 23:23
Prologue

What a Hollowed House Looks Like

A hollowed house does not announce itself. That is the nature of the danger. From the street, everything appears to be in order: fresh paint, intact windows, a familiar silhouette against the neighborhood sky. The problems are structural. The beams are compromised. The load-bearing walls have been weakened by decades of water that found the places where the finish concealed rather than sealed. You do not know the house is hollowed until you press against it and feel it yield.

This paper is about one house in particular. We call it American public education. We hand it to every child at age five or six and tell them — and their parents — that it will do what houses are supposed to do: provide shelter, provide structure, provide the foundation from which a life can be built. We have spent thirty years and trillions of dollars maintaining the appearance of that promise.

Thirty years of evidence say otherwise. Not political opinion. Not one pundit's interpretation. The National Assessment of Educational Progress — the "Nation's Report Card," the most comprehensive and longest-running independent measure of what American students actually know — has been collecting data since 1990. The data is patient. It waits for us to read it honestly.

NAEP reading scores run on a scale of 0 to 500. A score of 215 for a fourth grader means a child can identify the main idea of a simple passage. It does not mean mastery. It means basic comprehension — the entry-level standard for functional literacy in a grade-four student.

The 2024 fourth-grade reading score is 215. The 1992 score was also 215. In thirty-two years — through welfare reform, No Child Left Behind, Race to the Top, Common Core, and the largest education spending increases in world history — the average American fourth grader reads at exactly the same level as the average American fourth grader did in 1992. The house has not been renovated. It has been repainted.

This paper is not written in despair. The house is not condemned — it is not beyond repair. What it requires is an honest inspection: not to score political points, but because the children living in it deserve adults who can tell the truth about the beams.

It is not written in anger. Anger is easy and often useless. It is written in the conviction that children deserve better than a system that has been failing them in plain sight for a generation while the adults responsible for it issued press releases about progress.

"The wisdom of the prudent is to discern his way, but the folly of fools is deceiving."

— Proverbs 14:8
Part One

The Report Card — Where the Richest Nation in the World Actually Stands

The Problem

Before the first table in this paper, there is something worth saying plainly: the children this data describes are not statistics. They are the boy who stopped reading for pleasure at age nine because the books assigned bored him and the school had no interest in what he loved. They are the girl who arrived at middle school unable to decode words she had never seen before, not because she wasn't intelligent, but because the reading method her school used did not teach her how. The numbers that follow are not abstractions. Each data point represents a child who deserved more than they received.

Now the data. Not because numbers matter more than children — they do not. But because numbers are what we owe children when we are trying to figure out how badly we have failed them and where we need to change.

Tier 1 — Verified

U.S. per-pupil spending: $20,387 annually (OECD Education at a Glance 2024). Rank: 3rd highest in OECD. Total public K–12 spending exceeds $800 billion annually.

Twenty Years of Evidence: What $20,000 Per Year Actually Bought

The table below is not a snapshot. It is a timeline. Read the mathematics column from top to bottom — not as a statistical curiosity but as a testimony: twenty years, one direction, the world's most expensive education system falling further below the international average with each passing cycle.

Table 1: U.S. PISA Performance Trends vs. OECD Average — 2003 to 2022
Year U.S. Math OECD Math Avg U.S. Reading OECD Reading Avg U.S. Science OECD Science Avg
2003483500495494
2006474498489500
2009487496500493502501
2012481494498496497501
2015470490497493496493
2018478489505487502489
2022465 ↓472 ↓504476499485
Source: OECD PISA Results 2003–2022 (NCES / OECD). Amber = U.S. below OECD average. ↓ = 2022 lowest U.S. math score in the assessment's history.

Twenty years. The mathematics number moved from 483 to 465 — a net decline of 18 points, still below where we started, and arriving at 2022 at its lowest recorded value. Meanwhile, the OECD average itself declined, which is the only reason the United States did not fall further behind relatively. The system that spent more than any of its peers produced less progress than almost all of them.

Tier 1 — Verified

PISA 2022: U.S. math 465 (OECD avg 472). U.S. has scored below the OECD math average in every single PISA cycle since 2003. Reading is the exception — the U.S. has historically scored above average — but the trend is flat.


The Richest Nation in the Room

There is a version of this conversation that begins with "we just need more funding." It is worth addressing that argument with the evidence it deserves — which is to say, with the evidence that specifically contradicts it.

Washington, D.C. — the seat of the federal government that writes education policy — spends approximately $30,000 per pupil annually. Its NAEP reading proficiency rate for fourth graders is 23 percent. The United States as a whole spends $20,387 per pupil. Singapore spends roughly $13,000. Finland spends roughly $12,400. Both consistently outperform the United States by margins that are not close.

Tier 1 — Verified

U.S. per-pupil: $20,387 (OECD Education at a Glance 2024, NCES). Singapore: ~$13,000. Finland: ~$12,400. Poland: ~$8,300. All three outperform the U.S. in mathematics on PISA 2022.

Table 2: Spending vs. PISA Performance — Selected Nations
Country Annual Per-Pupil Spending (USD) PISA Math 2022 PISA Reading 2022 PISA Science 2022
Singapore~$13,000575543561
Japan~$11,200536516547
South Korea~$12,400527515528
Finland~$12,400484490511
Canada~$13,500497507515
Poland~$8,300489489499
United States$20,387465504499
OECD Average~$11,900472476485
Sources: OECD Education at a Glance 2024; NCES 'Education Expenditures by Country' 2024; OECD PISA 2022 Results (Volume I).

Poland. Eight thousand dollars. Half of what we spend. Higher mathematics performance than the United States. For three consecutive PISA cycles. If the problem were money, Poland would be the cautionary tale and the United States would be the success story. The table says otherwise.

The Global Report Card: Every Subject, Every Assessment

PISA tests reading, mathematics, and science. But those are not the only lenses available. TIMSS measures mathematics and science for fourth and eighth graders independently. NAEP measures domestic performance across subjects. Taken together, they produce a composite picture — not of a system that is merely struggling, but of a system with a specific and patterned failure profile:

Tier 1 — Verified

PISA 2022: Singapore 575 math, Japan 536, S. Korea 527, Canada 497, U.S. 465, OECD avg 472. PISA reading: U.S. 504 (above average). PISA creative thinking: U.S. did not participate. TIMSS 2023 8th grade math: U.S. 518, Singapore 616, top quartile nations 560+.

Table 3: America's Full Report Card — Performance Across Every Assessed Subject
Subject / Assessment Grade U.S. Score OECD / Int'l Avg Source
PISA Mathematics (2022)D465472OECD PISA 2022
PISA Reading (2022)B504476OECD PISA 2022
PISA Science (2022)B499485OECD PISA 2022
PISA Creative Thinking (2022)INC33OECD Vol. III — U.S. did not participate
TIMSS Math, 8th Grade (2023)B518500NCES TIMSS 2023
TIMSS Science, 8th Grade (2023)B522500NCES TIMSS 2023
NAEP Reading, 4th Grade (2022)D217NAEP / NCES 2022
NAEP Math, 4th Grade (2022)D236NAEP / NCES 2022
Sources: OECD PISA 2022; NCES TIMSS 2023; NAEP 2022; OECD PISA Volume III. Grade: A = top tier, B = above avg, C = average, D = below average, INC = incomplete.

That INC deserves more than a footnote. The United States did not participate in the first-ever international assessment of creative thinking — the capacity to generate novel ideas and approach problems from multiple angles. Sixty-four countries sent students. The United States did not send a single one. You cannot improve what you refuse to measure.

Character, Critical Thinking, and the Will to Work

Academic scores tell you what students know. The following four subsections tell you something the scores do not: what kind of thinkers students are, what the learning environment they inhabit actually looks like, and what students believe about work and effort — the conditions that determine whether knowledge, once acquired, can be put to use.

1. Critical Thinking — The Assessment We Opted Out Of

The Problem

In June 2024, PISA released the results of its first-ever creative thinking assessment — measuring students' capacity to generate novel ideas, approach problems from multiple angles, and produce solutions that go beyond familiar patterns. Sixty-four countries participated, representing approximately 690,000 students. The United States did not submit data. It is listed as a non-participant.

Tier 1 — Verified

PISA 2022 Creative Thinking: 64 countries, ~690,000 students assessed. Singapore: 41 (highest). South Korea: 38. Canada: 38. OECD average: 33. United States: did not participate.

Table 4: PISA 2022 Creative Thinking — Top Performers
Country / Economy Creative Thinking Score vs. OECD Avg (33)
Singapore41+8 ▲
South Korea38+5 ▲
Canada38+5 ▲
Australia37+4 ▲
Finland36+3 ▲
Estonia35+2 ▲
OECD Average33
United StatesDID NOT PARTICIPATE
Source: OECD PISA 2022 Results (Volume III): Creative Minds, Creative Schools (June 2024).

The Cause

No Child Left Behind and its successors narrowed the American accountability system to reading and mathematics scores on standardized tests. The result was predictable: teachers teach to the measures that determine their schools' ratings. Creative thinking, critical reasoning, and the capacity to approach novel problems are not on the test — so they were not in the lesson. A generation of students was assessed on retrieval, not reasoning.

The Solution

The United States should participate in the next PISA Creative Thinking assessment — if only because you cannot improve what you refuse to measure. Beyond participation: an accountability framework that rewards reasoning alongside retrieval would begin to shift what teachers have permission to prioritize. The current framework does not give them that permission.


2. Classroom Discipline — The Order We Stopped Expecting

The Problem

Of 81 nations assessed on PISA's Disciplinary Climate Index, the United States ranks 37th. Thirty percent of American students report being distracted by electronic devices during most or all of their lessons. The Disciplinary Climate Index measures noise levels, disruption frequency, and teacher capacity to maintain a learning environment — the conditions that make it possible for instruction to occur at all.

Tier 1 — Verified

PISA 2022 Disciplinary Climate: U.S. rank #37 of 81 (index −0.02 vs. OECD mean 0.00). Device distraction during lessons: U.S. 30%, Japan 5%, South Korea 9%. U.S. Black student suspension rate: 3x white peers (CRDC 2020).

Table 5: PISA 2022 Disciplinary Climate — International Comparison
Country Climate Index PISA Rank Can't Work Well in Class Distracted by Devices
Japan1.09#112%5%
South Korea0.84#29%9%
Singapore0.61#518%22%
Finland0.38#1420%24%
United States−0.02#3714%30%
OECD Average0.0023%30%
Source: OECD PISA 2022 Results (Volume II). Suspension data: U.S. Dept. of Education CRDC 2020.

One number deserves honest context: the 14% of U.S. students who say they cannot work well in class is below the OECD average of 23%. That is a genuine bright spot. The device distraction number, however, is not — at 30%, it matches the OECD average exactly, which means we are keeping pace with a problem, not solving it.

The Cause

Character formation was removed from the American curriculum across the late 20th century. At the same time, schools disbanded most of the practical accountability structures — mandatory phone-free classrooms, consistent discipline frameworks with clear consequences — that produced the learning conditions the data now shows we are missing. Japan's classroom climate did not happen by accident. It was produced by deliberate formation policy.

The Solution

France banned phones from schools in 2018. Australia followed nationally in 2023. Both report measurable improvements in academic focus and student-reported wellbeing. The United States has the option. What it lacks is the institutional will to prioritize a learning environment over the convenience of device access.


3. Work Ethic — A 46-Year Record Hits Its Floor

The Problem

Since 1976, the Monitoring the Future survey has asked American high school seniors one straightforward question: how do you feel about work? The 2022 results represent the lowest recorded values in the survey's 46-year history on three of the four metrics measured — and a collapse of 8 to 18 percentage points in a single two-year period between 2020 and 2022.

Tier 1 — Verified

Monitoring the Future (MTF), Univ. of Michigan / NIH-NIDA, longitudinal dataset 1976–present (~15,000 high school seniors annually). 2020 to 2022 declines: "Work as central to life" — 27% to 19% (all-time low). Pre-pandemic context: the 2020 values themselves were near the lower range of the historical trend, making the 2022 collapse more significant, not less.

Table 6: American High School Seniors' Work Attitudes — The 2020 → 2022 Collapse
Attitude Measured 2020 2022 Change
Willing to work overtime when needed54%36%▼ 18 pts
Want to work (strong work desire)78%70%▼ 8 pts
Work as central to life27%19%▼ 8 pts — All-time low
Prefer work even if not financially needed39%31%▼ 8 pts
Source: Monitoring the Future National Survey, Univ. of Michigan / NIH-NIDA, 2020 and 2022 annual reports. 46-year longitudinal dataset.

Nineteen out of every hundred American seniors believe work is central to their lives. That number was measured by people who have been asking the same question for forty-six years. They know what the baseline looks like. This is not the baseline. This is a floor being discovered.

The Cause

Vocational education was defunded and stigmatized across the 1980s–2000s in favor of a universal college-prep model. The cultural signal sent to students who learn differently, aspire differently, or are simply honest about what motivates them was consistent and clear: your path is not honored here. A generation of curriculum designers forgot that the dignity of work is something that has to be taught — and that a school that never teaches it will produce adults who never learned it.

The Solution

Germany's apprenticeship system — in which roughly half of students enter professional mentorships combining classroom instruction with skilled-trade formation — produces workers who know what they are for. German youth unemployment is consistently among the lowest in the developed world. The model is documented, replicable, and not a secret. The will to implement it against the resistance of a college-prep credentialing complex is what has been missing.


4. Teacher Support — A Number That Deserves a Harder Look

PISA 2022 produced one data point that at first glance appears to be a genuine American bright spot: 72% of U.S. students report that their teachers show interest in their learning — nine percentage points above the OECD average. It is real. It deserves recognition. And then it deserves a harder question: if students feel this supported, why aren't they learning at a higher level?

Tier 1 — Verified

PISA 2022: 72% of U.S. students — teachers show interest in learning (OECD avg 63%). 80% — teachers give extra help (OECD avg 67%). These are genuine advantages over the international average.

Tier 2 — Interpretation Required

Research identifies the "PISA paradox": high student perception of teacher support does not reliably correlate with higher academic outcomes across systems. Multiple explanations exist: cultural differences in how "support" is experienced; the possibility that warmth-without-rigor reads as support to students but does not produce competency; the distinction between emotional attentiveness and instructional effectiveness. These are genuinely contested interpretations of a real finding.

American students feel supported. American students are not learning at the level that support should produce. Either the support is not translating into instruction — or what teachers are trained to prioritize has shifted in ways that feel supportive but do not build competence. Part One, Section 2 (the Pipeline Problem) addresses the preparation system that produced this outcome.

Steelman

The fairest objection to this entire character section is that international comparisons carry cultural bias. What reads as "poor disciplinary climate" in the U.S. context may reflect cultural values around student voice and autonomy that have independent worth. Work ethic data reflects post-COVID disruption that may not represent a permanent shift. These are fair cautions — they do not explain a twenty-year mathematics decline, but they deserve acknowledgment.

The Nations That Beat Us — and What They Are Actually Teaching

The countries that consistently outperform the United States have not stumbled into their results. They have made deliberate choices about what education is for, and those choices are observable in their curriculum frameworks, their teacher preparation requirements, and their national outcomes data.

Singapore built three pillars into its national curriculum: academic excellence, character development, and civic identity — and treats all three as non-negotiable. Character and Citizenship Education is a formal subject, K through 12, with dedicated time, trained teachers, and measurable outcomes. The academic results are the product of a system that refuses to separate formation from instruction.

Japan's tokkatsu gives students structured responsibility for their own learning environment — including cleaning their own classrooms and organizing their own activities — producing the disciplinary climate that ranks first in the world. Character is not a separate subject; it is built into the architecture of the school day.

Finland uses no standardized testing until age 16. Teachers require a master's degree, and teacher preparation is among the most competitive programs in the country. The result is a teaching force selected and trained for depth of craft — not for compliance with an accountability checklist.

Tier 2 — Interpretation Required

Cross-national educational comparisons require acknowledging confounding variables: cultural context, economic structure, demographic composition, and student selection effects all shape outcomes in ways that resist simple policy transfer. The strongest evidence is not that Singapore's specific programs would work identically in the United States, but that the gap between the U.S. and top-performing nations has remained consistent across decades, assessment types, and adjustments for confounding variables.

Steelman

The strongest defense of American underperformance is the poverty-and-demographics argument: disaggregated by socioeconomic quartile, U.S. students from high-income families perform comparably to top nations. This is a real finding. It does not explain why high-income American students do not outperform their international peers as clearly as the spending gap would predict — and it does not release the system from accountability for what happens to students from the other three quartiles.

What the Scoreboard Says at Home: 30 Years of NAEP

Set aside the international comparisons for a moment. Stay inside our own country, inside our own data, inside the assessment that has been measuring American students longer than any other. The Nation's Report Card does not compare us to Singapore or Finland. It only asks: are American children learning more than they did before?

Tier 1 — Verified

NAEP 4th grade reading 2022: average score 217 — not significantly different from the first NAEP reading assessment in 1992 (215). NAEP 8th grade reading 2024: 257 — lower than any score recorded before 2019. NAEP 4th grade math 2024: 237 — returned to 2005 levels after COVID declines. These are domestic measures independent of international comparison.

Table 7: NAEP Reading and Math — 30-Year Historical Trend (1992–2024)
Year 4th Grade Reading 8th Grade Reading 4th Grade Math 8th Grade Math
1992215260220268
2000213264226273
2005219262238279
2009221264240282
2013222268242285
2015223265240282
2019220263241282
2022217260236274
2024215 ←257 ↓237275
Source: NCES NAEP Long-Term Trend Assessment; Nation's Report Card 2022 and 2024. ← = return to baseline year score. ↓ = lowest recorded score.

Read the reading column from the top of the table to the bottom. 1992: 215. 2024: 215. Not similar. Identical. In thirty-two years — through every reform, every initiative, every dollar — the average American fourth grader reads at exactly the same level. The math column shows genuine gains through 2013, then stagnation, then a COVID collapse that erased twenty years of progress in two years.

Meanwhile: in the 2021–22 school year, 87% of American students graduated from high school. The highest graduation rate ever recorded. A system that produces the highest graduation rates in its history while producing fourth-grade readers who perform at 1992 levels has not become more successful. It has become better at moving students through a system that is asking less of them.

Tier 1 — Verified

2024 NAEP: 4th grade reading average 215 — identical to 1992. 8th grade reading 257 — lower than any pre-2022 score. High school graduation rate 2022: 87% — all-time high. The divergence between graduation rates and assessed learning is not a statistical anomaly. It is a policy outcome.

The Question That Demands an Answer

The numbers in this section force a question that institutions have been skillfully deflecting for twenty years: if we have been spending more than any nation on earth, hiring more teachers, launching more reforms, and producing higher graduation rates — and the children are still reading at the same level they read in 1992, and performing below the international average in mathematics — then something in the system is not working. And that something has a cause.

The discerning reader already suspects the answer. Parts Two through Eight name it directly: not in anger, not in political point-scoring, but because the evidence points there and intellectual honesty requires following it. Part Two examines reading instruction specifically — the choice made about how to teach reading that produced the flatline. Part One continues next with the question behind the question: what were the people who trained those teachers taught to prioritize?

Disclosure

Doug Hamilton is a Christian pastor and Board Certified Christian Counselor. His faith informs his worldview. This lens is acknowledged, not hidden.

This analysis was produced collaboratively with AI research tools. The methodology, judgment, and conclusions are Doug's. The research breadth is AI-assisted.

No matter how diligently we work to set aside bias, a lens remains. Do your own research. Test these findings. Hold us to our own standard. Proverbs 18:17 applies to us too.