
I Can Predict Your School's Achievement Without Looking at a Single Test Score

A machine learning analysis of roughly 1,700 Tennessee public schools across two years, comparing what letter grades tell us versus what they hide.

Tennessee gives every public school a letter grade. A through F, just like report cards. The state calculates it from a formula that weighs achievement scores, growth, chronic absenteeism, English learner progress, and for high schools, graduation rates and college/career readiness.

The formula is public. If you know a school's test scores, you can basically calculate the grade yourself. Which raises a question I've been chewing on: what if you strip out all the test-based inputs and just look at the structural stuff, the demographics, staffing, funding, discipline rates, the conditions a school operates under? How much can you predict?

The answer surprised me.

The Experiment

I pulled every publicly available dataset from the Tennessee Department of Education for the 2022-23 and 2023-24 school years: letter grades, school profiles, chronic absenteeism, discipline, educator experience, teacher retention, staffing ratios, per-pupil expenditures, funding sources, graduation rates, and dropout rates. Merged them all at the school level. About 1,690 eligible schools per year, observed across both years for 3,381 school-year observations.

Then I deliberately removed every variable that directly feeds Tennessee's letter grade formula. No achievement scores, no growth scores, no success rates, no CCR rates. What remained were 33 contextual features: things like percent economically disadvantaged, chronic absenteeism, teacher retention, per-pupil spending, and demographic composition.
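For readers who want to reproduce the setup, the data prep boils down to a school-level merge followed by dropping the formula inputs. Here is a minimal pandas sketch; the file and column names are hypothetical stand-ins for the actual TDOE download labels.

```python
import pandas as pd

# File and column names below are placeholders, not the real TDOE labels
grades = pd.read_csv("letter_grades.csv")           # school_id, year, letter_grade, success_rate, ...
profile = pd.read_csv("school_profile.csv")         # school_id, year, pct_econ_disadvantaged, ...
absentee = pd.read_csv("chronic_absenteeism.csv")   # school_id, year, pct_chronically_absent

# Merge everything at the school-year level
df = grades.merge(profile, on=["school_id", "year"], how="inner")
df = df.merge(absentee, on=["school_id", "year"], how="left")

# Hold the success rate aside as the Round 2 target, then drop every
# column that feeds the letter grade formula so only context remains
formula_inputs = ["achievement_score", "growth_score", "success_rate", "ccr_rate"]
id_cols = ["school_id", "year", "letter_grade"]
y = df["success_rate"]
X = df.drop(columns=[c for c in formula_inputs + id_cols if c in df.columns])
```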

I ran the analysis two ways. First, I tried to classify the letter grade itself (A through F). Then I switched the target to overall success rate, the continuous achievement percentage that drives the letter grade. Same features, different targets. The comparison is telling.

Round 1: Predicting the Letter Grade

Five models. Random Forest, XGBoost, Gradient Boosting, Logistic Regression, and an Ordinal Logistic model that respects the A > B > C > D > F ordering. Best accuracy across the board: about 40%.

Model Accuracy CV Accuracy Mean Absolute Error
Logistic Regression 41.8% 39.9% 0.73 grades
Ordinal Logistic 41.4% 40.1% 0.74 grades
Random Forest 40.2% 40.3% 0.75 grades
XGBoost 34.4% 40.6% 0.83 grades
Gradient Boosting 37.1% 39.6% 0.79 grades

40% accuracy across five categories is better than random (20%), but not great. The models were off by about 0.75 letter grades on average. If a school is a C, the model might guess B or D. Close, but noisy.
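If you want to see the shape of Round 1 in code, here's a hedged sketch of the classification setup with two of the five models, assuming `X` holds the 33 contextual features and `df["letter_grade"]` the target (names are placeholders, and preprocessing such as imputation is omitted).

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Encode grades on an ordinal scale so we can measure "how many grades off"
grade_order = {"F": 0, "D": 1, "C": 2, "B": 3, "A": 4}
y = df["letter_grade"].map(grade_order)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42)

models = {
    "Random Forest": RandomForestClassifier(n_estimators=300, random_state=42),
    "Logistic Regression": make_pipeline(StandardScaler(), LogisticRegression(max_iter=2000)),
}

for name, clf in models.items():
    clf.fit(X_train, y_train)
    pred = clf.predict(X_test)
    acc = accuracy_score(y_test, pred)
    grade_mae = np.mean(np.abs(pred - y_test))   # average distance in letter grades
    print(f"{name}: accuracy={acc:.3f}, off by {grade_mae:.2f} grades on average")
```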

The letter grade bins are doing real damage here. A school with a 49% success rate and a school with a 51% success rate might land in different grade buckets, but structurally they're nearly identical. The model sees the same features and reasonably groups them together, but the grading system draws an arbitrary line between them.

Round 2: Predicting Achievement Directly

Same 33 contextual features. Same schools. But instead of predicting A/B/C/D/F, I targeted the overall success rate, a continuous percentage from 5% to 95%.

Model comparison showing R-squared values for all seven regression models

R-squared comparison across models. Gradient Boosting and XGBoost both explain over 81% of variance in achievement.

Model R-squared Mean Absolute Error CV R-squared
XGBoost (Tuned) 0.823 5.5 pct pts
Gradient Boosting 0.816 5.6 pct pts 0.819
XGBoost 0.815 5.7 pct pts 0.822
Random Forest 0.759 6.4 pct pts 0.783
Ridge Regression 0.698 7.2 pct pts 0.663
Linear Regression 0.698 7.2 pct pts 0.615
Lasso 0.689 7.3 pct pts 0.661
R² = 0.82 Contextual features alone explain 82% of the variance in school achievement. No test scores needed.
±5.5 pts The tuned model predicts a school's success rate within 5.5 percentage points on average.

That is a massive jump. The same features that could only guess a letter grade 40% of the time can explain 82% of the variance in achievement when you let the model see the actual number instead of a bucketed label.
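The regression round is the same pipeline with a continuous target. Another sketch, again with placeholder names and without the full preprocessing:

```python
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import cross_val_score, train_test_split
from xgboost import XGBRegressor

# Same contextual features, continuous target: overall success rate
y = df["success_rate"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

models = {
    "Gradient Boosting": GradientBoostingRegressor(random_state=42),
    "XGBoost": XGBRegressor(n_estimators=500, learning_rate=0.05, random_state=42),
}

for name, reg in models.items():
    reg.fit(X_train, y_train)
    pred = reg.predict(X_test)
    cv_r2 = cross_val_score(reg, X_train, y_train, cv=5, scoring="r2").mean()
    print(f"{name}: R2={r2_score(y_test, pred):.3f}, "
          f"MAE={mean_absolute_error(y_test, pred):.1f} pts, CV R2={cv_r2:.3f}")
```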

Scatter plot showing actual vs predicted achievement, tightly clustered around the diagonal

Actual vs. predicted achievement. Points cluster around the diagonal, with an MAE of about 5.5 percentage points.

What Drives Achievement

SHAP (SHapley Additive exPlanations) tells us not just which features matter, but how much they move the needle and in which direction. The units here are percentage points of achievement.
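Computing those values takes only a few lines once the regressor is fit. A sketch assuming `model` is the fitted XGBoost regressor and `X_test` the held-out contextual features:

```python
import numpy as np
import pandas as pd
import shap

# TreeExplainer works directly on tree ensembles like the tuned XGBoost regressor
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_test)

# Mean absolute SHAP value per feature = average influence, in percentage points
importance = (pd.DataFrame(np.abs(shap_values), columns=X_test.columns)
                .mean()
                .sort_values(ascending=False))
print(importance.head(10))

# Beeswarm-style summary plot of per-school effects
shap.summary_plot(shap_values, X_test)
```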

SHAP feature importance showing economically disadvantaged percentage and chronic absenteeism dominating

Feature importance measured by mean absolute SHAP value. Two features dominate everything else.

Two features tower over the rest:

  • Economically disadvantaged percentage: 5.3 points of influence on average. Higher poverty, lower achievement.
  • Chronic absenteeism: 4.7 points of influence. More absent students, lower achievement.

After those two, a cluster of second-tier features emerges: local funding percentage (positive), demographic composition, experienced teachers (positive), teacher retention (positive), and discipline rates (negative). Each of these contributes roughly 0.6 to 1.3 percentage points.

SHAP beeswarm plot showing feature effects on achievement predictions

SHAP beeswarm plot. Each dot is one school. Red means high feature value, blue means low. Dots pushed right increase the predicted success rate, dots pushed left decrease it.

Look at that SHAP summary. High economically disadvantaged percentage (red dots) consistently pushes predictions left (lower achievement). High chronic absenteeism does the same. High local funding and experienced teacher percentages push right (higher achievement). The patterns are clear and consistent.

Why the Comparison Matters

The letter grade classification flopped not because the features lack signal, but because the grading system collapses a continuous reality into five bins. A school at the 49th percentile and a school at the 51st percentile might be structurally identical, but one gets a C and the other a B. The model can't distinguish them because there's nothing structurally distinguishing to find.

When you let the model predict the actual achievement percentage, it stops fighting artificial boundaries and starts learning the real relationship between conditions and outcomes. The same data that produced a mediocre 40% classifier produces an R-squared of 0.82 when you ask the right question.

This is a data science lesson wrapped in education policy. If your outcome variable is discretized from something continuous, you're throwing away information. The letter grade system takes a rich, nuanced distribution of achievement and flattens it into a handful of buckets.
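A toy example makes the point. The bin edges below are illustrative, not Tennessee's actual cut scores:

```python
import pandas as pd

# Two structurally near-identical schools on either side of a cut score
success = pd.Series({"School X": 49.0, "School Y": 51.0})
grades = pd.cut(success, bins=[0, 35, 50, 65, 100], labels=["D", "C", "B", "A"])
print(grades)
# School X -> C, School Y -> B: a 2-point difference becomes a full letter grade,
# while the continuous target preserves how close the two schools actually are
```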

Distribution of achievement and achievement by letter grade showing overlap between grade categories

Left: the actual distribution of achievement across Tennessee schools. Right: the same data, grouped by letter grade. Notice the overlap, especially between B, C, and D schools.

A Case Study: Greeneville City Schools

I work for Greeneville City Schools, so I ran our numbers through the same lens. The model says structural context, led by poverty and absenteeism, explains 82% of the variance in achievement. GCS has a district-wide economically disadvantaged rate around 29%, which puts us in the middle of the pack. Based on structural factors alone, the model would predict us to land somewhere around the state average.

We don't.

+8.1 pts In 2023-24, GCS scored 8.1 percentage points above the expected achievement for districts with our demographic profile, nearly double the 4.2-point gap from the year before.
15th of 98 Among districts with similar ED populations, GCS ranked 15th in achievement in 2023-24, up from 26th the year prior.

In 2023-24, four of our seven schools earned A grades. Here's every GCS school, year over year:

School ED % 2022-23 2023-24 Change
Eastview Elementary 18% 56.9% (A) 61.2% (A) +4.3 pts
Tusculum View Elementary 27% 41.8% (B) 50.0% (A) +8.2 pts
Greeneville High School 24% 50.0% (A) 48.4% (A) -1.6 pts
Greeneville Middle School 24% 44.9% (B) 47.8% (A) +2.9 pts
Hal Henard Elementary 36% 49.7% (B) 48.3% (C) -1.4 pts
Highland Elementary 54% 32.6% (C) 36.5% (C) +3.9 pts
TOPS Greeneville 17% 29.1% (D) 37.5% (C) +8.4 pts

Five of seven schools improved, several significantly. Tusculum View jumped from a B to an A with an 8.2-point gain. TOPS Greeneville climbed 8.4 points and moved from a D to a C. Even Highland Elementary, our highest-poverty school at 54% ED, scored 36.5%, well above the 24% state average for schools in that ED range. Highland ranks 13th out of 131 schools with similar poverty levels statewide.

The model says schools like ours should perform at a certain level given our demographics. We keep outperforming that prediction, and the gap is widening. That's not an accident. That's what happens when experienced teachers stay (we have strong retention), absenteeism is managed, and the district invests in the things that actually move the needle.
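One way to quantify "outperforming the prediction" is to compare a district's actual success rates to what the contextual model predicts for its schools. A hedged sketch, with `df`, `X`, `y`, `model`, and the district column name all placeholders (the figures above come from comparing against demographically similar districts, so treat this as one plausible approach rather than the exact calculation):

```python
# Placeholder names throughout: df, X, y, and model are the merged dataset
# and the fitted XGBoost regressor from the sketches above
mask = df["district_name"] == "Greeneville City Schools"

expected = model.predict(X.loc[mask])        # what context alone predicts
actual = y.loc[mask]
gap = (actual - expected).mean()             # positive = outperforming the context
print(f"Average gap vs. contextual expectation: {gap:+.1f} pct pts")
```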

What This Means for Districts

If you run a school district in Tennessee, here is what 1,700 schools, two years of data, and seven models are telling you:

  • The achievement rate that drives your letter grade is 82% predictable from factors that have nothing to do with how well you teach. Poverty and absenteeism alone account for most of that variance.
  • The two highest-leverage things a district can invest in are reducing chronic absenteeism and supporting economically disadvantaged students. Everything else is a rounding error by comparison.
  • Teacher experience and retention matter, but they're second-tier effects. A school with great teachers in a high-poverty, high-absenteeism context will still struggle on paper.
  • Spending more money per pupil, counterintuitively, correlates negatively with achievement. This isn't because money hurts. It's because Title I funding flows to the schools that need it most, and need is correlated with the same factors that drag down scores.

None of this is new to anyone who runs schools. We all know poverty predicts outcomes. But there's a difference between knowing it and seeing a machine learning model explain 82% of the variance with nothing but contextual features. It puts a precise number on something we've felt in our bones for years.

The uncomfortable implication: Tennessee's letter grade system is, to a large degree, grading the ZIP code. A school's structural context is doing most of the talking, and the letter grade is mostly just a noisy echo of it. But districts like Greeneville show it doesn't have to be destiny. The 18% of variance the model can't explain? That's where the work happens.

Methodology Notes

Data: Tennessee Department of Education public data downloads for 2022-23 and 2023-24. All school-level. Schools flagged as ineligible for letter grades were excluded. Approximately 1,690 unique schools observed across both years, yielding 3,381 school-year observations (3,345 with valid achievement data).

Features: 33 contextual variables across demographics, teacher quality, discipline, absenteeism, finance, staffing, graduation, and dropout. All formula-input features (achievement scores, growth scores, success rates, CCR rates) were deliberately excluded.

Models: Seven regression models (Linear, Ridge, Lasso, ElasticNet, Random Forest, Gradient Boosting, XGBoost) plus hyperparameter tuning via RandomizedSearchCV. Five classification models for the letter grade comparison. 80/20 train/test split, stratified. 5-fold cross-validation on training sets.
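For the tuning step, a representative RandomizedSearchCV setup looks like the sketch below; the search space is illustrative rather than the exact grid used.

```python
from scipy.stats import randint, uniform
from sklearn.model_selection import RandomizedSearchCV
from xgboost import XGBRegressor

# Illustrative search space; the actual tuned values aren't reproduced here
param_dist = {
    "n_estimators": randint(200, 1000),
    "max_depth": randint(3, 8),
    "learning_rate": uniform(0.01, 0.2),   # uniform on [0.01, 0.21]
    "subsample": uniform(0.6, 0.4),        # uniform on [0.6, 1.0]
}

search = RandomizedSearchCV(
    XGBRegressor(random_state=42),
    param_distributions=param_dist,
    n_iter=50,
    cv=5,
    scoring="r2",
    random_state=42,
    n_jobs=-1,
)
search.fit(X_train, y_train)
print(search.best_params_, round(search.best_score_, 3))
```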

SHAP values computed via TreeExplainer on the XGBoost regression model. All code available on request.

2024-25 TN School Letter Grades: Third Year Analysis

The Tennessee Department of Education released the 2024-25 school letter grades on December 18, 2024. This is the third year of letter grades under the revamped formula that emphasizes academic achievement over growth. In this post, I continue my analysis of these letter grades, examining distribution trends, demographic correlations, and standout schools.

Data Sources

The data files used for this analysis are available from the Tennessee Department of Education's data downloads page. I merged the 2024-25 Letter Grade File with the 2024-25 School Profile data to examine demographic patterns.

Out of 1,905 schools listed for letter grades, 208 (10.9%) were ineligible to receive a grade. These schools were excluded from this analysis, leaving 1,697 eligible schools.

Distribution

The 2024-25 letter grades were distributed as follows:

  • A: 355 (20.9%)

  • B: 483 (28.5%)

  • C: 491 (28.9%)

  • D: 302 (17.8%)

  • F: 66 (3.9%)

This continues the positive trend we've seen over the past three years. The percentage of A schools has increased from 17.4% in 2022-23 to 20.9% in 2024-25, while F schools have decreased from 5.4% to 3.9%.
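Reproducing the distribution is a short pandas exercise once the ineligible schools are dropped. Column and file names here are placeholders for whatever the Letter Grade File actually uses:

```python
import pandas as pd

# Hypothetical file and column names for the 2024-25 Letter Grade File
grades = pd.read_csv("letter_grade_file_2024_25.csv")

# Drop the schools flagged as ineligible for a grade (assumes a boolean flag column)
eligible = grades[~grades["ineligible"]]
print(len(eligible))   # 1,697 in the release described above

# Counts and percentages by letter grade
counts = eligible["letter_grade"].value_counts().sort_index()
percent = (counts / counts.sum() * 100).round(1)
print(pd.DataFrame({"count": counts, "percent": percent}))
```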

Distribution Comparison

Year-Over-Year Comparison

Grade 2022-23 2023-24 2024-25 3-Year Change (pct pts)
A 17.4% 19.0% 20.9% +3.5
B 26.1% 26.8% 28.5% +2.4
C 30.4% 29.8% 28.9% -1.5
D 20.7% 19.6% 17.8% -2.9
F 5.4% 4.8% 3.9% -1.5

The data shows steady improvement: more schools are earning A's and B's while fewer are receiving D's and F's.

What Influences a Grade

Average Scores by Letter Grade

Grade Achievement Growth Growth25 Success Rate LG Score
A 4.85 4.94 4.29 58.2% 4.83
B 4.06 3.78 3.47 44.5% 3.91
C 3.21 2.52 2.91 35.4% 2.94
D 2.08 1.61 2.71 23.7% 2.01
F 1.00 1.00 2.90 12.5% 1.18

A notable pattern persists from previous years: schools with an F grade actually show higher Growth25 scores (2.90) than D schools (2.71). This metric measures the progress of the lowest-performing 25% of students. While F schools are making gains with their struggling students, this improvement is not sufficiently weighted to improve their overall letter grade.
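The table above is a straightforward groupby on the eligible schools. A sketch with hypothetical column names:

```python
# Average component scores by letter grade (column names hypothetical)
by_grade = (eligible
            .groupby("letter_grade")[["achievement", "growth", "growth25", "success_rate"]]
            .mean()
            .round(2))
print(by_grade)
# Note how the F row's mean Growth25 can exceed the D row's, the pattern discussed above
```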

Summary Dashboard

Subgroup Analysis

The relationship between demographic factors and letter grades remains significant.

Economically Disadvantaged Students

Average percentage of economically disadvantaged students in schools at each grade:

Grade 2022-23 2023-24 2024-25
A 18.3% 22.3% 20.5%
B 28.9% 31.1% 29.2%
C 36.5% 38.9% 35.0%
D 42.0% 45.9% 41.7%
F 54.5% 49.6% 56.8%

Black, Hispanic, Native American Students

Average percentage of Black, Hispanic, and Native American (BHN) students in schools at each grade:

Grade 2022-23 2023-24 2024-25
A 21.8% 35.8% 27.4%
B 27.7% 48.7% 36.3%
C 39.5% 56.0% 42.0%
D 55.0% 65.8% 52.5%
F 80.1% 82.2% 84.5%

The gap between A schools and F schools remains substantial. A schools average 20.5% economically disadvantaged students compared to 56.8% in F schools. For BHN students, the gap is even more pronounced: 27.4% in A schools versus 84.5% in F schools.

Subgroup Three-Year Comparison

Correlation Analysis

The correlations between demographic factors and letter grade scores show a consistent pattern across all three years:

Subgroup 2022-23 2023-24 2024-25
Economically Disadvantaged -0.50 -0.44 -0.47
Black/Hispanic/Native American -0.37 -0.39 -0.34
Students with Disabilities -0.09 -0.13 -0.12

The correlation between economically disadvantaged percentage and letter grade score (r = -0.47) indicates a moderate negative relationship. Schools with higher percentages of economically disadvantaged students tend to receive lower letter grades.
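These are plain Pearson correlations, computable with scipy. A sketch, with hypothetical column names on the merged file:

```python
from scipy.stats import pearsonr

# Correlate subgroup percentages with the letter grade score
# (column names hypothetical; drop missing values before the test)
for col in ["pct_econ_disadvantaged", "pct_bhn", "pct_swd"]:
    sub = merged[[col, "lg_score"]].dropna()
    r, p = pearsonr(sub[col], sub["lg_score"])
    print(f"{col}: r = {r:.2f}, p = {p:.3g}")
```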

Correlation Scatterplots

Correlation Heatmap

Conclusions

The 2024-25 letter grades show continued improvement across Tennessee schools. Key findings include:

1.    More schools are earning A's (20.9%, up from 17.4% three years ago) and fewer are receiving F's (3.9%, down from 5.4%).

2.    The correlation between poverty and letter grades remains consistent and moderately strong (r = -0.47), but some high-poverty schools continue to beat the odds.

3.    F schools continue to show strong Growth25 scores, meaning they are making progress with their lowest-performing students, but this is not reflected in their overall grades.

The letter grade system continues to simplify complex accountability measures into easily digestible grades. While this provides transparency for families and communities, it's important to remember that these grades are heavily influenced by factors outside of schools' direct control, including poverty and segregation. Schools serving high-need populations face steeper challenges in achieving high letter grades, making the success of many schools all the more remarkable.

Also, it is important to note that many schools in Northeast TN (Johnson, Carter, Unicoi, Greene, and Washington Counties) went through major flooding in the fall; they worked very hard just to hold school at all, let alone show improvement in accountability data.

This analysis used Python with pandas, matplotlib, seaborn, and scipy for data processing and visualization. I used Claude to do the coding and proof my writing.

How to Show Up and Stand Out in an Interview

A student asked me for advice for an assistant principal interview. I rattled off some things, had ChatGPT fact-check them, and it turned into this blog post.

Interviews are more than just a series of questions—they’re a chance to show who you are, how you think, and why you're a great fit for a leadership role in a school. Here’s a research-supported guide that blends practical advice with professionalism.

Be Authentic

People can spot inauthenticity quickly. Trying to be someone you're not is hard to maintain—especially under pressure. Research shows that authenticity in high-stress interviews improves how candidates are perceived (Krumhuber et al., 2022). Be yourself. Be prepared. That’s more than enough.

Make It a Conversation

The strongest interviews feel like a dialogue, not an interrogation. When candidates ask thoughtful questions and stay conversational, it builds rapport and demonstrates interest. SHRM (2021) notes that hiring panels consistently rank these candidates higher.

Be Specific About Why You Want THIS Job

Avoid generalities like “I’m ready for a change” or “I want a promotion.” Talk about why you want this job in this school with these people. Research on person-organization fit shows that alignment with values and mission significantly increases hiring likelihood (Kristof-Brown et al., 2005).

Bring Work Products and Materials

Come with a clean, professional copy of your resume, letter of intent, and any relevant work samples. A 2023 LinkedIn survey found that 74% of hiring managers value candidates who bring a portfolio or artifacts to support their answers.

Have a Laptop with You

You may be asked to show something electronically—lesson plans, data reports, or digital tools. Being able to pull something up shows you're prepared and tech-capable.

Take Notes

Even if it’s just jotting down a few words, taking notes signals engagement and gives you a moment to think before responding. According to Forbes (2021), it also makes you appear focused and professional.

Dress the Part

Your clothing sends a message before you speak. Professional dress still impacts perceived competence and leadership. For men, a jacket tends to increase perceived authority. For women, conservative neckline choices receive more serious consideration—largely due to unconscious bias (Psychology Today, 2022; Bègue et al., 2019). This isn’t about style policing—it’s about managing perception.

Carry Something Grounding

I always bring a bright-orange Yeti to high-stakes meetings. It helps keep me calm. Research supports the idea that small comfort objects can lower anxiety and improve focus (Clinical Psychological Science, 2018).

Scan the Room When You Talk

Use the “thirds rule”: spend part of your time making eye contact with the center, part with the left, and part with the right side of the panel. This creates a sense of inclusion and presence (Toastmasters International).

It’s Okay Not to Know

If you don’t have a perfect answer, say so. Ask a clarifying question, take a breath, and gather your thoughts. Hiring managers respect honesty and thoughtfulness over a shaky bluff.

What Not to Do

  • Don’t lie.
  • Don’t exaggerate.
  • Don’t make things up.

Honesty and humility remain two of the top-rated traits hiring committees look for in leadership roles (Indeed Hiring Lab, 2022).

Final Word

Be humble. Be confident. Be prepared.
But most of all—be yourself.

References