Math I · S-ID.6.b

Assessing Model Fit with Residuals

This objective teaches students how to check whether a model is actually doing its job. A fitted line or curve can look convincing, but residuals reveal the errors one point at a time.

Concept Statistics and Probability

Domain Interpreting Categorical and Quantitative Data

Read time 8 minutes

What this learning objective is really asking you to learn

This learning objective asks students to evaluate a fitted model. In the previous objective, students used scatter plots and fitted functions to model relationships between two quantitative variables. But fitting a model is not the end of the process. A responsible mathematician asks: How well does the model fit? Where does it miss? Are the errors small or large? Are the errors random, or do they form a pattern? Residuals answer those questions.

A residual is the difference between an observed value and a predicted value. In many classrooms, residual is defined as $actual y - predicted y$. If a model predicts a test score of 78 for a student and the actual score is 84, the residual is $84 - 78 = 6$. The model underpredicted by 6 points. If the model predicts 92 and the actual score is 85, the residual is $85 - 92 = -7$. The model overpredicted by 7 points.

Residuals give a signed measure of error. A positive residual means the actual value is above the model's prediction. A negative residual means the actual value is below the prediction. A residual near zero means the model predicted that point closely. The size of the residual tells how large the miss is; the sign tells the direction of the miss.

Students should understand that residuals are not failures by themselves. Real data almost never fit a model perfectly. A residual is normal. The important question is whether the residuals are reasonably small and patternless or whether they reveal a problem with the model. A model can be useful even with residuals, but residuals should not be ignored.

A residual plot displays residuals on the vertical axis against the original input values on the horizontal axis. If a model is appropriate, the residuals should be scattered randomly around zero with no clear pattern. That means the model's errors are not systematically related to the input. If the residual plot shows a curve, fan shape, cluster, or other pattern, the model may be missing structure.

For a linear model, residuals help decide whether a line is reasonable. Suppose a scatter plot curves upward, but a student fits a straight line. The original scatter plot may still look somewhat linear at first glance. But the residual plot may show negative residuals at low $x$, positive residuals in the middle, and negative residuals at high $x$. That curved pattern says the line is not capturing the relationship well. A different model, perhaps quadratic or exponential depending on context, may be better.

Residuals are a form of mathematical honesty. It is easy to draw a line through data and act as if the job is done. Residuals force the model to face every data point. They ask, “What did you predict here? What actually happened? How far off were you?” This is the same basic discipline used in science, engineering, economics, machine learning, and forecasting.

Why students should learn this math

Students should learn this math because predictions are only useful if we check them. Weather forecasts, sales forecasts, sports projections, traffic estimates, medical risk scores, recommendation systems, school growth targets, and business models all make predictions. A model that is never tested against actual outcomes is just a guess with algebra attached.

Residuals show whether a model is biased in a particular direction. Suppose a delivery-time model underpredicts times for long-distance orders and overpredicts times for short-distance orders. The average error might look small, but the pattern reveals unfair or unreliable predictions for different groups. In real systems, such patterns matter. They can affect planning, staffing, customer expectations, and trust.

In science, residuals help researchers see whether a theory or model captures the data. If residuals are random, the model may be adequate. If residuals show structure, the model may be missing a variable or using the wrong form. Many discoveries begin when the “leftover” error refuses to behave randomly.

In engineering and manufacturing, residual thinking is quality control. A machine may be designed to produce a part of a target size. The difference between actual and target size is like a residual. If errors are small and random, the process may be stable. If errors drift upward over time, the machine may need adjustment. Residuals reveal not only how much a system misses but how it misses.

In everyday life, residual thinking teaches humility. A student might predict how long homework will take, how much money a project will cost, or how many hours of sleep they will get. Comparing predictions with actual outcomes helps improve planning. The habit is the same: make a model, observe reality, study the misses, revise.

The “why” is simple: models have power, and power needs checking. A fitted function can influence decisions. Residuals keep those decisions grounded in evidence.

The historical machinery behind this idea

Residuals come from the broader history of measurement error and model fitting. Astronomers, surveyors, and scientists often had to reconcile imperfect observations. Measurements of the same object did not agree perfectly because instruments, conditions, and human observation introduced error. The question became: how should we choose the best model or estimate when the data do not line up exactly?

The method of least squares became a foundational answer. It chooses model parameters by minimizing the sum of squared residuals. Squaring residuals prevents positive and negative errors from canceling and gives larger errors more weight. At the Math I level, students do not need to derive least squares. But they should know that modern fitting methods are built around residuals. The residual is the raw material of model evaluation.

Residual analysis became important because a model can have a good-looking summary and still be wrong in a structured way. A line might have a high correlation with data, but residuals could reveal curvature. A model might have small average error overall but large errors for a subgroup. A residual plot makes the invisible visible by separating the model's predictions from the model's misses.

This idea has become even more important in the age of data science and machine learning. Advanced models are judged by prediction errors on data. The vocabulary may change—loss, error, residual, prediction difference—but the core idea is the same. Compare predicted values to actual values and learn from the gaps.

In the full historical arc, residuals represent a shift from believing models because they are elegant to testing models because they must answer to data. That is a major scientific habit.

Technical execution: how to do the math

To compute a residual, first find the predicted $y$ value from the model for a given $x$. Then subtract: $residual = actual y - predicted y$. Students should keep the sign. If the residual is positive, the point lies above the model. If it is negative, the point lies below the model.

Suppose a fitted line predicts $y = 4x + 10$. For a data point $(6, 38)$, the predicted value is $4(6) + 10 = 34$. The residual is $38 - 34 = 4$. The actual value is 4 units above the prediction. For a data point $(8, 39)$, the predicted value is $4(8) + 10 = 42$, and the residual is $39 - 42 = -3$. The actual value is 3 units below the prediction.

After computing residuals for all points, students can plot them. The horizontal coordinate is the original $x$ value. The vertical coordinate is the residual. A horizontal reference line at zero represents perfect prediction. Points above zero were underpredicted. Points below zero were overpredicted.

When analyzing a residual plot, students should ask: Are residuals centered around zero? Are they randomly scattered? Do they form a curve? Does the spread grow or shrink as $x$ increases? Are there outlier residuals? A good residual plot for a linear model looks like random scatter around zero. A patterned residual plot suggests the model may be inappropriate.

A curved pattern usually means a line is not capturing a curved relationship. A fan shape, where residuals become more spread out for larger $x$, may mean the variability changes across the range. A cluster of large residuals may identify a subgroup or unusual conditions. A single large residual may be an outlier worth investigating.

Students should also connect residual size to context. A residual of 5 dollars may be small in a house-price model but huge in a pencil-price model. A residual of 3 minutes may be acceptable for a commute estimate but unacceptable for a medical emergency response model. Error size must be judged with units and purpose.

Informally assessing fit means students do not need formal hypothesis tests or advanced formulas. They need to reason visually and contextually. Does the model miss in a random way, or is it systematically wrong? Are the misses small enough for the model's purpose? Would another type of function likely fit better? These are mature questions.

Where this objective fits on the full map of mathematics

S-ID.6.b sits directly after function fitting because every model needs evaluation. Objective 054 asks students to fit functions to data. Objective 055 asks them to inspect the leftover errors. Objective 056 will return to fitting a linear function when a scatter plot suggests a linear association. Objective 057 will interpret slope and intercept. Objective 058 will quantify linear association with correlation.

This objective connects to algebra because residuals are calculated from expressions. Students substitute an input into an equation, get a predicted output, and subtract. It connects to functions because the model is a function and the residual compares the function's output to observed data. It connects to statistics because the residuals themselves form a distribution of errors.

Residuals also connect to the idea of proof and justification. A model should not be accepted just because it was drawn. The residual plot is evidence for or against the appropriateness of the model. In this sense, residuals are part of mathematical argument: they support a claim that a model fits reasonably or show why the claim is weak.

Later in statistics, residuals become central to regression analysis. Students will learn about least-squares regression lines, standard errors, prediction intervals, and model assumptions. In advanced mathematics and data science, residual analysis is used to diagnose bias, nonlinearity, changing variance, and missing variables. Objective 055 is the first doorway into that world.

In the full map, this objective teaches that mathematics is iterative. You model, check, revise, and check again. That cycle is the engine of applied mathematics.

Common misconceptions and productive corrections

One misconception is that a small number of residuals proves a model is good. Students need to look at all residuals and their pattern. A few small errors do not guarantee that the model is appropriate across the whole range.

Another misconception is that positive and negative residuals cancel, so the model is fine. Cancellation can hide large errors. A model that overpredicts by 100 and underpredicts by 100 has an average residual of zero, but the misses are huge. That is why residual size and patterns matter.

A third misconception is that residuals are the same as $predicted - actual$. Some contexts use different sign conventions, but in this course students should be clear about the chosen formula. If using $actual - predicted$, positive means above the model and negative means below. Consistency is more important than memorizing a phrase.

A fourth misconception is that any model with residuals is bad. Not true. Residuals are expected with real data. The question is whether the residuals are acceptable for the model's purpose and whether they show a troubling pattern.

Mastery check

A student has mastered this objective when they can compute residuals, explain their signs and sizes, create or read a residual plot, and decide informally whether a fitted function is reasonable. They can say not just “the residual is 4,” but “the model underpredicted this point by 4 units, and the residual plot suggests whether that miss is part of random scatter or a pattern.”

Problem Library

Problems in the App From This Objective

144 problems across 12 archetypes in the app.

calculate observed minus predicted.

12 problems Warmup Practice Mixed Review Assessment

Problem 1

Compute the residual for observed value 18 and predicted value 15.

Problem 2

Compute the residual for observed value 42 and predicted value 45.

Problem 3

Compute the residual for observed value 10.5 and predicted value 10.5.

Problem 4

Compute the residual for observed value 7 and predicted value 9.2.

Problem 5

Compute the residual for observed value 25 and predicted value 20.

Problem 6

Compute the residual for observed value 10 and predicted value 17.

Problem 7

Compute the residual for observed value 50 and predicted value 50.

Problem 8

Compute the residual for observed value 12.5 and predicted value 10.0.

Problem 9

Compute the residual for observed value 5.1 and predicted value 8.3.

Problem 10

Compute the residual for observed value 3.75 and predicted value 3.75.

Problem 11

Compute the residual for observed value 15 and predicted value 11.8.

Open in simulator

Problem 12

Compute the residual for observed value 8 and predicted value 10.1.

explain overprediction or underprediction.

12 problems Warmup Practice Mixed Review Assessment

Problem 13

Interpret residual 5 in context predicted test score was lower than observed.

Problem 14

Interpret residual -3 in context predicted commute time compared with observed commute time.

Problem 15

Interpret residual 0 in context predicted cost compared with actual cost.

Problem 16

Interpret residual -8 in context actual height in centimeters.

Problem 17

Interpret residual 2 in context predicted sales volume was less than actual.

Problem 18

Interpret residual -7 in context predicted number of defects was higher than actual.

Open in simulator

Problem 19

Interpret residual 0 in context predicted rainfall compared to actual rainfall.

Problem 20

Interpret residual 10 in context predicted temperature was cooler than observed.

Problem 21

Interpret residual -1.2 in context predicted weight was heavier than observed.

Problem 22

Interpret residual 0.5 in context predicted growth rate was slower than actual.

Problem 23

Interpret residual -20 in context predicted time to complete a task was longer than actual.

Problem 24

Interpret residual 0 in context predicted stock price compared to actual stock price.

judge size of prediction error.

12 problems Warmup Practice Mixed Review Assessment

Problem 25

Interpret the magnitude of residual 2 relative to data scale values are around 100.

Problem 26

Interpret the magnitude of residual 25 relative to data scale values are usually between 40 and 60.

Problem 27

Interpret the magnitude of residual -0.5 relative to data scale measurements are in meters and range from 0 to 10.

Problem 28

Interpret the magnitude of residual -12 relative to data scale typical residuals are between -2 and 2.

Problem 29

Interpret the magnitude of residual 0.1 relative to data scale data points are in the thousands.

Problem 30

Interpret the magnitude of residual 3 relative to data scale the values are generally between 1 and 5.

Problem 31

Interpret the magnitude of residual -7% relative to data scale typical errors are within +/- 2%.

Problem 32

Interpret the magnitude of residual 0 relative to data scale the expected values are around 50.

Problem 33

Interpret the magnitude of residual 15 cm relative to data scale the objects being measured are about 1 meter long.

Problem 34

Interpret the magnitude of residual -500 relative to data scale the data values range from 10,000 to 20,000.

Open in simulator

Problem 35

Interpret the magnitude of residual 0.8 relative to data scale the measurements are typically between 0 and 1.

Problem 36

Interpret the magnitude of residual -3 relative to data scale typical residuals are around +/- 0.5.

compute residuals for several points.

12 problems Warmup Practice Mixed Review Assessment

Problem 37

Create a residual table from observed values 10, 14, 19 and predicted values 11, 13, 17.

Problem 38

Create a residual table from observed values 5, 7, 9, 11 and predicted values 5, 8, 8, 12.

Problem 39

Create a residual table from observed values 100, 95, 90 and predicted values 98, 96, 92.

Problem 40

Create a residual table from observed values 10, 20, 30 and predicted values 5, 15, 25.

Problem 41

Create a residual table from observed values 10, 20, 30 and predicted values 15, 25, 35.

Problem 42

Create a residual table from observed values 1, 2, 3, 4 and predicted values 1, 3, 2, 5.

Problem 43

Create a residual table from observed values 50, 60, 70, 80 and predicted values 45, 65, 70, 75.

Problem 44

Create a residual table from observed values 10, 20 and predicted values 12, 18.

Problem 45

Create a residual table from observed values 1, 2, 3, 4, 5 and predicted values 0, 2, 4, 3, 6.

Problem 46

Create a residual table from observed values 200, 210, 220, 230 and predicted values 195, 215, 220, 235.

Problem 47

Create a residual table from observed values -5, -2, 0, 3 and predicted values -6, -2, 1, 2.

Problem 48

Create a residual table from observed values 10, 5, 0 and predicted values -2, -5, -1.

Open in simulator

plot input versus residual.

12 problems Warmup Practice Mixed Review Assessment

Problem 49

Create residual plot points from inputs 1, 2, 3, 4 and residuals -1, 2, 0, -2.

Problem 50

Create residual plot points from inputs 0, 5, 10 and residuals 3, -1, -4.

Open in simulator

Problem 51

Create residual plot points from inputs 2, 4, 6, 8 and residuals 0.5, -0.5, 1, -1.

Problem 52

Create residual plot points from inputs 10, 20, 30 and residuals 1, -2, 0.

Problem 53

Create residual plot points from inputs -3, -1, 0, 2, 4 and residuals 0.1, -0.2, 0.3, -0.1, 0.2.

Problem 54

Create residual plot points from inputs 1.5, 2.5, 3.5 and residuals -0.5, 0.5, -0.5.

Problem 55

Create residual plot points from inputs 100, 200, 300, 400 and residuals 5, -10, 0, 15.

Problem 56

Create residual plot points from inputs -5, -4, -3, -2, -1 and residuals 1, 1, -1, -1, 0.

Problem 57

Create residual plot points from inputs 0.1, 0.2, 0.3, 0.4 and residuals 0.01, -0.02, 0.03, -0.01.

Problem 58

Create residual plot points from inputs 7, 8, 9 and residuals 0, 0, 0.

Problem 59

Create residual plot points from inputs 1, 1.5, 2, 2.5, 3 and residuals -0.1, 0.2, -0.3, 0.4, -0.5.

Problem 60

Create residual plot points from inputs -10, -5, 0, 5, 10 and residuals 2, 1, 0, -1, -2.

judge linear model fit as reasonable.

12 problems Warmup Practice Mixed Review Assessment

Problem 61

Interpret residual plot description points are randomly scattered above and below zero with no clear pattern.

Open in simulator

Problem 62

Interpret residual plot description small random residuals centered near zero.

Problem 63

Interpret residual plot description residuals are random but widely spread.

Problem 64

Interpret residual plot description residuals are randomly distributed around the zero line.

Problem 65

Interpret residual plot description the residual points show no discernible pattern or trend.

Problem 66

Interpret residual plot description residuals are scattered randomly above and below the x-axis.

Problem 67

Interpret residual plot description the plot shows a random cloud of points centered at zero.

Problem 68

Interpret residual plot description no systematic curve or fan pattern is visible in the residuals.

Problem 69

Interpret residual plot description residuals are randomly dispersed without any structure.

Problem 70

Interpret residual plot description the residual plot displays purely random noise around zero.

Problem 71

Interpret residual plot description points are randomly scattered with no obvious pattern.

Problem 72

Interpret residual plot description residuals are randomly spread, indicating no systematic bias.

identify model mismatch.

12 problems Warmup Practice Mixed Review Assessment

Problem 73

Interpret patterned residual plot description residuals are positive at low and high x-values and negative in the middle.

Problem 74

Interpret patterned residual plot description residuals fan out as x increases.

Problem 75

Interpret patterned residual plot description residuals form separate clusters above and below zero.

Problem 76

Interpret patterned residual plot description residuals show an S-shaped curve.

Problem 77

Interpret patterned residual plot description residuals become narrower as x increases.

Open in simulator

Problem 78

Interpret patterned residual plot description all residuals are positive.

Problem 79

Interpret patterned residual plot description all residuals are negative.

Problem 80

Interpret patterned residual plot description residuals show a wave-like pattern.

Problem 81

Interpret patterned residual plot description residuals consistently increase as x increases.

Problem 82

Interpret patterned residual plot description residuals consistently decrease as x increases.

Problem 83

Interpret patterned residual plot description residuals form two distinct parallel lines, one above zero and one below.

Problem 84

Interpret patterned residual plot description residuals first fan out and then fan in as x increases.

choose model with smaller/more random residuals.

12 problems Warmup Practice Mixed Review Assessment

Problem 85

Compare models using residual summaries -1, 0, 1, -1 and -5, 3, 4, -2.

Problem 86

Compare models using residual summaries random scatter from -3 to 3 and curved pattern from -1 to 1.

Problem 87

Compare models using residual summaries 2, 2, 2, 2 and -1, 0, 1, 0.

Problem 88

Compare models using residual summaries 10, -8, 12, -9 and 1, -2, 0, 1.

Problem 89

Compare models using residual summaries -2, 1, 0, -1, 2 and 3, 4, 3, 4, 3.

Problem 90

Compare models using residual summaries increasing pattern from -1 to 1 and random scatter from -2 to 2.

Problem 91

Compare models using residual summaries 0.1, -0.2, 0.0, 0.1 and 5, 6, 5, 6.

Problem 92

Compare models using residual summaries -5, 2, -3, 4, 0 and -1, 0, 1, -0.5, 0.5.

Open in simulator

Problem 93

Compare models using residual summaries -0.5, 0.5, -0.1, 0.1 and -3, -2, -3, -2.

Problem 94

Compare models using residual summaries decreasing pattern from 3 to -3 and random scatter from -3 to 3.

Problem 95

Compare models using residual summaries 0.01, -0.02, 0.03, -0.01 and -1, 2, -3, 1.

Problem 96

Compare models using residual summaries 1.5, 1.2, 1.8, 1.3 and -0.5, 0.2, -0.1, 0.4.

find unusually large residual.

12 problems Warmup Practice Mixed Review Assessment

Problem 97

Identify the outlier from residuals A=1, B=-2, C=15, D=0.

Problem 98

Identify the outlier from residuals P=-0.5, Q=0.2, R=-8, S=1.

Problem 99

Identify the outlier from residuals W=2, X=-1, Y=3, Z=-2.

Problem 100

Identify the outlier from residuals M=0.1, N=-0.2, O=0.3, P=10.

Problem 101

Identify the outlier from residuals G=1.2, H=-0.5, I=-12, J=0.8.

Problem 102

Identify the outlier from residuals K=0.5, L=-0.8, M=0.3, N=-0.6.

Problem 103

Identify the outlier from residuals V1=0.1, V2=0.5, V3=-0.2, V4=0.3, V5=20.

Problem 104

Identify the outlier from residuals D1=1.5, D2=-2.1, D3=0.8, D4=-1.9, D5=2.5.

Problem 105

Identify the outlier from residuals X1=0.1, X2=0.05, X3=-7.5, X4=0.2.

Open in simulator

Problem 106

Identify the outlier from residuals R1=0, R2=0.01, R3=-0.02, R4=0.05.

Problem 107

Identify the outlier from residuals S1=0.001, S2=-0.005, S3=50, S4=0.002.

Problem 108

Identify the outlier from residuals T1=-1.1, T2=0.9, T3=1.3, T4=-15.

compare error to practical tolerance.

12 problems Warmup Practice Mixed Review Assessment

Problem 109

Decide whether residual $3 is acceptable for context a grocery cost estimate with tolerance $5.

Problem 110

Decide whether residual 12 minutes is acceptable for context bus arrival prediction with tolerance 3 minutes.

Problem 111

Decide whether residual 0.2 cm is acceptable for context manufacturing part length with tolerance 0.1 cm.

Open in simulator

Problem 112

Decide whether residual -1 point is acceptable for context test score prediction where a few points of error is normal.

Problem 113

Decide whether residual 2 degrees Celsius is acceptable for context a weather forecast with a typical error of 1 degree Celsius.

Problem 114

Decide whether residual -5 meters is acceptable for context a GPS location reading with a tolerance of 10 meters.

Problem 115

Decide whether residual 0.05 kg is acceptable for context a package weight measurement for shipping with a tolerance of 0.1 kg.

Problem 116

Decide whether residual 0.5 seconds is acceptable for context a stopwatch measurement for a sprint with a required precision of 0.1 seconds.

Problem 117

Decide whether residual -10 mL is acceptable for context a chemical solution preparation where an error of up to 20 mL is acceptable.

Problem 118

Decide whether residual $500 is acceptable for context a project budget estimate with a strict tolerance of $100.

Problem 119

Decide whether residual 3 mph is acceptable for context a car speedometer calibration where an error of up to 5 mph is considered normal.

Problem 120

Decide whether residual -2 items is acceptable for context an inventory count where an error of 1-2 items is common and acceptable.

connect residuals to prediction errors.

12 problems Warmup Practice Mixed Review Assessment

Problem 121

Explain why residuals help evaluate model fit for linear model predicting test scores.

Problem 122

Explain why residuals help evaluate model fit for scatter plot with possible nonlinear trend.

Open in simulator

Problem 123

Explain why residuals help evaluate model fit for two competing fitted models.

Problem 124

Explain why residuals help evaluate model fit for a simple linear regression model.

Problem 125

Explain why residuals help evaluate model fit for a multiple regression model for sales forecasting.

Problem 126

Explain why residuals help evaluate model fit for a model predicting a student's final grade based on midterm scores.

Problem 127

Explain why residuals help evaluate model fit for a polynomial regression model for growth data.

Problem 128

Explain why residuals help evaluate model fit for a time series model for daily temperature.

Problem 129

Explain why residuals help evaluate model fit for a model for predicting crop yield based on fertilizer use.

Problem 130

Explain why residuals help evaluate model fit for a model for predicting customer churn.

Problem 131

Explain why residuals help evaluate model fit for a model for predicting the spread of a virus.

Problem 132

Explain why residuals help evaluate model fit for a model for manufacturing defect rates.

catch reversed subtraction, wrong prediction, or sign confusion.

12 problems Warmup Practice Mixed Review Assessment

Problem 133

Correct the residual error in Observed 20, predicted 25, residual reported as 5.

Open in simulator

Problem 134

Correct the residual error in Residual -4 is interpreted as the model underpredicted by 4.

Problem 135

Correct the residual error in Predicted value from the wrong x-row is used to compute residual.

Problem 136

Correct the residual error in Residuals are plotted against predicted y-values instead of x-values.

Problem 137

Correct the residual error in Observed value is 15, predicted value is 12, and the residual is calculated as 12 - 15 = -3.

Problem 138

Correct the residual error in A residual of +6 is interpreted as the model overpredicting the observed value by 6.

Problem 139

Correct the residual error in When calculating the residual for an observed value at x=10, the prediction for x=8 was mistakenly used.

Problem 140

Correct the residual error in For an observed value of 42 and a predicted value of 48, the residual is reported as 6.

Problem 141

Correct the residual error in A negative residual indicates that the model's prediction was lower than the actual observed value.

Problem 142

Correct the residual error in A residual plot shows observed y-values on the horizontal axis and residual values on the vertical axis.

Problem 143

Correct the residual error in The residual for an observed value of 70 and a predicted value of 75 is given as |70 - 75| = 5.

Problem 144

Correct the residual error in A residual of -2 means the model had an error of 2 units below the observed value.