Math I · S-ID.6.c

Fitting a Linear Function When Data Suggest a Linear Association

This objective teaches students how to turn a messy cloud of real observations into a usable model. Real data rarely fall perfectly on a line, but a well-chosen line can still reveal a pattern, make predictions, and support decisions.

Concept Statistics and Probability

Domain Interpreting Categorical and Quantitative Data

Read time 12 minutes

What this learning objective is really asking you to learn

This learning objective asks students to move from seeing data to modeling data. In the previous statistics objectives, students learned to represent one-variable data, compare distributions, read two-way tables, create scatter plots, and check model fit with residuals. Objective 056 now asks students to do something central to modern applied mathematics: look at a relationship between two quantitative variables and decide whether a line is a reasonable summary of that relationship.

A scatter plot is a picture of paired numerical observations. Each point represents a pair such as study hours and test score, outside temperature and electricity use, age of a used car and resale price, height and arm span, advertising spending and sales, or year and average cost. The horizontal coordinate represents one variable. The vertical coordinate represents the other. When the points form a rough upward or downward band, students may describe the association as roughly linear. That does not mean the points lie exactly on one line. Real measurements include variation, individual differences, rounding, noise, and omitted factors. Linear association means that a line captures the general direction and pattern well enough to be useful.

A linear function has the form \(y = mx + b\), where \(m\) is the slope and \(b\) is the y-intercept. In a data-modeling setting, the line is often written as a prediction equation, such as \(predicted y = mx + b\). The line is not claiming that every point must follow the equation exactly. Instead, it gives an estimated or typical value of \(y\) for each value of \(x\). If the line predicts that a car loses about 1,800 dollars of value per year, an actual car may be above or below that prediction depending on mileage, condition, brand, accident history, and market conditions. The line gives the broad relationship, not the entire universe.

The key phrase is when a scatter plot suggests a linear association. Students must learn that fitting a line is not always appropriate. A scatter plot may curve upward, flatten out, bend downward, split into clusters, or show no clear pattern at all. A linear model is useful when the points show a roughly straight trend. If the data are clearly curved, a line may produce misleading predictions. If the data have no association, a line may create the illusion of a relationship where none exists. If there are extreme outliers, one or two points may pull the line away from the pattern followed by most of the data.

There are several ways to fit a linear function. In early high-school work, students may draw a line that goes through the middle of the data cloud, with roughly balanced points above and below. They may choose two representative points on the line and use those points to calculate slope. They may use technology to compute a least-squares regression line. They may compare candidate lines by looking at residuals. The level of technique can vary, but the purpose is the same: choose a line that describes the central trend of a two-variable data set.

The fitted line is a model. A model is a simplified mathematical representation of something more complicated. The model leaves out some detail so that the main structure can be seen and used. This is not a weakness; it is the whole point. A map is useful because it leaves out many details and highlights the features needed for navigation. A linear model is useful because it compresses many data points into two parameters: slope and intercept. Those two numbers tell a compact story about direction, rate, and baseline.

When students fit a line, they are making several decisions. Which variable should go on the horizontal axis? Which variable is being predicted? Is the relationship roughly linear? Are there outliers? Does the line pass through the middle of the points? Does it make sense in context? Are predictions being made within the observed range of data, or are they being extrapolated far beyond the data? A strong student does not merely produce an equation. A strong student defends why the equation is reasonable and describes what it can and cannot tell us.

Why students should learn this math

Students should learn this objective because linear modeling is one of the most common ways humans make practical predictions from imperfect data. The world does not usually hand us exact formulas. Instead, we collect observations. We notice patterns. We summarize those patterns with models. Then we use the models to predict, compare, budget, design, and argue.

Consider a student trying to understand the relationship between hours studied and exam score. There may be a positive relationship: more study time often leads to higher scores. But the points will not fall perfectly on a line. Some students study efficiently, some are tired, some already know the material, some guess well, and some have test anxiety. A linear model can still help answer a practical question: on average, how much does the score seem to increase for each additional hour of studying? That kind of answer is more useful than a vague statement such as “studying helps.”

Businesses use linear models constantly. A store might compare advertising dollars and weekly sales. A delivery company might compare miles driven and fuel cost. A restaurant might compare number of customers and staffing needs. A streaming company might compare viewing time and subscription renewal. A city might compare traffic volume and commute time. These relationships may not be perfectly linear, but over a certain range, a line can be a useful first model.

Science also depends on fitted lines. In chemistry, calibration curves relate instrument readings to known concentrations. In physics, position-time graphs can reveal constant velocity. In biology, body measurements often show approximate linear relationships over limited ranges. In environmental science, data may show trends over time. Researchers do not expect every measurement to be perfect. They use fitted models to separate pattern from noise.

This objective also matters for civic life. People encounter claims such as “as education increases, income increases,” “as screen time increases, sleep decreases,” or “as housing supply changes, rent changes.” Such claims are often based on scatter plots and fitted lines. A student who understands linear modeling can ask better questions. Is the association actually linear? How strong is it? Were outliers handled? What is the slope? What is the uncertainty? Are we predicting inside the data range or far outside it? Does the model show causation or only association?

The deeper reason is that fitting a line teaches disciplined simplification. Many students think math is about exact answers. Real modeling is often about useful approximations. A line of best fit is not exact. It is not supposed to be exact. Its job is to capture the main tendency in a messy situation while acknowledging that individual cases vary. This is a mature idea. It prepares students for science, economics, statistics, engineering, medicine, data science, and responsible citizenship.

The historical machinery behind this idea

The history of fitting lines is tied to measurement, astronomy, navigation, and the need to make sense of error. Before modern computers, astronomers made repeated observations of planets, comets, and stars. Measurements were never perfect. Instruments had limits. Observers made tiny mistakes. Atmospheric conditions interfered. Yet scientists needed to predict celestial motion with high accuracy because calendars, navigation, and theory depended on it.

One major historical breakthrough was the method of least squares, associated with Adrien-Marie Legendre and Carl Friedrich Gauss in the early nineteenth century. The basic idea is simple but powerful: choose the line or curve that makes the squared prediction errors as small as possible overall. A prediction error is the vertical difference between an observed value and the value predicted by the model. In modern classroom language, that difference is a residual. Squaring the residuals prevents positive and negative errors from canceling out and gives larger errors more weight.

Least squares became important because it gave a systematic way to combine imperfect observations. Instead of choosing a line by eye, mathematicians could define an optimal line according to a clear criterion. This did not make models perfect, but it made them more objective and repeatable. The same idea expanded far beyond astronomy into physics, engineering, economics, psychology, biology, and almost every quantitative field.

Linear modeling also belongs to the history of analytic geometry. When René Descartes and others connected algebra to coordinate geometry, equations became shapes and shapes became equations. A line could be described by a symbolic rule. Its slope and intercept became meaningful quantities. This connection made it possible to treat data relationships as geometric objects: clouds of points, fitted lines, deviations, slopes, and intercepts.

In the twentieth and twenty-first centuries, linear regression became a central tool of statistics. Computers made it easy to calculate fitted lines for large data sets. But technology did not remove the need for judgment. In fact, it made judgment more important. A calculator or spreadsheet can fit a line to almost anything, even data where a line is nonsense. The human job is to decide whether the line is appropriate, whether the variables make sense, and whether the interpretation is honest.

This objective introduces students to that historical machine in an accessible form. Students are not yet proving least-squares formulas in full. They are learning the first layer: scatter plot, pattern, fitted line, prediction, residual, interpretation, limitation. That first layer is the doorway to regression, correlation, inference, experimental design, and causal reasoning.

Technical execution: how to fit and use a linear model

A good process begins with context. Identify the two variables. Decide which variable is the input and which is the output. The input often represents the quantity used for prediction, such as hours studied, age, temperature, or distance. The output is the quantity being predicted, such as score, price, energy use, or cost. This choice matters because the fitted equation is written to predict one variable from the other.

Next, make a scatter plot with appropriate scales and labels. The scale should reveal the pattern without exaggerating or hiding variation. Students should look for direction, form, strength, clusters, and outliers. Direction asks whether the association is positive, negative, or neither. Form asks whether the pattern is roughly linear or curved. Strength asks how tightly the points follow the pattern. Clusters may show subgroups. Outliers may signal unusual cases, data-entry errors, or important exceptions.

If the scatter plot suggests a straight-line pattern, fit a line. By hand, students can draw a line that passes through the middle of the data cloud with points roughly balanced above and below. The line should follow the trend, not chase every point. Then choose two points on the line, preferably not just two data points unless they lie well on the trend, and calculate slope: \(m = change in y / change in x\). Use one point and the slope to write the equation \(y = mx + b\) or use point-slope form first.

With technology, students may enter the data into a calculator, spreadsheet, or statistics tool and request a linear regression equation. The technology may return values such as \(y = 2.4x + 18.7\). Students should not stop there. They must ask whether the result fits the scatter plot and whether the parameters make sense in context. A line produced by technology can be mathematically computed and still be a poor model.

After fitting the line, use it carefully. To predict, substitute an input value into the equation. If the model is \(score = 4.2(hours) + 63\), then a student who studies 5 hours has a predicted score of \(4.2(5) + 63 = 84\). The word “predicted” matters. It is not a guarantee. It is an estimate based on the data pattern.

Students should distinguish interpolation from extrapolation. Interpolation means predicting within the range of observed data. If the data include study times from 1 to 8 hours, predicting for 5 hours is interpolation. Extrapolation means predicting beyond the observed range. Predicting for 30 hours of study would be extrapolation and might be unreasonable. Many linear relationships hold only over a limited range. A model that works for temperatures between 50 and 90 degrees may fail at extreme values. A model that works for ages 12 to 18 may fail for toddlers or adults.

Residuals help check the line. For each data point, the residual is \(actual y - predicted y\). If residuals show no obvious pattern and are relatively small, the line may be reasonable. If residuals curve, fan out, or cluster in a pattern, the line may be missing important structure. This connects directly to Objective 055.

Students should also understand that fitting a line is not the same as proving a cause. A positive linear association between two variables does not prove one causes the other. That issue becomes central in Objective 059. For now, students should describe the line as a model of association unless the data come from a well-designed experiment or there is strong causal evidence from other reasoning.

A concrete example

Suppose a class collects data on the number of hours students practiced a skill and their performance score. The scatter plot shows an upward trend. The points are not perfect, but they form a rough band that rises from left to right. A linear model seems reasonable.

A student draws a line through the center of the cloud and chooses two points on the line, perhaps \((2, 68)\) and \((7, 88)\). The slope is \((88 - 68) / (7 - 2) = 20 / 5 = 4\). The model is approximately \(score = 4(hours) + 60\). In context, this means the predicted score increases by about 4 points for each additional hour of practice, and the model predicts about 60 points when practice time is 0 hours.

The student can then predict that 6 hours of practice corresponds to about \(4(6) + 60 = 84\) points. But the explanation should include limits. The line is based on the data range. It predicts typical performance, not exact performance. It does not prove that every extra hour causes exactly 4 more points. Other factors may matter. The intercept may or may not be meaningful, depending on whether 0 hours was observed and whether the model remains reasonable there.

This example shows the full machinery: scatter plot, visual judgment, fitted line, slope, intercept, prediction, and caution.

Where this objective fits on the full map of mathematics

On the big map of math, this objective is where functions become statistical models. Earlier in Math I, students learned that linear functions have constant rates of change and graphs that are lines. They learned slope, intercepts, equations, inequalities, systems, and function notation. In statistics, they learned to represent and interpret data. Objective 056 merges those strands. A line is no longer only a perfect graph of exact solutions. It becomes an approximate model of a noisy relationship.

This is a major conceptual shift. In algebra, the point \((3, 11)\) either lies on the line \(y = 2x + 5\) or it does not. In statistics, a point can miss the fitted line and still belong to the relationship. The distance from the point to the line is not a failure; it is information. This prepares students for the statistical worldview: variation is normal, models are judged by usefulness, and evidence comes in degrees.

The objective also prepares students for Math II and Math III. Quadratic, exponential, logarithmic, and trigonometric models all extend the same basic idea: choose a function type that fits a pattern and interpret its parameters. Later, students will ask whether growth is linear or exponential, whether a quadratic model fits projectile motion, whether a normal distribution fits data, or whether sample evidence supports a claim. The habit begins here.

Common misconceptions and how to fix them

One misconception is that the line must go through every point. That is usually impossible with real data. The fitted line represents the trend, not each individual observation. Another misconception is that any scatter plot can use a line. A line should be fitted when the data suggest a roughly linear association. If the pattern curves, a different model may be better.

A third misconception is that the best line is the one connecting the first and last data points. Those points may be unusual or noisy. A better line follows the center of the entire data cloud. A fourth misconception is that predictions are facts. Predictions from fitted lines are estimates. They should be described with uncertainty, especially when residuals are large.

A fifth misconception is that technology replaces thinking. Technology can compute a line quickly, but it cannot decide whether the model is meaningful in context. Students must still inspect the graph, consider outliers, interpret slope and intercept, and avoid extrapolating carelessly.

Mastery looks like this

A student has mastered this objective when they can look at a scatter plot and say whether a linear model is appropriate; fit a line by hand or with technology; write an equation; use the equation for predictions; interpret the model in the original context; and explain limitations. The strongest evidence of mastery is not the equation alone. It is the student's explanation of why the line is reasonable, what the slope and intercept mean, where the model can be used safely, and why the line does not turn data into certainty.

When students learn this well, they gain a tool they will use for the rest of their lives. A fitted line is one of the simplest and most powerful ways to turn messy experience into quantitative insight.

Problem Library

Problems in the App From This Objective

144 problems across 12 archetypes in the app.

recognize approximate straight-line pattern.

12 problems Warmup Practice Mixed Review Assessment

Problem 1

Decide whether scatter plot description points fall close to an upward sloping straight band suggests a linear association.

Problem 2

Decide whether scatter plot description points curve upward more steeply as x increases suggests a linear association.

Problem 3

Decide whether scatter plot description points are scattered with no pattern suggests a linear association.

Problem 4

Decide whether scatter plot description points fall close to a downward sloping straight band suggests a linear association.

Problem 5

Decide whether scatter plot description points closely follow an increasing straight line suggests a linear association.

Problem 6

Decide whether scatter plot description points closely follow a decreasing straight line suggests a linear association.

Problem 7

Decide whether scatter plot description points form a clear U-shaped curve suggests a linear association.

Open in simulator

Problem 8

Decide whether scatter plot description points form a clear inverted U-shaped curve suggests a linear association.

Problem 9

Decide whether scatter plot description points are widely dispersed with no apparent trend suggests a linear association.

Problem 10

Decide whether scatter plot description points trend upward in a generally straight path suggests a linear association.

Problem 11

Decide whether scatter plot description points trend downward in a generally straight path suggests a linear association.

Problem 12

Decide whether scatter plot description points increase slowly at first, then rapidly as x increases suggests a linear association.

balance points and follow trend.

12 problems Warmup Practice Mixed Review Assessment

Problem 13

Describe a reasonable line of fit for scatter data 1, 3; 2, 5; 3, 6; 4, 8; 5, 11.

Problem 14

Describe a reasonable line of fit for scatter data 0, 10; 1, 8; 2, 7; 3, 4; 4, 3.

Problem 15

Describe a reasonable line of fit for scatter data 2, 4; 4, 5; 6, 7; 8, 8.

Problem 16

Describe a reasonable line of fit for scatter data 10, 22; 11, 24; 12, 25; 13, 27; 14, 29; 15, 31.

Open in simulator

Problem 17

Describe a reasonable line of fit for scatter data 1, 15; 2, 13; 3, 11; 4, 9.

Problem 18

Describe a reasonable line of fit for scatter data 1, 1.5; 2, 2.2; 3, 3.1; 4, 3.8; 5, 4.5.

Problem 19

Describe a reasonable line of fit for scatter data 0.5, 9.5; 1.5, 8.0; 2.5, 6.5; 3.5, 5.0; 4.5, 3.5.

Problem 20

Describe a reasonable line of fit for scatter data 1, 5.1; 2, 4.9; 3, 5.2; 4, 4.8; 5, 5.0.

Problem 21

Describe a reasonable line of fit for scatter data 1, 5; 2, 9; 3, 14; 4, 18; 5, 23.

Problem 22

Describe a reasonable line of fit for scatter data 1, 20; 2, 15; 3, 10; 4, 6; 5, 1.

Problem 23

Describe a reasonable line of fit for scatter data 10, 10; 11, 12; 12, 13; 13, 15; 14, 16.

Problem 24

Describe a reasonable line of fit for scatter data 1, 2; 2, 4; 3, 5.5; 4, 7; 5, 8.5; 6, 10.

choose two points on the line and compute slope.

12 problems Warmup Practice Mixed Review Assessment

Problem 25

Estimate slope from line-of-fit points 0, 4; 5, 14.

Open in simulator

Problem 26

Estimate slope from line-of-fit points 2, 9; 6, 1.

Problem 27

Estimate slope from line-of-fit points 1, 3; 9, 7.

Problem 28

Estimate slope from line-of-fit points 10, 20; 30, 25.

Problem 29

Estimate slope from line-of-fit points 1, 2; 3, 8.

Problem 30

Estimate slope from line-of-fit points -1, 7; 1, 1.

Problem 31

Estimate slope from line-of-fit points 0, 0; 6, 4.

Problem 32

Estimate slope from line-of-fit points -2, 5; 2, 2.

Problem 33

Estimate slope from line-of-fit points 1, 1; 3, 9.

Problem 34

Estimate slope from line-of-fit points -1, 8; 1, 0.

Problem 35

Estimate slope from line-of-fit points -3, 1; 3, 3.

Problem 36

Estimate slope from line-of-fit points 0, 7; 5, 2.

read or extrapolate intercept.

12 problems Warmup Practice Mixed Review Assessment

Problem 37

Estimate the y-intercept from fitted line information line of fit crosses the y-axis near 6.

Problem 38

Estimate the y-intercept from fitted line information line of fit through (2,10) with slope 3.

Problem 39

Estimate the y-intercept from fitted line information line of fit through (5,8) with slope -1.

Open in simulator

Problem 40

Estimate the y-intercept from fitted line information line of fit is off-window but through (10,20) with slope 0.5.

Problem 41

Estimate the y-intercept from fitted line information line of fit crosses the y-axis at -3.

Problem 42

Estimate the y-intercept from fitted line information line of fit through (1, 7) with slope 2.

Problem 43

Estimate the y-intercept from fitted line information line of fit through (4, 2) with slope -0.5.

Problem 44

Estimate the y-intercept from fitted line information line of fit through (-3, 8) with slope 0.

Problem 45

Estimate the y-intercept from fitted line information line of fit through (6, 1) with slope -1/3.

Problem 46

Estimate the y-intercept from fitted line information line of fit through (1, 5) and (3, 9).

Problem 47

Estimate the y-intercept from fitted line information line of fit through (-2, 12) with slope 1.5.

Problem 48

Estimate the y-intercept from fitted line information line of fit passes through the point (0, -5).

combine slope and intercept in equation.

12 problems Warmup Practice Mixed Review Assessment

Problem 49

Write a linear model from fitted line slope 2 and intercept 5.

Problem 50

Write a linear model from fitted line slope -1.5 and intercept 12.

Problem 51

Write a linear model from fitted line slope 1/4 and intercept 3.

Problem 52

Write a linear model from fitted line slope 0 and intercept 8.

Problem 53

Write a linear model from fitted line slope -3 and intercept -7.

Problem 54

Write a linear model from fitted line slope 2/3 and intercept -4.

Problem 55

Write a linear model from fitted line slope 0.5 and intercept 1.2.

Problem 56

Write a linear model from fitted line slope 1 and intercept 6.

Problem 57

Write a linear model from fitted line slope -1 and intercept 10.

Problem 58

Write a linear model from fitted line slope 15 and intercept 1.

Problem 59

Write a linear model from fitted line slope -1/2 and intercept 3/4.

Open in simulator

Problem 60

Write a linear model from fitted line slope 4 and intercept 0.

explain average change in context.

12 problems Warmup Practice Mixed Review Assessment

Problem 61

Interpret slope 4 dollars per ticket of fitted linear model in context cost versus number of tickets.

Problem 62

Interpret slope -2 degrees per hour of fitted linear model in context temperature over time.

Problem 63

Interpret slope 0.8 points per hour of fitted linear model in context study time and test score.

Problem 64

Interpret slope 60 miles per hour of fitted linear model in context distance traveled versus time.

Open in simulator

Problem 65

Interpret slope 0.5 cm per day of fitted linear model in context plant height versus number of days.

Problem 66

Interpret slope -0.02 gallons per mile of fitted linear model in context fuel remaining versus distance driven.

Problem 67

Interpret slope 15 dollars per item of fitted linear model in context total revenue versus number of items sold.

Problem 68

Interpret slope 120 people per year of fitted linear model in context city population over years.

Problem 69

Interpret slope -1.5 pounds per week of fitted linear model in context body weight over weeks.

Problem 70

Interpret slope 2.5 dollars per unit of fitted linear model in context total production cost versus number of units produced.

Problem 71

Interpret slope -5 percentage points per hour of fitted linear model in context battery charge versus hours of use.

Problem 72

Interpret slope 10 visitors per dollar of fitted linear model in context number of website visitors versus advertising spend.

explain starting prediction and whether it is meaningful.

12 problems Warmup Practice Mixed Review Assessment

Problem 73

Interpret intercept 12 dollars of fitted linear model in context cost model for x items.

Problem 74

Interpret intercept 80 points of fitted linear model in context score predicted from study hours.

Problem 75

Interpret intercept -5 cm of fitted linear model in context plant height versus days after planting.

Problem 76

Interpret intercept 30 mph of fitted linear model in context car speed vs. time since starting (in minutes).

Problem 77

Interpret intercept 500 liters of fitted linear model in context water volume in a tank vs. time since a leak started (in hours).

Problem 78

Interpret intercept 0.5 kg of fitted linear model in context weight of a package vs. number of items inside.

Problem 79

Interpret intercept 10 degrees Celsius of fitted linear model in context room temperature vs. hours after heater turned off.

Problem 80

Interpret intercept -100 dollars of fitted linear model in context company profit vs. number of units sold.

Problem 81

Interpret intercept 200 people of fitted linear model in context population of a town vs. years since a new factory opened.

Problem 82

Interpret intercept 0 minutes of fitted linear model in context time to complete a task vs. hours of training.

Open in simulator

Problem 83

Interpret intercept 1.5 meters of fitted linear model in context height of a child vs. age in years.

Problem 84

Interpret intercept 25 units of fitted linear model in context product inventory vs. days since last restock.

predict within data range.

12 problems Warmup Practice Mixed Review Assessment

Problem 85

Use linear model y=3x+2 to interpolate at input 4 within data range 1 to 8.

Problem 86

Use linear model y=-2x+20 to interpolate at input 6 within data range 0 to 10.

Problem 87

Use linear model y=0.5x+7 to interpolate at input 12 within data range 10 to 15.

Problem 88

Use linear model y=5x-3 to interpolate at input 2 within data range 0 to 5.

Problem 89

Use linear model y=-x+15 to interpolate at input 7 within data range 5 to 10.

Problem 90

Use linear model y=2.5x+1 to interpolate at input 4 within data range 2 to 6.

Problem 91

Use linear model y=10x-50 to interpolate at input 6 within data range 5 to 7.

Problem 92

Use linear model y=-3x+100 to interpolate at input 20 within data range 10 to 30.

Open in simulator

Problem 93

Use linear model y=0.2x+5 to interpolate at input 10 within data range 0 to 20.

Problem 94

Use linear model y=x/2+3 to interpolate at input 8 within data range 6 to 10.

Problem 95

Use linear model y=-4x+0 to interpolate at input 1 within data range 0 to 2.

Problem 96

Use linear model y=1.5x-0.5 to interpolate at input 3 within data range 1 to 5.

predict outside range and note risk.

12 problems Warmup Practice Mixed Review Assessment

Problem 97

Use linear model y=3x+2 to extrapolate at input 15 outside data range 1 to 8.

Problem 98

Use linear model y=-2x+20 to extrapolate at input 14 outside data range 0 to 10.

Problem 99

Use linear model y=0.5x+7 to extrapolate at input 20 outside data range 10 to 15.

Problem 100

Use linear model y=4x-5 to extrapolate at input 12 outside data range 0 to 10.

Problem 101

Use linear model y=-x+100 to extrapolate at input 40 outside data range 50 to 80.

Problem 102

Use linear model y=10x+1 to extrapolate at input 7 outside data range 0 to 5.

Problem 103

Use linear model y=2.5x+10 to extrapolate at input 0 outside data range 1 to 5.

Problem 104

Use linear model y=-3x+50 to extrapolate at input 20 outside data range 10 to 15.

Problem 105

Use linear model y=0.1x+2 to extrapolate at input 35 outside data range 20 to 30.

Problem 106

Use linear model y=5x-10 to extrapolate at input 1 outside data range 2 to 6.

Problem 107

Use linear model y=-0.5x+10 to extrapolate at input 15 outside data range 0 to 10.

Problem 108

Use linear model y=x+100 to extrapolate at input 5 outside data range 10 to 20.

Open in simulator

choose better fit based on residuals or point balance.

12 problems Warmup Practice Mixed Review Assessment

Problem 109

Compare possible lines of fit balances points with small random residuals and lies above most points for data description linear upward scatter.

Problem 110

Compare possible lines of fit passes through an outlier and misses the main cluster and follows the main cluster with one large outlier residual for data description linear data with one outlier.

Problem 111

Compare possible lines of fit has residuals -5,-4,-3,-2 and has residuals -1,1,-1,1 for data description same data.

Problem 112

Compare possible lines of fit passes directly through the center of the data cloud and consistently lies below the majority of the points for data description scattered points showing a positive linear trend.

Problem 113

Compare possible lines of fit shows a clear U-shaped pattern in its residuals and has residuals randomly scattered above and below zero for data description data with a slight curve.

Problem 114

Compare possible lines of fit has residuals of 2, 3, 4, 5 and has residuals of -1, 0, 1, 2 for data description same data set.

Problem 115

Compare possible lines of fit is pulled towards an extreme outlier, distorting the main trend and ignores the outlier and fits the main cluster of points well for data description data with one significant outlier far from the main cluster.

Problem 116

Compare possible lines of fit has a sum of residuals of +15 and has a sum of residuals of -2 for data description a set of 5 data points.

Problem 117

Compare possible lines of fit has more points above the line than below and has an equal number of points above and below, and is centered for data description data with a clear linear relationship.

Problem 118

Compare possible lines of fit has all positive residuals and has both positive and negative residuals for data description a scatter plot of points.

Open in simulator

Problem 119

Compare possible lines of fit is steeper but has larger residuals and is less steep but has smaller, more random residuals for data description data with a moderate positive correlation.

Problem 120

Compare possible lines of fit has residuals ranging from -10 to 10 and has residuals ranging from -2 to 2 for data description same data set.

decide whether outlier affects model choice.

12 problems Warmup Practice Mixed Review Assessment

Problem 121

Account for outlier one point far above the otherwise linear cluster when fitting a line to data pattern main cluster trends upward.

Problem 122

Account for outlier one high-leverage point far to the right that pulls the line upward when fitting a line to data pattern remaining points have weak trend.

Problem 123

Account for outlier one point slightly away from the line but within general scatter when fitting a line to data pattern strong linear trend.

Problem 124

Account for outlier one point far below the otherwise linear cluster when fitting a line to data pattern main cluster trends upward.

Problem 125

Account for outlier a high-leverage point at a low x-value that pulls the line significantly upward when fitting a line to data pattern the remaining points show a moderate downward trend.

Problem 126

Account for outlier one point near the mean x-value but significantly below the trend line when fitting a line to data pattern a strong upward linear trend with tight scatter.

Problem 127

Account for outlier an influential point at a high x-value with a y-value that is too low, significantly reducing the slope when fitting a line to data pattern the majority of points show a steep positive linear trend.

Problem 128

Account for outlier an influential point at a low x-value with a y-value that is too high, significantly increasing the slope when fitting a line to data pattern the majority of points show a moderate positive linear trend.

Problem 129

Account for outlier a single point far from the main cluster in both x and y dimensions when fitting a line to data pattern the remaining points are scattered with no discernible linear trend.

Problem 130

Account for outlier a point with an impossible or clearly erroneous value (e.g., a measurement of zero where only positive values are possible) when fitting a line to data pattern an otherwise consistent positive linear relationship.

Open in simulator

Problem 131

Account for outlier a point that clearly belongs to a different underlying process or subpopulation when fitting a line to data pattern the main dataset forms a distinct linear pattern.

Problem 132

Account for outlier a high-leverage point far to the right that perfectly aligns with the strong linear trend when fitting a line to data pattern a strong upward linear trend.

catch bad line placement, wrong slope, or swapped axes.

12 problems Warmup Practice Mixed Review Assessment

Problem 133

Correct the linear-fit modeling error in line of fit is drawn below nearly all points.

Problem 134

Correct the linear-fit modeling error in slope is calculated using run over rise.

Problem 135

Correct the linear-fit modeling error in axes are swapped before writing the prediction model.

Problem 136

Correct the linear-fit modeling error in model is used far outside data range without warning.

Problem 137

Correct the linear-fit modeling error in line of fit is drawn above nearly all points.

Problem 138

Correct the linear-fit modeling error in slope is calculated using only two extreme points.

Problem 139

Correct the linear-fit modeling error in dependent variable is plotted on the x-axis.

Problem 140

Correct the linear-fit modeling error in predictions are made far below the minimum observed x-value.

Open in simulator

Problem 141

Correct the linear-fit modeling error in line of fit does not pass through the mean of x and mean of y.

Problem 142

Correct the linear-fit modeling error in the sign of the calculated slope is ignored.

Problem 143

Correct the linear-fit modeling error in axes are not labeled with appropriate units.

Problem 144

Correct the linear-fit modeling error in the line of fit clearly does not follow the trend of the data.