Michael Short
803781208
Stats 10 Lab 2
Dis 2b
Question 1
I would use a scatter plot to show the relationship between runs and another numerical
variable.
The relationship does appear to have somewhat of a positive linear relationship.
The graph says that there appears to be a correlation between at bats and runs scored and
may suggest a link between the two saying that the more at bats directly relates to more
runs.
Question 2
If I had to summarize my data with a single line I would put the line in a position where
half of the data is above the line and the other half of the data is below the line. As I move
the line to better fit the data the sum of squares decreases, because the line of best fit is
the line where the sum of the squares is smallest, so as the line comes closer to the line of
best fit the sum of the squares decreases.
Question 3
The sum of squares using the least squares line versus the line I chose is much smaller.
The line I chose had a sum of squares of 110000 compared to the least squares line which
had a sum of squares of 102800.
This preview has intentionally blurred sections. Sign up to view the full version.
View Full DocumentQuestion 4
Using the regression line one would predict that a team with 5,578 at bats in a season
would score around 775 runs. This prediction is about 44.9 runs off based on the given
This is the end of the preview.
Sign up
to
access the rest of the document.
 Fall '08
 Ioudina
 Regression Analysis, regression line, runs, bats graph

Click to edit the document details