Chi Square Handout - S08

Chi Square Handout - S08 - STAT E-50 - Introduction to...

Info iconThis preview shows pages 1–5. Sign up to view the full content.

View Full Document Right Arrow Icon
STAT E-50 - Introduction to Statistics The Chi-Square Test The chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed frequencies with expected frequencies . In a hypothesis test, the expected frequencies are those we would expect if the null hypothesis of our test is true. ( ) χ = 2 2 OE E The formula is: where O represents the observed frequency and E represents the expected frequency. The value of df depends on the type of test you are performing. The Chi-Square Distribution The P 2 distribution is: • nonnegative • not symmetrical; it is skewed to the right • distributed to form a family of distributions, with a separate distribution for each different number of degrees of freedom.
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
The Chi-Square Test for Goodness of Fit The goodness-of-fit test compares the distribution of observed outcomes for a single categorical variable to the expected outcomes predicted by a probability model. This test involves one sample, and one variable. Assumptions and Conditions: Counted data condition Be sure that the data is counts, or frequencies Independence assumption Randomization condition Sample size assumption Expected cell frequency condition: each expected frequency is at least 5 The Chi-square test is one-sided. 0 P 2 (df, α ) Page 2
Background image of page 2
Automobile insurance is much more expensive for teenage drivers than for older drivers. To justify this cost difference, insurance companies claim that the younger drivers are much more likely to be involved in costly accidents. To test this claim, a researcher obtains information about registered drivers from the Department of Motor Vehicles and selects a sample of 300 accident reports from the police department. The DMV reports the percentage of registered drivers in each age category as reported below. The number of accident reports is also shown. Does this data indicate that accidents occur with the same distribution as the ages of the drivers? H 0 : The distribution of accidents is the same as the distribution of registered drivers. H a : The distribution of accidents is not the same as the distribution of registered drivers. AGE percent of drivers number of accidents (observed) expected O - E () 2 OE 2 E under age 20 16 68 age 20 - 29 28 92 age 30 or older 56 140 Check the conditions: Counted data condition Randomization condition Expected cell frequency condition Specify the sampling distribution model and the test you will use. 2 2 E χ= ∑ , with df = k-1 χ 2 = P-value: Statistical conclusion: Conclusion in context: Page 3
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Using Minitab for a Goodness of Fit Test If you have the expected proportions: 1. Enter the observed values in a column named “observed” 2. Select < Stat < Tables < Chi-Square Goodness-of-Fit Test (One Variable) Tell Minitab the column where the observed frequencies are stored Click on Test Specific Proportions In the drop-down box select Input constants
Background image of page 4
Image of page 5
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 05/20/2008 for the course STAT 50 taught by Professor Weinstein during the Spring '08 term at Harvard.

Page1 / 14

Chi Square Handout - S08 - STAT E-50 - Introduction to...

This preview shows document pages 1 - 5. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online