a one-sample test because the same group is being tested twice (or sometimesrelated/dependent groups e.g. pairs of twins)25•If we rejectܪwhen it is true, this is atype I error•If we fail to rejectܪwhen it is false, this is atype II error26Types of error
4/02/201914Central Limit Theorem•If the population sampled from is Normally distributed, theܺതdistribution will be Normally distributedwhatever the size of݊Mean ofܺതdistribution isߤStandard deviation ofܺതdistribution isఙ(also called ‘standard error’)Variance ofܺതdistribution isఙమࢄഥ~ࡺ(ࣆ,࣌)•Central Limit Theorem: if we take many random samples of size݊andif݊ ≥ 30, theܺതdistribution will approximate a Normal distributioneven when the population sampled from is not Normally distributedscan be substituted in place of࣌so that௦replacesఙin calculations27Distribution of the sample mean•When݊ ≥ 30and sampling is random, sample means (ܺത) will beNormally distributed (Central Limit Theorem)test statistic isݖ =௫̅ିఓൗ(=௦ ି௬௧௦௦ௗ ௦௧ௗௗ )can substitutesinstead of࣌and replaceݖ =௫̅ିఓൗwithݖ ≈௫̅ିఓೞൗtest statistic is nowZ,hence conducting aZ-test (using Normal dist)•When݊ < 30, can also substitutesinstead of࣌and useappropriatet-distributionbut only if sampling from a Normally distributed population•Asnincreases (and therefore݂݀increases), thet-distributionapproaches the Normal distributionsot-tests andZ-tests give increasingly similar results28
4/02/201915Using a sample mean or median to test a hypothesis aboutpopulation central location,when࣌is unknown and sampling is randomis samplesize large?is samplingfrom Normallydist.population?nonoyes(CLT)Z-test (can also uset-test)Usesinstead ofߪSign Testyes29t-testUsesinstead ofߪSection C(Weeks 8 – 11)Two variable analysis30
4/02/201916Chi-square test(2 cat. variables)•When working with two variables, our motivating question willgenerally be“is there an association between the variables?”•The chi-square(χଶ)test is used to detect the presence of anassociation between twocategoricalvariables•Theχଶdistribution is asymmetrical and skewed to the right•Expected values are calculated by multiplying the appropriate rowtotal by the column total and dividing by the total number ofobservationsܧ =ோוHypotheses when performing a chi-square test will be:ܪ:there isnoassociation between variablesܪଵ:thereisan association31Chi-square test•If there were perfect agreement between theobservedandexpectedfrequencies,then we would expect the߯ଶtest statistic to equal zero•The more the observed and expected frequencies disagree, the greater the valueof the test statistic•A߯ଶtest is a one-tailed test•The degrees of freedom for the߯ଶtest are calculated as݂݀ =ܴ − 1 ×ܥ − 1,whereܴis no. of rows andܥis no. of columns•Necessary conditions for you to check when conducting a߯ଶtest are1.observations must be independent2.
Upload your study docs or become a
Course Hero member to access this document
Upload your study docs or become a
Course Hero member to access this document
End of preview. Want to read all 22 pages?
Upload your study docs or become a
Course Hero member to access this document