12 Pages

week10

Course: STAT 1000, Fall 2011
School: Pittsburgh
Rating:
 
 
 
 
 

Word Count: 5327

Document Preview

23 Nancy Lecture Pfenning Stats 1000 Chapter 11: Testing Hypotheses About Proportions Recall: last time we presented the following examples: 1. In a group of 371 Pitt students, 42 were left-handed. Is this significantly lower than the proportion of all Americans who are left-handed, which is .12? 2. In a group of 371 students, 45 chose the number seven when picking a number between one and twenty "at...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> Pittsburgh >> STAT 1000

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
23 Nancy Lecture Pfenning Stats 1000 Chapter 11: Testing Hypotheses About Proportions Recall: last time we presented the following examples: 1. In a group of 371 Pitt students, 42 were left-handed. Is this significantly lower than the proportion of all Americans who are left-handed, which is .12? 2. In a group of 371 students, 45 chose the number seven when picking a number between one and twenty "at random". Does this provide convincing statistical evidence of bias in favor of the number seven, in that the proportion of students picking seven is significantly higher than 1/20 = .05? 3. A university has found over the years that out of all the students who are offered admission, the proportion who accept is .70. After a new director of admissions is hired, the university wants to check if the proportion of students accepting has changed significantly. Suppose they offer admission to 1200 students and 888 accept. Is this evidence of a change from the status quo? Each example mentions a possible value for p, which would indicate no difference/no change/status quo. The null hypothesis H0 states that p equals this "traditional" value. In contrast to the null hypothesis, each example suggests that an alternative may be true: a significance test problem always pits an alternative hypothesis Ha against H0 . Ha proposes that the proportion differs from the "traditional" value p0 --Ha rocks the boat/upsets the apple cart/ marches to a different drummer. A key difference among our three examples is the direction in which Ha refutes H0 . In the first, it is suggested that the proportion of all Pitt students who are left-handed is less than the proportion for adults in the U.S., which is .12. In the second, we wonder if the proportion of students picking the number seven is significantly more than .05. In the third, we inquire about a difference in either direction from the stated proportion of .70. We can list our null and alternative hypotheses as follows: 1. H0 : p = .12 2. H0 : p = .05 3. H0 : p = .70 Ha : p < .12 Ha : p > .05 Ha : p = .70 < > In general, we have H0 : p = p0 vs. Ha : p p0 = Note that your textbook may have expressed the first two null hypotheses as H 0 : p .12 and H0 : p .05. These expressions serve well as logical opposites to the alternative hypotheses, but our strategy to carry out a test will be to assume H0 is true, which means we must commit to a single value p0 at which to center the hypothesized distribution of p. Thus, we will write H0 : p = p0 in these notes. ^ Alternatives with < or > signs are called one-sided alternatives; with = they are two-sided. When in doubt, a two-sided alternative should be used, because it is more general. Note: In statistical inference, we draw conclusions about unknown parameters. Thus, H 0 and Ha are statements about a parameter (p), not a statistic (^). We can't argue about p; its value has been measured p ^ and taken as fact. Note: Just as "success" and "failure" in binomial settings lost their connotations of favorable and unfavorable, Ha may or may not be a desired outcome. It can be something we hope or fear or simply suspect is true. However, because p0 is a traditionally accepted value, we'll stick with H0 unless there is convincing evidence to the contrary: H0 is "innocent until proven guilty". How can we produce evidence to refute H0 ? By using what probabilty theory tells us about the behavior of the R.V. sample proportion p: it is centered at p, has spread ^ is normal. 88 p(1-p) , n and for large enough n its shape Our strategy will be to determine if the observed value p is just too unlikely to have occurred if H 0 : p = p0 ^ were true. If the probability of such an outcome (called the P-value) is too small, then we'll reject H 0 in favor of Ha . The P-value of a test about a proportion p is the probability, computed assuming that H 0 : p = p0 is true, that the test statistic (^) would take a value at least as extreme--that is, as low or as high or as p different--as the one observed. The smaller the P-value, the stronger the evidence against H 0 . Since Example 1 has Ha : p < .12, the P-value is the probability of a sample proportion of left-handers as 42 low as 371 = .113 or lower, coming from a population where the proportion of left-handers is .12. Based on what we learned about the sampling distribution of p, we know that p here, assuming H 0 is true, has mean ^ ^ p = .12, standard error .12(.88) , and an approximately normal shape since 371(.12) 45 and 371(.88) 326 371 are both greater than 10. [Also, we have in mind a much larger population of Pitt students, certainly more than 10(371) = 3710.] P-value = P (^ .113) P (Z p .113 - .12 .12(.88) 371 ) = P (Z -.41) = .3409 p(1-p) n Note: For confidence intervals, since p has standard deviation ^ with s.e.(^) = p of p is ^ p0 (1-p0 ) n p(1-p) ^ ^ . n with p unknown, we estimated it Now, we carry out our test assuming H0 : p = p0 is true, so the standard deviation p-p0 ^ p0 (1-p0 ) n and the test statistic is z = Since it's not at all unlikely (probability about 34%) for a random sample of 371 from a population with proportion .12 of left-handers to have a sample proportion of only .113 left-handers, we have no cause to reject H0 : p = .12. The proportion of left-handers at Pitt may well be the same as for the whole country, .12. Because we rely on standard normal tables to determine the P-value, we transform from an observed p-p0 ^ value p to a standardized value z = ^ . The way to compute the P-value depends on the form of p (1-p ) 0 0 n Ha , as illustrated below, first in terms of p, then in terms of z. ^ 89 P-value for Tests of Significance about p Observed Ha : p < p 0 Standardized Ha : p < p 0 P-value = P (^R.V. pstat ) p ^ c p ^ x p0 P-value = P (Z z) c z= p - p0 ^ p0 (1 - p0 ) n Z 0 Ha : p > p 0 Ha : p > p 0 P-value = P (^RV pstat ) p ^ c pRV ^ p0 p ^ 0 z= P-value = P (Z z) c p - p0 ^ p0 (1 - p0 ) n Z Ha : p = p 0 Ha : p = p 0 P-value = combined area s d d p0 p one of these ^ d d d 90 P-value = 2P (Z |z|) d d d pRV ^ Z s d 0 ^ d p - p0 one of these z= p0 (1 - p0 ) n Summary of Test of Significance about p Say a simple random sample of size n is drawn from a large population with unknown proportion p of successes. We measure p = X and carry out the test as follows: ^ n < > 1. Set up H0 : p = p0 vs. Ha : p p0 = 0 0 n 2. Verify that the population is at least 10 times the sample size, and that np0 10 and n(1 - p0 ) 10. p-p0 ^ Then calculate standardized test statistic z = p (1-p ) Find P-value = = = P (ZR.V. zstatistic ) for Ha : p < p0 P (ZR.V. zstatistic ) for Ha : p > p0 2P (ZR.V. |zstatistic |) for Ha : p = p0 3. 4. Determine if the results are statistically significant: if the P-value is "small", reject H 0 in favor of Ha , and say the data are "statistically significant"; otherwise, we have failed to produce convincing evidence against H0 . [For specified , reject H0 if P-value < .] 5. State conclusion in context of the particular problem. Example Let's follow these steps to solve the second problem. In a group of 371 students, 45 chose the number seven when picking a number between one and twenty "at random". Does this provide convincing statistical evidence of bias in favor of the number seven, in that the proportion of 45 students picking seven is significantly higher than 1/20 = .05? First calculate p = 371 = .12. ^ 1. H0 : p = .05 Ha : p > .05 2. We have in mind a very large population of all students. We check that 371(.05) = 19 and 371(.95) = 352 are both greater than 10. Next, z = .12-.05 = 6.19 .05(.95) 371 3. P-value = P (Z 6.19) = P (Z -6.19) 0 4. Since the P-value is very small, we reject H0 and say the results are statistically significant. 5. There is very strong evidence of bias in favor of the number seven. Example I also suspected bias in favor of the number seventeen. In a group of 371 students, 25 chose the number seventeen when picking a number between one and twenty "at random". Does this provide convincing statistical evidence of bias in favor of the number seventeen, in that the proportion of students picking seventeen is significantly higher than 1/20 = .05? First calculate 25 p = 371 = .067. ^ 1. H0 : p = .05 2. z = .067-.05 .05(.95) 371 Ha : p > .05 = 1.50 3. P-value = P (Z 1.50) = P (Z -1.50) = .0668 4. We could call this a borderline P-value. Next lecture, we'll discuss guidelines for how small the P-value should be in order to reject H0 , and we'll solve the third example. Often, a cut-off probability is set in advance, in which case we reject H0 if the P-value is less than . 91 Lecture 24 Testing Hypotheses About Proportions Last time, we learned the steps to carry out a test of significance: < > 1. Set up H0 : p = p0 vs. Ha : p p0 = 2. In order to verify that the underlying distribution is approximately binomial, check that the population is at least 10 times the sample size. In order to justify use of a normal approximation to binomial p-p0 ^ proportion, check that np0 10 and n(1-p0) 10. Calculate standardized test statistic z = p (1-p ) 0 0 n 3. Find P-value = = = P (ZR.V. zstatistic ) for Ha : p < p0 P (ZR.V. zstatistic ) for Ha : p > p0 2P (ZR.V. |zstatistic |) for Ha : p = p0 4. Assess significance: if the P-value is "small", reject H0 in favor of Ha , and say the data are "statistically significant"; otherwise, we have failed to produce convincing evidence against H 0 . [For specified , reject H0 if P-value < .] 5. State conclusions in context. Last time we began to solve the following example: Example When students are asked to pick a number "at random" from one to twenty, I suspect their selections will show bias in favor of the number seventeen. In a group of 371 students, 25 chose the number seventeen. Does this provide convincing statistical evidence of bias in favor of the number seventeen, in that the proportion of students picking seventeen is significantly higher than 1/20 = .05? The null and alternative hypotheses were H0 : p = .05 Ha : p > .05 and so the z-statistic was z = .067-.05 = 1.50 and the P-value was = P (Z 1.50) = P (Z -1.50) = .0668 .05(.95) Step 4 says to reject H0 if the P-value is small. How small is small? Sometimes, it is decided in advance exactly how small the P-value would have to be to lead us to reject the null hypothesis: a cut-off probability is prescribed in advance. Then, if the P-value is less than , we reject H0 , and say the results are statistically significant at level . Otherwise, we do not have sufficient evidence to reject H0 . Example Is there evidence of bias in favor of the number seventeen at the = .05 level? The P value is .0668, which is not less than .05, so by this criterion it is not small enough to reject H 0 . It could be that students didn't have any systematic preference for the number seventeen, and the proportion of seventeens selected was a bit high only by chance. Example A university has found over the years that out of all the students who are offered admission, the proportion who accept is .70. After a new director of admissions is hired, the university wants to check if the proportion of students accepting has changed significantly. Suppose they offer admission to 1200 students and 888 accept. Is this evidence at the = .05 level that there 92 371 has been a real change from the status quo? How about at the .02 level? First we find that 888 p = 1200 = .73 is the sample proportion of students who accepted admission. ^ 1. Set up H0 : p = .70 vs. Ha : p = .70 2. Both conditions are satisfied. z = .73-.70 .7(.3) 1200 = 2.27 3. Because of the two-sided alternative, our P-value is = 2P (Z |2.27|) = 2P (Z -2.27) = 2(.0116) = .0232 4. Since .0232 < .05, we have evidence to reject at the 5% level. But .0232 is not less than .02, so we don't have evidence to reject at the 2% level. 5. If we set out to gather evidence of a change in either direction for overall proportion of students accepting admission, we would say yes with a cutoff of .05, no with a cutoff of .02. Thus, this test is rather inconclusive. Example A university has found over the years that out of all the students who are offered admission, the proportion who accept is .70. After a new director of admissions is hired, the university wants to check if the proportion of students accepting has increased significantly. Suppose they offer admission to 1200 students and 888 accept. Is this evidence at the = .05 level that there has been a significant increase in proportion of students accepting admission? How about at the .02 888 level? Again we find that p = 1200 = .73 is the sample proportion of students who accepted ^ admission. 1. The subtle re-phrasing of the question ("increased" instead of "changed") results in a different alternative hypothesis. H0 : p = .7 vs. Ha : p > .7 2. The z statistic is unchanged: z = .73-.70 .7(.3) 1200 = 2.27 3. Because of the one-sided alternative, our P-value is = P (Z 2.27) = P (Z -2.27) = (.0116) 4. Since .0116 < .05, we again have evidence to reject at the 5% level. This time, .0116 is also less than .02, so we also have evidence to reject at the 2% level. 5. If we set out to gather evidence of increased overall proportion of students accepting admission, we would say yes, we have produced evidence of an increase, whether the = .05 or = .02 level is used. The previous examples demonstrate that 1. It is more difficult to reject H0 for a two-sided alternative than for a one-sided alternative. In general, the two-sided P-value is twice the one-sided P-value. The one-sided P-value is half the two-sided P-value. 2. It is more difficult to reject H0 for lower levels of . Calculating the P-value in Step 3 gives us the maximum amount of information to carry out our test--we know exactly how unlikely the observed p is. ^ If a cut-off level is prescribed in advance, then it is possible to bypass the calculation of the P-value in Step 3. Instead, the z-statistic is compared to the critical value z associated with . For example, if we have a two-sided alternative and is set at .05, then the rejection region would be where the test-statistic z exceeds 1.96 in absolute value. The disadvantage to this method is that it provides only the bare minimum of information needed to decide whether to reject H0 or not. We will not employ the rejection region method in this course, but students should be aware of it in case they encounter it in other contexts. A method that falls somewhere in between those which provide maximum and minimum information is the following: "close in on" the P-value by surrounding the z statistic with neighboring values z from the 93 "infinite" row of Table A.2. The advantage to this method is that it familiarizes us with the use of Table A.2, which will be needed when we carry out hypothesis tests about unknown population mean of a quantitative variable. Note that z = 1.645 corresponds to an area of .90 symmetric about zero, so each tail probability, that z takes a value less than -1.645 or greater than +1.645, is .05. z = 1.960 corresponds to an area of .95 symmetric about zero, so each tail probability, that z takes a value less than -1.960 or greater than +1.960, is .025. z = 2.326 corresponds to an area of .98 symmetric about zero, so each tail probability, that z takes a value less than -2.326 or greater than +2.326, is .01. z = 2.576 corresponds to an area of .99 symmetric about zero, so each tail probability, that z takes a value less than -2.576 or greater than +2.576, is .005. These tail probabilities may be penciled in at the top or bottom ends of the columns in Table A.2 for easy reference. We will now re-solve some of our earlier examples, using Table A.2 instead of Table A.1. Example A university has found over the years that out of all the students who are offered admission, the proportion who accept is .70. After a new director of admissions is hired, the university wants to check if the proportion of students accepting has increased significantly. Suppose they offer admission to 1200 students and 888 accept. Is this evidence at the = .05 level that there has been a significant increase in the proportion of students accepting? 888 First we found that p = 1200 = .73 is the sample proportion of students who accepted admission and ^ set up H0 : p = .70 vs. Ha : p .70. Next we calculated z = .73-.70 = 2.27. .7(.3) 1200 Our P-value is = P (Z 2.27). According to Table A.2, z = 2.27 is between z = 1.960 and z = 2.326. Therefore, our p-value, P (Z 2.27), is between .025 and .01, which means it must be less than .05. We can reject H0 at the 5% level. [Recall: Table A.1 showed the precise P-value to be .0116, which is in fact between .025 and .01.] Example A university has found over the years that out of all the students who are offered admission, the proportion who accept is .70. After a new director of admissions is hired, the university wants to check if the proportion of students accepting has changed significantly. Suppose they offer admission to 1200 students and 888 accept. Is this evidence at the = .05 level that there has 888 been a real change (in either direction) from the status quo? First we found that p = 1200 = .73 ^ is the sample proportion of students who accepted admission, and we set up H0 : p = .70 vs. Ha : p = .70. Next we calculated z = .73-.70 = 2.27 .7(.3) 1200 Because of the two-sided alternative, our P-value is = 2P (Z |2.27|) = 2P (Z +2.27). According to Table A.2, z = 2.27 is between z = 1.960 and z = 2.326. Therefore, P (Z 2.27) is between .025 and .01, and the P-value, 2P (Z 2.27), is between 2(.025) and 2(.01), that is, between .05 and .02. We still can reject H0 at the 5% level, but not at the 2% level. Example In a group of 371 Pitt students, 42 were left-handers, which makes the sample proportion .113. Is this significantly lower than the proportion of Americans who are left-handers, which is .12? Earlier we found the z-statistic to be .113-.12 = -.41 and the P-value to be P (Z -.41). .12(.88) Consulting Table A.2, we see that -.41 is less extreme than 1.645, so the P-value is larger than .05. Again, we have failed to produce any evidence against H0 . 371 94 Example When students are asked to pick a number "at random" from one to twenty, I suspect their selections will show bias in favor of the number seventeen. In a group of 371 students, 25 chose the number seventeen. Does this provide convincing statistical evidence of bias in favor of the number seventeen, in that the proportion of students picking seventeen is significantly higher than 1/20 = .05? The null and alternative hypotheses were H0 : p = .05 z = .067-.05 .05(.95) 371 Ha : p > .05 and so the z-statistic was = 1.50 and the P-value was = P (Z 1.50). Instead of using Table A.1 to find the precise P-value, we note from Table A.2 that 1.50 is less than 1.645, so the tail probability must be greater than 1-.90 = .05. Thus, our P-value= P (Z 1.50) is greater than .05 and 2 we do not have convincing evidence of bias. Note: Earlier we found the exact P-value to be P (Z -1.50) = .0668, which is indeed greater than .05. Example Note: In a previous Example, we began by assuming that the proportion of freshmen taking intro Stats classes is .25. According to survey data, we found the sample proportion of freshmen to be .08. By hand we calculated the probability of a sample proportion this low, coming from a population with proportion .25: it was approximately zero. I characterized this as "virtually impossible" and decided not to believe that the overall proportion of freshmen is .25. Alternatively, I could use MINITAB to test the hypothesis that population proportion is .25, vs. the "less than" alternative. Since "year" allows for more than two possibilities, it is necessary to use Stat, then Tables, then Tally to count the number of freshmen (35). Then use the Summarized Data option in the 1 Proportion procedure, specifying 445 as the Number of Trials and 35 as the Number of Successes. I opted to "use test and interval based on normal distribution", since that's how I originally solved the problem by hand. The p-value is zero, and I reject the null hypothesis in favor of the alternative. I again conclude that the proportion of freshmen in intro Stats classes (at least in the Fall) is less than .25. Tally for Discrete Variables: Year Year 1 2 3 4 other N= *= Count 35 257 102 37 14 445 1 Test and CI for One Proportion Test of p = 0.25 vs p < 0.25 Sample 1 X 35 N 445 Sample p 0.078652 95.0% Upper Bound 0.099642 Z-Value -8.35 P-Value 0.000 Exercise: In a previous Exercise, we explored the sampling distribution of sample proportion of females, when random samples are taken from a population where the proportion of females is .5. We noted the sample proportion of females among surveyed Stats students, and calculated by hand the probability of observing such a high sample proportion, if population proportion were really only .5. We used this probability to decide 95 whether we were willing to believe that population proportion is in fact .5. For this Exercise, address the same question by carrying out a formal hypothesis test using MINITAB. Be sure to specify the appropriate alternative hypothesis. State your conclusions clearly in context. Lecture 25 Type I and Type II Error When we set a cutoff level in advance for a hypothesis test, we are actually specifying the long-run probability we are willing to take of rejecting a true null hypothesis, which is one of the two possible mistaken decisions that can be made in a hypothesis test setting. Example Recall our testing-for-disease example in Chapter 7, in which the probability of a false positive was .015, probability of false negative was .003. All the possibilities for Decision and Actuality are shown in the table below. If we decide to use .015 as our cut-off probability (p-value < .015 means reject H0 ; otherwise don't reject), then .015 is the probability of making a Type I Error--the probability of rejecting the null hypothesis, even though it is true. That means the probability of correctly accepting a true null hypothesis is 1 - .015 = .985. In medical situtations, this is the specificity of the test. Actuality Decision Healthy (don't reject H0 ) Diseased (accept alt.hyp.) Healthy(H0 true) correct prob.=specificity=.985 incorrect(false pos.) Type I error(prob.=.015) Diseased(Ha true) incorrect(false neg.) Type II error(prob.=.003) correct prob.=sensitivity =power=.997 In our example, we were told the probability of a false negative, or Type II Error. Thus, the probability of a correct positive for an ill person (called the sensitivity of the test) = 1 minus the probabilty of Type II error. Statisticians refer to this probability as the power of the test. In a z test about population proportion p, the probability of a Type II error [incorrectly failing to reject the null hypothesis when the alternative is true] can only be calculated if we are told specifically the actual value of the population proportion. Thus, we need to know the alternative proportion which contradicts the null hypothesized proportion. What we do not need in order to calculate the probability of Type II error is the value of an observed proportion p. Our probability is about the test itself, not about the results. ^ Rather than focusing on making such calculations, we will instead think carefully about the implications of making Type I or Type II errors. Example For our medical example above, the probability of incorrectly telling a healthy person that he or she does have AIDS is higher than the probability of incorrectly telling an infected person that he or she does not have AIDS. If a healthy person initially tests positive (Type I error), then the consequence (besides considerable anxiety) is a subsequent, more discerning test, which has a better chance of making the correct diagnosis second time around. If an infected person tests negative (Type II error), then the consequences are more dire, because treatment will be withheld, or at best delayed, and there is the risk of further infecting other individuals. Thus, in this case it makes sense to live with a higher probability of Type I error in order to diminish the probability of Type II error. 96 Example Consider the following legal example: the null hypothesis is that the defendant is innocent and the alternative is that the defendant is guilty. The trial weighs evidence as in a hypothesis test in order to decide whether or not to reject the null hypothesis of innocence. What would Type I and II errors signify in this context? A Type I error means rejecting a null hypothesis that is true, in other words finding an innocent person guilty. Most people would agree that this is much worse than committing a Type II error in this context, which would be failing to convict a guilty person. Dr. Stephen Fienberg of CMU did extensive work for the government is assessing the effectiveness of lie-detector tests. He concluded that probabilities of committing both types of error were so high that he and a panel of investigators recommended discontinuing the use of such tests. A peek at a brain can unmask a liar tells about the most recent technology for new sorts of lie detectors to replace the old-fashioned polygraph. Role of Sample Size Example Suppose one demographer claims that there are equal proportions of male and female births in a certain state, whereas another claims there are more males. They use hospital records from all over the state to sample 10,000 recent births, and find 5120 to be males, or p = .512. They ^ test H0 : p = .5 vs. Ha : p > .5 and calculate z = .512-.5 = 2.4, so the P-value is .0084, quite .5(.5) 10,000 small. Does this mean (a) they have evidence that the population proportion of male births is much higher than .5; or (b) they have very strong evidence that the population proportion of male births is higher than .5? The interpretation in (b) is the correct one; (a) is not. Especially when the sample size is large, we may produce very strong evidence of a relatively minor difference from the claimed p0 . Conversely, if n is too small, we may fail to gather evidence about a difference that is quite substantial. Example A Statistics recitation instructor suspects there to be a higher proportion of females overall in Stats classes. She observes 12 females in a group of 20 students, so p = .6. Does this confirm ^ her suspicions? She would test H0 : p = .5 vs. Ha : p > .5. First she verifies that 20(.5) and 20(1 - .5) are both 10, just barely satisfying our condition for a normal approximation. Also, she has in mind a population in the hundreds or even thousands, so the binomial model applies. She calculates z = .6-.5 = .89. The P-value is .1867, providing her with no statistical evidence .5(.5) to support her claim. In fact, the population proportion of females really is greater than .5, but this sample size was just too small to prove it. In contrast, a lecture class of 80 students with p = .6 would produce a z statistic of 1.79 and a P-value of .0367. ^ Remember that z= p - p0 ^ (^ - p0 ) n p 20 p0 (1-p0 ) n = p0 (1 - p0 ) We reject H0 for a small P-value, which in turn has arisen from a z that is large in absolute value, on the fringes of the normal curve. There are three components that may result in a z that is large in absolute value, which in turn cause us to reject H0 : 1. What people tend to focus on as the cause of rejecting H0 is a large difference p - p0 between the ^ observed proportion and the proportion proposed in the null hypothesis. This naturally makes z large and the P-value small. 97 2. A large sample size n, because n is actually multiplied in the numerator of the test statistic z, brings about a large z and a small P-value. Conversely, a small sample size n may lead to a smaller z and failure to reject H0 , even if it is false (a Type II error). 3. If p0 is close to .5, then p0 (1 - p0 ) is considerably larger than it is for p0 close to 0 or 1. (For example, p0 (1 - p0 ) is .5 for p0 = .5, but it is .1 for p0 = .01 or .09.) When Hypothsis Tests are Not Appropriate Remember that we carry out a hypothesis test, based on sample data, in order to draw conclusions about the larger population from which the sample was obtained. Hypothesis tests are not appropriate if there is no larger group being represented by the sample. Example In 2002, the government requested and won approval for 1228 special warrants for secret wiretaps and searches of suspected terrorists and spies. Is this significantly higher than 934, which was the number of special warrants approved in 2001? Statistical inference is not appropriate here because 1228 and 934 represent entire populations for 2002 and 2001; they are not sample data. Example 928,000 In 2000, 1238,000 = .75 of all bachelor's degrees were earned by whites. Is this significantly lower than .86, the proportion of all bachelor's degrees earned by whites in 1981? We would not carry out a significance test, because the given proportions already describe the population. Example An internet review of home pregnancy tests reports: "Home pregnancy testing kits usually claim accuracy of over 95% (whatever that may mean). The reality is that the literature contains information on only four kits evaluated as they are intended to be used--by women testing their own urine. The results we have suggest that for every four women who use such a test and are pregnant, one will get a negative test result. It also suggests that for every four women who are not pregnant, one will have a positive test result." From this information we can identify the probabilities of both Type I and II errors, according to the review, as being 1 in 4, or 25%. Example Gonorrhea is a very common infectious disease. In 1999, the rate of reported gonorrhea infections was 132.2 per 100,000 persons. A polymerase chain reaction (PCR) test for gonorrhea is known to have sensitivity 97% and specificity 98%. What are the probabilities of Type I and Type II Errors? Given the high degree of accuracy of the test, if a randomly chosen person in the U.S. is routinely screened for gonorrhea, and the test comes up positive, what is the probability of actually having the disease? The null hypothesis would be that someone does not have the disease. A Type I Error would be rejecting the null hypothesis, even though it is true: testing positive when a person does not have the disease. A Type II Error would be failing to reject the null hypothesis, even though it is false: testing negative when a person does have the disease. A sensitivity of 97% means that if someone has the disease, the probability of correctly testing positive is 97%, and so the probability of testing negative (when someone has the disease) is 3%: this is the probability of a Type II error. A specificity of 98% means that if someone does not have the disease, the probability of correctly testing negative is 98%, and so the probability of 98 testing positive (when someone does not have the disease) is 2%: this is the probability of a Type I error. A two-way table makes it easier to identify the probability we are seeking (of having the disease, given that the test is positive). We begin with a total of 100,000 people, of whom 132 have the disease (the remaining 999,868 do not). Sensitivity 97% means 127 of the 132 with gonorrhea test positive. Specificity 98% means 979,871 of the 999,868 people without gonorrhea test negative. The remaining counts can be filled in by subtraction: Gonorrhea No Gonorrhea Total Positive 127 1,997 2,124 Negative 5 97,871 97,876 Total 132 99,868 100,000 Of the 2,124 people who test positive, 127 actually have the disease: if someone tests positive, 127 the probability of having the disease is 2,124 = .06. Remember, however, that this probability applies to a randomly chosen person being screened. If someone is screened because of exhibiting symptoms, the probability is of course higher. Exercise : Refer to the article How not to catch a spy: Use a lie detector, which reports at the bottom of the first column, "Even if the test were designed to catch eight of every 10 spies, it would produce false results for large numbers of people. For every 10,000 employees screened, Fienberg said, eight real spies would be singled out, but 1,598 innocent people would be singled out with them, with no hint of who's a spy and who isn't." Based on this information, set up a two-way table, classifying 10,000 employees as actually being spies or not, and being singled out as a spy by the lie detector or not. Report the probability of a Type I Error and of a Type II Error. If someone is identified by the lie detector as being a spy, what is the probability that he or she is actually a spy? 99
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Pittsburgh - STAT - 1000
Lecture 26Nancy Pfenning Stats 1000Chapter 12: More About Confidence IntervalsRecall: Setting up a confidence interval is one way to perform statistical inference: we use a statistic measured from the sample to construct an interval estimate for the un
Pittsburgh - STAT - 1000
Lecture 29Nancy Pfenning Stats 1000Reviewing Confidence Intervals and Tests for Ordinary One-Sample, MatchedPairs, and Two-Sample Studies About MeansExample Blood pressure X was measured for a sample of 10 black men. It was found that x = 114.9, s = 10
Pittsburgh - STAT - 1000
Lecture 32Nancy Pfenning Stats 1000Chapter 16: Analysis of VarianceExample Suppose your instructor administers 3 different forms of a final exam. When scores are posted, you see the observed mean scores for those 3 different forms-82, 66, and 60-are no
Pittsburgh - STAT - 1000
Lecture 33Nancy Pfenning Stats 1000Chapter 16: Analysis of VarianceLast time, we wanted to test if the difference among 3 observed mean test scores-82, 66, and 60-could be easily enough attributed to chance variation: H0 : 1 = 2 = 3 vs. Ha : not all th
Pittsburgh - PHYS - 3101
33-658, 758 Quantum Computation and Information Spring Semester, 2012 Assignment No. 1. Due Tuesday, January 24 In the future all assignments will be posted at the COURSE WEB SITE: http:/www.andrew.cmu.edu/course/33-658 = http:/quantum.phys.cmu.edu/QCQI/
Pittsburgh - PHYS - 3101
33-658, 758 Quantum Computation and Information Spring Semester, 2012 Assignment No. 2. Due Tuesday, January 31 READING: BORN = &quot;Stochastic Quantum Dynamics I. Born Rule&quot; Course web site CQT = Griffiths, Consistent Quantum Theory MEASURE = &quot;Measurements&quot;
Pittsburgh - PHYS - 3101
33-658, 758 Quantum Computation and Information Spring Semester, 2012 Assignment No. 3. Due Tuesday, February 7 ANNOUNCEMENT. There will be an hour exam on Tuesday afternoon, February 21, beginning at 3:00 pm (usual class hour). It is closed book, closed
Pittsburgh - PHYS - 3101
33-658, 758 Quantum Computation and Information Spring Semester, 2012 Assignment No. 4. Due Tuesday, February 14 ANNOUNCEMENT. There will be an hour exam on Tuesday afternoon, February 21, beginning at 3:00 pm (usual class hour). It is closed book, closed
Pittsburgh - PHYS - 3101
33-658, 758 Quantum Computation and Information Spring Semester, 2012 Assignment No. 5 (Not to be turned in) ANNOUNCEMENT. There will be an hour exam on Tuesday afternoon, February 21, beginning at 3:00 pm (usual class hour). It is closed book, closed not
Pittsburgh - PHYS - 3101
qitd113Hilbert Space Quantum MechanicsRobert B. Griffiths Version of 17 January 2012Contents1 Introduction 1.1 Hilbert space . . . . . . . . . . . 1.2 Qubit . . . . . . . . . . . . . . . 1.3 Physical interpretation of vectors 1.4 Incompatible properti
Pittsburgh - PHYS - 3101
qitd122MeasurementsRobert B. Griffiths Version of 2 Feb. 2010 References: CQT = Consistent Quantum Theory by Griffiths (Cambridge, 2002) QCQI = Quantum Computation and Quantum Information by Nielsen and Chuang (Cambridge, 2000).Contents1 Introduction
Pittsburgh - PHYS - 3101
Classical Information TheoryRobert B. Griffiths Version of 12 January 2010Contents1 Introduction 2 Shannon Entropy 3 Two Random Variables 4 Conditional Entropies and Mutual Information 5 Channel Capacity 1 1 3 4 6References: CT = Cover and Thomas, Ele
Pittsburgh - PHYS - 3101
qitd181Quantum Information TypesRobert B. Griffiths Version of 6 February 2012 References: R. B. Griffiths, Types of Quantum Information, Phys. Rev. A 76 (2007) 062320; arXiv:0707.3752Contents1 Introduction 2 Information Types 2.1 Definition . . . . .
Pittsburgh - PHYS - 3101
ProbabilitiesRobert B. Griffiths Version of 12 January 2010 References: Feller, An introduction to probability theory and its applications, Vol. 1, 3d ed (Wiley 1968). See Introduction, Ch. I, Ch. V DeGroot and Schervish, Probability and Statistics, 3d e
Pittsburgh - PHYS - 3101
qitd322Unitary Dynamics and Quantum CircuitsRobert B. Griffiths Version of 23 January 2012Contents1 Unitary Dynamics 1.1 Time development operator T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Particular cases . . . . .
Pittsburgh - PHYS - 3101
Stochastic Quantum Dynamics I. Born RuleRobert B. Griffiths Version of 25 January 2010Contents1 Introduction 2 Born Rule 2.1 Statement of the Born Rule . . . 2.2 Incompatible sample spaces . . . 2.3 Born rule using pre-probabilities 2.4 Generalizations
Pittsburgh - PHYS - 3101
qitd342Histories and ConsistencyRobert B. Griffiths Version of 31 January 2012Contents1 Histories 1.1 Classical stochastic processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Quantum histories . . . . . . . . . . . . . . . . .
Pittsburgh - PHYS - 3101
qitd352Dense Coding, Teleportation, No CloningRobert B. Griffiths Version of 8 February 2012 References: NLQI = R. B. Griffiths, &quot;Nature and location of quantum information&quot; Phys. Rev. A 66 (2002) 012311; http:/arxiv.org/archive/quant-ph/0203058 QCQI =
Pittsburgh - PHYS - 3101
qitd421Correlations, Ensembles, Density OperatorsRobert B. Griffiths Version of 2 Feb. 2010Contents1 Correlations, Classical and Quantum 2 Conditional States 3 Ensembles 4 Density Operators 4.1 Introduction . . . . . . . . . . . . . . . . . . . . 4.2
Pittsburgh - PHYS - 3101
Phys. Rev. A 76 (2007) 062320; arXiv:0707.3752Types of Quantum InformationRobert B. GriffithsDepartment of Physics, Carnegie-Mellon University, Pittsburgh, PA 15213, USA Quantum, in contrast to classical, information theory, allows for different incomp
UVA - BIOE - 6421
Cell Culture Results Cells Cultured on PDMS Substrates (Plus-signshaped structure)Tissue EngineeringBioMEMS ShortcourseDr. Bruce K. GalePeg Scaffold PDMS cell culture scaffold Pegs with height 5 microns,diameter 10 microns Pegs spaced 20 microns
UVA - BIOE - 6421
Overview of the Photolithography ProcessSurface PreparationCoating (Spin Casting)Pre-Bake (Soft Bake)AlignmentExposureDevelopmentPost-Bake (Hard Bake)Processing Using the Photoresist as a Masking FilmStrippingPost Processing Cleaning (Ashing)Wa
UVA - BIOE - 6421
Phenolic Resins - 2OHO+CHHformaldehydephenolEE-527: MicroFabricationOHH2COHOHH2CH2CPositive PhotoresistsbakeliteO+HHwaterR. B. Darling / EE-527Phenolic Resins - 3R. B. Darling / EE-527Advantages of Positive PhotoresistsOHO+
UVA - BIOE - 6421
Tips for Effective Poster PresentationsThrough the process of trial and error, scientific societies and veteran poster presenters havecome up with the following rules of thumb for effective poster presentations.1. Prepare a banner in very large type co
UVA - MSE - 2090
MSE 209: Introduction to the Scienceand Engineering of MaterialsSpring 2010 MSE 209 - Section 1Instructor: Leonid ZhigileiMonday and Wednesday, 08:30 9:45 amOlsson Hall 009MSE 2090: Introduction to Materials ScienceChapter 1, Introduction1MSE 209
UVA - MSE - 2090
Syllabus:From atoms to microstructure: Interatomicbonding, structure of crystals, crystal defects,non-crystalline materials.Mass transfer and atomic mixing: Diffusion,kinetics of phase transformations.Mechanical properties, elastic and plasticdefor
UVA - MSE - 2090
Structure Subatomic level (Chapter 2)Electronic structure of individualatoms that defines interaction amongatoms (interatomic bonding). Atomic level (Chapters 2 &amp; 3)Arrangement of atoms in materials(for the same atoms can havedifferent properties,
UVA - MSE - 2090
Types of MaterialsLet us classify materials according to the way the atoms arebound together (Chapter 2).Metals: valence electrons are detached from atoms, andspread in an 'electron sea' that &quot;glues&quot; the ions together.Strong, ductile, conduct electri
UVA - MSE - 2090
Chapter Outline Review of Atomic StructureElectrons, protons, neutrons, quantum mechanics ofatoms, electron states, the periodic Table Atomic Bonding in SolidsBonding energies and forces Primary Interatomic BondingIonicCovalentMetallic Secondary
UVA - MSE - 2090
Some simple calculationsThe number of atoms per cm3, n, for material of density d(g/cm3) and atomic mass M (g/mol):n = Nav d / MGraphite (carbon): d = 2.3 g/cm3, M = 12 g/moln = 61023 atoms/mol 2.3 g/cm3 / 12 g/mol = 11.5 1022atoms/cm3Diamond (carb
UVA - MSE - 2090
Electrons in Atoms (IV)ElementAtomic #Hydrogen1Helium2Lithium3Beryllium4Boron5Carbon6.Neon10Sodium11Magnesium12Aluminum13.Electron configuration1s 11s 2(stable)1s 2 2s 11s 2 2s 21s 2 2s 2 2p 11s 2 2s 2 2p 2.Argon.Krypto
UVA - MSE - 2090
Bonding Energies and ForcesPotential Energy, UrepulsionrInteratomic distance r0attractionequilibriumThis is typical potential well for two interacting atomsThe repulsion between atoms, when they are brought closeto each other, is related to the
UVA - MSE - 2090
Ionic Bonding (I)Ionic Bonding is typical for elements that are situated atthe horizontal extremities of the periodic table.Atoms from the left (metals) are ready to give up theirvalence electrons to the (non-metallic) atoms from the rightthat are ha
UVA - MSE - 2090
Covalent Bonding (I)In covalent bonding, electrons are shared between themolecules, to saturate the valency. In this case theelectrons are not transferred as in the ionic bonding,but they are localized between the neighboring ionsand form directional
UVA - MSE - 2090
Secondary Bonding (I)Secondary = van der Waals = physical (as opposite tochemical bonding that involves e- transfer) bonding resultsfrom interaction of atomic or molecular dipoles and isweak, ~0.1 eV/atom or ~10 kJ/mol.+_+_Permanent dipole moment
UVA - MSE - 2090
Bonding in real materialsIn many materials more than one type of bonding isinvolved (ionic and covalent in ceramics, covalent andsecondary in polymers, covalent and ionic insemiconductors.Examples of bonding in Materials:Metals: MetallicCeramics: I
UVA - MSE - 2090
Chapter OutlineHow do atoms arrange themselves to form solids? Fundamental concepts and language Unit cells Crystal structuresFace-centered cubicBody-centered cubicHexagonal close-packed Close packed crystal structures Density computations Types
UVA - MSE - 2090
Metallic Crystal StructuresMetals are usually (poly)crystalline; although formationof amorphous metals is possible by rapid coolingAs we learned in Chapter 2, the atomic bonding in metalsis non-directional no restriction on numbers orpositions of nea
UVA - MSE - 2090
Body-Centered Cubic (BCC) Crystal Structure (I)Atom at each corner and at center of cubic unit cellCr, -Fe, Mo have this crystal structureMSE 2090: Introduction to Materials ScienceChapter 3, Structure of solids11Body-Centered Cubic Crystal Structur
UVA - MSE - 2090
FCC: Stacking Sequence ABCABCABC.Third plane is placed above the holes of the first planenot covered by the second planeMSE 2090: Introduction to Materials ScienceChapter 3, Structure of solids16HCP: Stacking Sequence ABABAB.Third plane is placed d
UVA - MSE - 2090
Polycrystalline MaterialsAtomistic model of a nanocrystalline solid by Mo Li, JHUMSE 2090: Introduction to Materials ScienceChapter 3, Structure of solids21Polycrystalline MaterialsSimulation of annealing of a polycrystalline grain structurefrom ht
UVA - MSE - 2090
Chapter OutlineCrystals are like people, it is the defects in them whichtend to make them interesting! - Colin Humphreys. Defects in Solids0D, Point defectsvacanciesinterstitialsimpurities, weight and atomic composition1D, Dislocationsedgescrew
UVA - MSE - 2090
Point Defects: VacanciesVacancy = absence of an atomfrom its normal location in aperfect crystal structureVacancies are always present in crystals and they areparticularly numerous at high temperatures, when atomsare frequently and randomly change t
UVA - MSE - 2090
Solids with impurities - Solid SolutionsSolid solutions are made of a host (the solvent ormatrix) which dissolves the minor component(solute). The ability to dissolve is called solubility.Solvent: in an alloy, the element or compoundpresent in greate
UVA - MSE - 2090
Composition ConversionsFrom Weight % to mass per unit volume (g/cm3):C1wtC1 = wtC1C2wt+12C2wtC2 = wtC1C2wt+12C1 and C2 are concentrations of the first and secondcomponents in g/cm3Average density &amp; average atomic weight in a binary alloy
UVA - MSE - 2090
Where do dislocations come from ?The number of dislocations in a material is expressed as thedislocation density - the total dislocation length per unitvolume or the number of dislocations intersecting a unitarea. Dislocation densities can vary from 1
UVA - MSE - 2090
Interaction between dislocations and grain boundariesMotion of dislocations can be impeded by grain boundaries increase of the force needed to move then(strengthening the material).Grain boundary present a barrier to dislocation motion: slipplane dis
UVA - MSE - 2090
Atomic VibrationsThermal energy (heat) causes atoms to vibrateVibration amplitude increases with temperatureMelting occurs when vibrations are sufficient to rupturebondsVibrational frequency ~ 1013 Hz (1013 vibrations persecond)Average atomic energ
UVA - MSE - 2090
Chapter OutlineDiffusion - how do atoms move through solids?Diffusion mechanismsVacancy diffusionInterstitial diffusionImpuritiesThe mathematics of diffusionSteady-state diffusion (Ficks first law)Nonsteady-State Diffusion (Ficks second law)Facto
UVA - MSE - 2090
Diffusion FluxThe flux of diffusing atoms, J, is used to quantifyhow fast diffusion occurs. The flux is defined aseither the number of atoms diffusing through unit areaper unit time (atoms/m2-second) or the mass of atomsdiffusing through unit area pe
UVA - MSE - 2090
Nonsteady-State Diffusion: Ficks second lawCC=Dtx 22Ficks second law relates the rate of change of compositionwith time to the curvature of the concentration profile:CCxCxxConcentration increases with time in those parts of thesystem where
UVA - MSE - 2090
Diffusion Temperature Dependence (II)b = logD0a=y = ax + bQd2.3Rx = 1/TGraph of log D vs. 1/T has slop of Qd/2.3R,intercept of log DoQd 1 log D = log D0 2.3R T log D1 log D 2 Qd = 2.3R 1 T1 1 T2 MSE 2090: Introduction to Materials Science
UVA - MSE - 2090
Diffusion: Role of the microstructure (I)Self-diffusion coefficients for Ag depend on the diffusionpath. In general the diffusivity if greater through lessrestrictive structural regions grain boundaries, dislocationcores, external surfaces.MSE 2090:
UVA - MSE - 2090
Chapter OutlineMechanical Properties of MetalsHow do metals respond to external loads?Stress and StrainTensionCompressionShearTorsionElastic deformationPlastic DeformationYield StrengthTensile StrengthDuctilityToughnessHardnessOptional read
UVA - MSE - 2090
Stress-Strain BehaviorElastic PlasticElastic deformationReversible: when the stressis removed, the materialreturns to the dimensions ithad before the loading.StressUsually strains are small(except for the case of someplastics, e.g. rubber).Plas
UVA - MSE - 2090
Elastic Deformation: Poissons ratioUnloadedLoadedyx= =zzMaterials subject to tension shrink laterally. Thosesubject to compression, bulge. The ratio of lateral andaxial strains is called the Poisson's ratio . Sign inthe above equations shows th
UVA - MSE - 2090
Tensile StrengthIf stress = tensile strength is maintainedthen specimen will eventually breakStress, fracturestrengthNeckingTensile strength: maximumstress (~ 100 - 1000 MPa)Strain, For structural applications, the yield stress is usually amore
UVA - MSE - 2090
Elastic Recovery During Plastic DeformationIf a material is deformed plastically and the stress is thenreleased, the material ends up with a permanent strain.If the stress is reapplied, the material again respondselastically at the beginning up to a n
UVA - MSE - 2090
Chapter OutlineDislocations and Strengthening MechanismsWhat is happening in material during plastic deformation?Dislocations and Plastic DeformationMotion of dislocations in response to stressSlip SystemsPlastic deformation insingle crystalspolyc
UVA - MSE - 2090
Interactions between dislocationsThe strain fields around dislocations cause them tointeract (exert force on each other). When they are inthe same plane, they repel if they have the same sign(direction of the Burgers vector) and attract/annihilateif
UVA - MSE - 2090
Slip in single crystals - critical resolved shear stressWhen the resolved shear stress becomes sufficiently large,the crystal will start to yield (dislocations start to movealong the most favorably oriented slip system). The onsetof yielding correspon