14 Pages

lecture4

Course: STAT 201, Fall 2008
School: Texas A&M
Rating:
 
 
 
 
 

Word Count: 1072

Document Preview

Distributions Sometimes Normal the overall pattern of a large number of observations is so regular that it can be described by a smooth curve Four examples of histograms which demonstrate a symmetric, bell-shaped pattern. 1 Normal distributions can take on many different means and standard deviations. Only the general bell shape must remain the same. Properties of the Normal Distribution: 1. Symmetric,...

Register Now

Unformatted Document Excerpt

Coursehero >> Texas >> Texas A&M >> STAT 201

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Distributions Sometimes Normal the overall pattern of a large number of observations is so regular that it can be described by a smooth curve Four examples of histograms which demonstrate a symmetric, bell-shaped pattern. 1 Normal distributions can take on many different means and standard deviations. Only the general bell shape must remain the same. Properties of the Normal Distribution: 1. Symmetric, bell-shaped 2. Mean, and standard deviation, 3. Area under the curve is 1 Because we will mention normal distributions often, a short notation is helpful. We abbreviate the normal distribution with mean and standard deviation as N (, ). N(0, 1) N(3, 2) N(-2, 0.5) 2 Q. Why are the normal distributions important in statistics? Here are two reasons (among many others): 1. normal distributions are good descriptions for some distributions of real data (if you don't believe me, take a look at the above figure!). 2. many statistical inference procedures based on normal distributions work well for other roughly symmetric distributions. (this is an advanced point which will be discussed later in the course if time permits). The 68-95-99.7 rule (a.k.a. the "empirical rule") Although there are many normal curves, they all have common properties. Here is one of the most important. 3 Example 1.3.1 Bigger animals tend to carry their young longer before birth. The length of horse pregnancies from conception to birth varies according to a roughly normal distribution with mean 336 days and standard deviation 3 days. Use the 68-95-99.7 rule to answer the following questions. 1. Sketch a picture of the distribution of the length of horse pregnancies from conception to birth. Also, using the short notation discussed earlier, to write an abbreviated representation of this distribution. 2. What percent of horse pregnancies fall between 333 to 339 days? What proportion fall between 330 to 342 days? 3. Almost all (99.7%) of horse pregnancies fall in what range of lengths? 4. What percent of horse pregnancies are shorter than 339 days? 4 Standardizing observations As the 68-95-99.7 rule suggests, all normal distributions share many properties. In fact, all normal distributions are the same if we measure in units of size about the mean as center. Changing to units is called standardizing. A z-score tells us how many standard deviations the original observation falls away from the mean, and in which direction. - Observations larger than the mean are positive when standardized, and observations smaller than the mean are negative. z-scores allow us to directly compare observations from two different "groups". Example 1.3.2 Scores on the Wechsler Adult Intelligence Scale for the 20 to 34 age group are approximately normally distributed with mean 110 and standard deviation 25. Scores for the 60 to 64 age group are approximately normally distributed with mean 90 and standard deviation 25. Sarah, who is 30, scores 135 on this test. Sarah's mother, who is 60, also takes the test and scores 120. Who scored higher relative to her age group, Sarah or her mother? Who has the higher absolute level of the variable measured by the test? At what percentile of their age groups are Sarah and her mother? (That is, what percent of the age group has lower scores?) 5 The standard normal distribution 6 Using the standard normal table Example 1.3.3 Assume that Z follows the standard normal distribution. Consider the following questions: 1. What percent of observations fall below 1? (i.e., is what the relative frequency of the event Z < 1?) 2. What percent of observations fall below 2? (i.e., what is the relative frequency of the event Z < 2?) 3. What percent of observations fall below 1.47? (i.e., what is the relative frequency of the event Z < 1.47?) Issue: Note that we can not answer the last question by simply using the 6895-99.7 rule. To answer such a question more generally, we need to refer to a standard normal table. This table is given in the front cover of the text book and the appendix (Table A). 7 Example 1.3.4 Find the relative frequency of each of the following events in a standard normal distribution. In each case, sketch a standard normal curve with the area representing the relative frequency shaded. (a) Z -2.25 (b) Z -2.25 (c) Z > 1.77 (d) -2.25 < Z < 1.77 8 Normal distribution calculations Example 1.3.5 Using the z-score formula given earlier along with our approach to using the standard normal table, we can answer the following types of questions: 1. Suppose X N (3, 2). Find the proportion of the observations that are less than 4. 2. Suppose X N (2, 3). Find the proportion of the observations that are less than 4. 3. Suppose X N (2, 3). Find the proportion of the observations that are greater than 5. 4. Suppose X N (2, 3). Find the proportion of the observations that are between -4 and 8. 9 Using the z-score formula given earlier along with our approach to using the standard normal table, we can answer the following types of questions: Example 1.3.6 Too much cholesterol in the blood increases the risk of heart disease. Young women are generally less afflicted with high cholesterol than other groups. The cholesterol levels for women aged 20 to 34 follow an approximately normal distribution with mean 185 milligrams per deciliter (mg/dL) and standard deviation 39 mg/dL. (a) Cholesterol levels above 240 mg/dL demand medical attention. What percent of young women have levels above 240 mg/dL? (b) Levels above 200 mg/dL are considered borderline high. What percent of young women have blood cholesterol between 200 and 240 ...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Texas A&M - STAT - 201
Least-Squares RegressionRecall that correlation measures the direction and strength of the linear (straight line) relationship between two variables. If a scatterplot shows a linear relationship, we would like to draw a line on the plot to summarize
Texas A&M - STAT - 201
Solution to Quiz 2September 24, 2008Problem 1NO. Correlation does not imply a cause and effect relationship. So size of the hospital does not affect the length of stay of a patient. As a result the length of stay cannot be shortened by merely cho
Texas A&M - STAT - 201
CorrelationRecall that a scatterplot displays the form, direction and strength of the relationship between two quantitative variables. Linear relationships are particularly important because of their simple pattern that is quite common. A linear re
Texas A&M - STAT - 201
STAT 201 Section 501 Elementary Statistical InferenceVincent LeMoine Texas A&amp;M University Department of Statistics March 18, 20041Chapter 4 Section 4.1 : Randomness You cannot predict the results of tossing a coin or choosing an SRS, but there i
Texas A&M - STAT - 201
Syllabus for STAT 201, Section 501 Elementary Statistical Inference Fall 2004 Course: Lecture MW 4:105:25 PM, HECC 200 Instructor: Vincent LeMoine Oce: Phone: Email: Web Page: Oce Hours: Grader: Yi Qian Oce: Phone: Email: Oce Hours: Blocker 405F 845
Texas A&M - STAT - 303
Syllabus Statistics 303 Section 203, Summer 2008MWF 12:00 - 1:35pm BLOC 150, TR 12:00 - 1:35pm BLOC 161 Instructor: Vincent LeMoine, e-mail: vlemoine@stat.tamu.edu Office Hours: Yet to be announced, Office: 405-E Stat Dept. Office: Blocker 447, 8-5p
Texas A&M - STAT - 201
University Rules: Incompletes and Missed ExamsThe following university rules describe the procedures by which Incompletes should be given and the procedures to follow in deciding if a make-up exam should be given and the rules to follow in administe
Texas A&M - STAT - 303
STAT303Practice Questions Sp 06 jhc D. The assumption of linearity has been violated. E. All of the assumptions for simple linear regression are valid.Source | SS df MS -+-Model | 47.5154184 1 47.5154184 Residual | 8.0485833 8 1.00607291 -+-Total
Texas A&M - STAT - 201
STAT 201 Section 501 Elementary Statistical InferenceVincent LeMoine Texas A&amp;M University Department of Statistics January 21, 20041Chapter 1: Denitions: Individuals are objects described by a set of data. Examples are people, animals, or things
Texas A&M - STAT - 201
STAT 201 Section 501 Elementary Statistical InferenceVincent LeMoine Texas A&amp;M University Department of Statistics April 6, 20041Chapter 6 Introduction The purpose of statistical inference is to draw conclusions from data. We have examined data
Texas A&M - STAT - 201
Quiz 1 Solutions1. (a) Construct a five-number summary based on the stemplot below: (30 points) Stem = 1 1 2 3 4 5 6 |6889 |12237999 |122 |256 |2 |1(b) What is the shape of the distribution? (10 points) 2. List the two properties of a valid densit
Texas A&M - STAT - 201
STAT 201 Section 501 Elementary Statistical InferenceVincent LeMoine Texas A&amp;M University Department of Statistics April 17, 20041Chapter 7 Section 7.2 : Comparing Two Means Twosample problems are among the most common sitations encountered in s
Texas A&M - STAT - 303
S ta tis tic s 303Chapter 4 ProbabilityProbability modelWhen we study certain random phenomenon (ex. tossing a coin, choosing a digit at random ), we will describe it by: A list of all possible outcomes (called sample space, denoted by S)Ex.1.
Texas A&M - TEACHING - 651
Texas A&M - TEACHING - 651
Texas A&M - TEACHING - 651
Data Analysis and Statistical Methods Statistics 651http:/www.stat.tamu.edu/~suhasini/teaching.htmlLecture 6 (MWF)Suhasini Subba RaoLecture 6 (MWF)Review of previous lecture Every random variable has a probability `distribution' associated
Texas A&M - TEACHING - 651
Texas A&M - TEACHING - 673
BibliographyHong-Zhi An, Zhao-Guo. Chen, and E.J. Hannan. Autocorrelation, autoregression and autoregressive approximation. Ann. Statist., 10:926936, 1982. T. W. Anderson. An Introduction to Multivariate Analysis. Wiley, New Jersey, 2003. T. W. Ande
Texas A&M - TEACHING - 651
Data Analysis and Statistical Methods Statistics 651http:/www.stat.tamu.edu/~suhasini/teaching.htmlLecture 11 (MWF)Suhasini Subba RaoLecture 11 (MWF)Review of previous lecture Up until the previous lecture we were calculating probabilities
Texas A&M - TEACHING - 673
Chapter 7MixingalesIn this section we prove some of the results stated in the previous sections using mixingales. We rst dene a mixingale, noting that the denition we give is not the most general denition. Denition 7.0.1 (Mixingale) Let Ft = (Xt ,
Texas A&M - TEACHING - 651
Data Analysis and Statistical Methods Statistics 651http:/www.stat.tamu.edu/~suhasini/teaching.htmlLecture 8 (MWF)Suhasini Subba RaoLecture 8 (MWF)The binomial: mean and varianceRecall that the number of successes out of n, denoted by Sn is
Texas A&M - TEACHING - 651
11.1Commands in SPSSDowloading data from the web The data I post on my webpage will be either in a zipped directory containing a few les or just in one le containing data. Please learn how to unzip zipped documents. Saving data on my website t
Texas A&M - TEACHING - 673
Chapter 1Introduction: Why do time series?A time series is a series of observations xt , each observed at the time t. Typically the observations can be over an entire interval, randomly sampled on an interval or at xed time points. Dierent types o
Texas A&M - TEACHING - 673
STATISTICS 673 Autumn Semester, 2008 INSTRUCTOR: Dr. Suhasini Subba Rao OFFICE: BLOCKER 435 OFFICE HOURS: Monday and Wednesday 11:00 12:00 or by appointment. email address: suhasini.subbarao@stat.tamu.edu WWW: www.stat.tamu.edu/suhasini/teaching.htm
Texas A&M - TEACHING - 651
STATISTICS 651 Spring Semester, 2009 INSTRUCTOR: Dr. Suhasini Subba Rao OFFICE: BLOCKER 435 OFFICE HOURS: Monday and Wednesday 10:20 11:20 or by appointment. email address: suhasini.subbarao@stat.tamu.edu T.A.: Qing Chang, Room BLOCKER 409, email:qc
Texas A&M - TEACHING - 673
Chapter 5Almost sure convergence, convergence in probability and asymptotic normalityIn the previous chapter we considered estimator of several dierent parameters. The hope is that as the sample size increases the estimator should get closer to th
Texas A&M - TEACHING - 651
STAT 651 Midterm Test (50 minutes)NAME:Total number of Marks:Section:There are 7 questions in this paper, do not be deterred, they are all straightforward. Read each question carefully. The number of marks for each question are given in brack
Texas A&M - TEACHING - 651
Data Analysis and Statistical Methods Statistics 651http:/www.stat.tamu.edu/~suhasini/teaching.htmlLecture 15 (MWF)Suhasini Subba RaoLecture 14 (MWF)The underlying idea of statistical testing So far we have considered confidence intervals,
Texas A&M - TEACHING - 651
Texas A&M - TEACHING - 651
STAT 651 Midterm Test (50 minutes)October, 2008NAME:Total number of Marks:/30There are 6 questions in this paper, do not be deterred, they are all straightforward. Read each question carefully. There are questions on both side of the page.
Texas A&M - TEACHING - 673
Appendix AAppendixA.1 Background: some definition and inequalitiesThe norm of an object, is a postive numbers which measure the `magnitude' of that object. Suppose x = (x1 , . . . , xn ) Rn , then we define x 1 = n |xj | and x 2 = ( n |x2 )1/2 j