3 Pages

exam1sols

Course: STAT 218, Fall 2009
School: Cal Poly
Rating:
 
 
 
 
 

Word Count: 900

Document Preview

218 Exam Stat 1 Solutions Winter 2006 Some questions have different versions, so be sure to find the relevant version for your exam. Frequent flyer question: a) The value 83% is a statistic, because it pertains only to the sample of people who responded to the poll, not to the population of all adult Americans. b) The sampling method is almost certainly biased in favor of yes responses to using a credit card to...

Register Now

Unformatted Document Excerpt

Coursehero >> California >> Cal Poly >> STAT 218

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
218 Exam Stat 1 Solutions Winter 2006 Some questions have different versions, so be sure to find the relevant version for your exam. Frequent flyer question: a) The value 83% is a statistic, because it pertains only to the sample of people who responded to the poll, not to the population of all adult Americans. b) The sampling method is almost certainly biased in favor of yes responses to using a credit card to accumulate airline miles, because only people interested in the topic would read the article in the first place, and people who use a credit card to accumulate airline miles would be more likely to be interested in the topic. A common error was not to specify the direction of the bias as it relates to this issue of using a credit card for airline miles. Short answer questions: a) Examples of categorical variables are gender, type of injury, whether the patient has health insurance, whether the patient came in an ambulance. Examples of quantitative variables are waiting time before seeing a doctor, total cost of treatment, number of stitches received. A common error was to give a summary about all patients, such as the number of patients who were treated that day. b) The midrange is not resistant to outliers, because it depends only on the largest and smallest values in the data, so outliers would affect the midrange greatly. The midhinge is resistant to outliers, because it depends only on the quartiles, which are not affected by a few extreme values. c) The sample mean is denoted by y . It is the midpoint of the confidence interval, because the interval is formed by taking the sample mean plus/minus the margin-of-error. In this case the midpoint, and therefore the sample mean, is (6.5+7.7)/2 = 7.1. d) The IQR is the difference between the quartiles, which are the 3rd and 8th values (in order). To make the IQR = 0 requires the middle six scores to be identical. To make the mean greater than the median, include one or two very high values, such as: 60, 60, 60, 60, 60, 60, 60, 60, 100, 100. Here the IQR = 0, median = 60, and mean = 68. To make the mean less than the median, include one or two very low values, such as: 0, 0, 60, 60, 60, 60, 60, 60, 60, 60. Here the IQR = 0, median = 60, and mean = 48. Dog height question: Since we are comparing apples and oranges, we can do so with z-scores. A German Shepherd with a height of 28 inches has a z-score of (28-25)/2.5 = 1.2, indicating that his height is 1.2 standard deviations above the mean. A Sheltie with a height of 18 inches has a z-score of (18-15)/1.5 = 2.0, indicating that his height is 2.0 standard deviations above the mean. Therefore, such a Sheltie is more unusual than such a German Shepherd. A German Shepherd with a height of 22 inches has a z-score of (22-25)/2.5 = -1.2, indicating that height his is 1.2 standard deviations below the mean. A Sheltie with a height of 12 inches has a z-score of (12-15)/1.5 = -2.0, indicating that his height is 2.0 standard deviations below the mean. Therefore, such a Sheltie is more unusual than such a German Shepherd. Body temperature question: a) Because we do not know the population standard deviation , we need to use the sample standard deviation s and therefore a t-interval rather than a z-interval. With a sample size of 65, the degrees of freedom is 64, but with our table we will use 60, giving t* = 1.990. The 95% CI for the population mean body temperature of a healthy adult American female is: 98.105 +/- 1.990(0.699)/sqrt(65), which is 98.105 +/- 0.173, which is ((97.932, 98.278). The 95% CI for the population mean body temperature of a healthy adult American male is: 98.394 +/- 1.990(0.743)/sqrt(65), which is 98.394 +/- 0.184, which is ((97.210, 98.578). b) The symbol represents the mean body temperature among all healthy adult American males/females. c) The 95% CI does not include the value 98.6, suggesting that 98.6 is not a plausible value for the population mean body temperature. d) We are told that the sample was randomly selected, so that condition is met. The second condition requires either a normally distributed population or a large sample size. We can not check normality, but a sample size of 65 is large, so this condition is also met. e) A 90% confidence interval would have the same midpoint, because the midpoint is the sample mean. The width would be smaller, because with a lower confidence level we do not need as wide an interval to be confident of capturing the population mea...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Cal Poly - STAT - 218
Stat 218 - Day 31 Scatterplots and Association We will study relationships between two quantitative variables for the rest of the course. A scatterplot is a graphical display of the relationship between two quantitative variables. o The explanatory
Cal Poly - STAT - 321
Fall, 2004 Stat 321 - Day 11 Discrete Random VariablesWednesday, October 6Example: Random Babies and College Committees (cont.) Recall your analysis of distributing four babies to their mothers at random. You found that there are 24 equally likel
Cal Poly - STAT - 426
Stat 426 Assignment 7 (due Thur, Mar 13) Topics: Confidence Intervals, Hypothesis Testing, Types of Errors, Power Function, Size, pValue 1. Some student researchers at Cal Poly took a random sample of 100 students and calculated the ratio of backpack
Cal Poly - STAT - 150
Fall, 2008 Stat 150 - Day 7 Scatterplots and AssociationWednesday, Oct. 1Suppose that a criminal leaves behind a footprint so that we can measure the length of his/her foot. Does that enable us to say anything about his/her height? We will collec
Cal Poly - STAT - 321
Fall, 2004 Stat 321 - Day 2 Equal LikelinessTuesday, September 21Activity 2: Random Babies Suppose that on one night at a certain hospital, four mothers (named Johnson, Miller, Smith, and Williams) give birth to baby boys. Each mother gives her c
Cal Poly - STAT - 221
Stat 221Exam 3November 21, 2008Note: Your questions may have appeared in a different order.1. (16 pts, 4 pts each) For the following questions, identify which of the five types of inference procedures would be appropriate. Choose from the fol
Cal Poly - STAT - 252
STAT 252 - Day 29 Introduction to Multiple Regression Example 1: House Prices (cont.) Reconsider yet again the data on price (in thousands of dollars) and size (in square feet) of a random sample of 20 houses sold in Arroyo Grande in 2006-07 (HousePr
Cal Poly - STAT - 218
Stat 218 - Day 4 Five-number summary, Boxplots Five-number summary (FNS): Minimum, Lower Quartile (Q1), Median, Upper Quartile (Q3), Maximum Quartiles are calculated as the median of the values below/above the location of the actual median Example: N
Cal Poly - STAT - 322
Guide to MinitabDr. Jimmy Doi1Introduction to MinitabMinitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you rst launch Mi
Cal Poly - STAT - 330
Supplement: The IN= StatementStat 330, Dr. Jimmy A. DoiSupplement: The IN= StatementThis supplement is oered in response to a question a few students raised with regard to the IN= statement. Please refer to the course notes/textbook for more det
Cal Poly - STAT - 218
Ch. 7 Addendum: Using MinitabStat 218, Dr. Jimmy A. DoiAdditional Notes for Minitab Please add to the current Guide to MinitabExercise (A) from Course Notes: Researchers suspect that the exposure to toluene will increase the levels of NE in the
Cal Poly - STAT - 330
Supplement: The LAG StatementStat 330, Dr. Jimmy A. DoiSupplement: The LAG StatementThis supplement is oered in response to a question a student raised with regard to the lag statement. Please refer to the course notes/textbook for more details
Cal Poly - STAT - 330
Supplement: Permanent Data SetsStat 330, Dr. Jimmy A. DoiSupplement: SAS Permanent Data SetsSuppose we want to import the raw data le age salary.txt which is stored at C:/mySAS/raw data/age salary.txt. Notice that the data is based on three vari
Cal Poly - STAT - 330
OPTIONS FORMCHARStat 330, Dr. Jimmy A. DoiOPTIONS FORMCHARHeres a tip from Professor Russ Lenth (Statistics Department, University of Iowa): How to make SAS tables/output print correctly SAS is set up to use its own fonts, and this causes annoyi
Cal Poly - STAT - 330
SAS Installation Guide Dr. Jimmy Doi, Statistics Dept. Cal Poly State University San Luis ObispoNOTE FOR WINDOWS VISTA USERS: If you are running WINDOWS VISTA (HOME or HOME DELUXE), please consult with the instructor DO NOT ATTEMPT TO INSTALL SAS
Cal Poly - STAT - 330
1STAT 330Reading Multiple Lines of Data per Observation Typically data sets consist of one observation per line SAS will go to the next line if it runs out of data before it has read all variables from the input statement Better if you tell SA
Cal Poly - STAT - 330
12STAT 330Keeping Track of Stats Can use out= option with most proc's to keep your statistics in a SAS data set useful for presentation or reporting can manipulate these statistics if need be in most procs (such as means) out= goes in an ou
Cal Poly - STAT - 330
STAT 330Lecture Set 12 - Bells and WhistlesOutput Delivery System (ODS) Prior to version 7 SAS output was designed for a traditional line-printer limited to monospace fonts and not as flexible Version 7 and up now contain the ODS Where do we
Cal Poly - STAT - 330
12STAT 330PROC CHART Proc CHART is used to generate bar charts Choose between vertical bars (vbar) and horizontal bars (hbar) vbar and hbar options include:Lecture Set 9 - Basic Statistical PROCsmidpoints = list - specifies midpoints for
Cal Poly - STAT - 330
1STAT 330Lecture Set 9 - Basic Statistical PROCs2PROC CHART Proc CHART is used to generate bar charts Choose between vertical bars (vbar) and horizontal bars (hbar) vbar and hbar options include:midpoints = list - specifies midpoints for
Cal Poly - STAT - 330
STAT 330Lecture Set 13 - Useful things to knowDynamic Data Exchange (DDE) Recall that we used DDE to read data into SAS from Excel We can use a similar method to dump data back into excel (and word) Useful for creating specialized data files a
Cal Poly - STAT - 330
12STAT 330Macros Macros allow you to write piece of code to be used over and over throughout your program very useful as this cuts down on the size of your programLecture Set 11 - Macro Concepts Macros used to be considered an advanced t
Cal Poly - STAT - 330
12STAT 330Basic PROCsLecture Set 7 - Basic PROCs All PROCs have required statements and optional statements Most PROCs make some sort of report or analysis By default will send output to output (listing) window34PROC SORTPROC SORT
Cal Poly - STAT - 330
STAT 330Dynamic Data Exchange (DDE) Recall that we used DDE to read data into SAS from Excel We can use a similar method to dump data back into excel (and word) Useful for creating specialized data files as well as making reports Some work requ
Cal Poly - STAT - 330
STAT 330Output Delivery System (ODS) Prior to version 7 SAS output was designed for a traditional line-printer limited to monospace fonts and not as flexible Version 7 and up now contain the ODS Lecture Set 12 - Bells and Whistles Where do we
Cal Poly - STAT - 330
Stat 330: Assignment 3 Due: Wednesday, January 21, 2009 at 11:59:59 pm1. Forbes is an American publishing and media company that is famous for its lists including the list of richest Americans, billionaires, and the most powerful celebrities. The r
Cal Poly - STAT - 330
12STAT 330Assignments A quick note about programs. For each assignment youwill be required to include the following information at the top of your program(s):* Name: Your Name(s); * Assignment: Assignment #; * Due Date: Due date from assignm
Cal Poly - STAT - 330
Stat 330: Assignment 1 Due: Wednesday, January 7, 2009 at 4:00 PM SAS offers help manuals in html format on the web. These web pages are superior to the standard help menu offered with the SAS software installation. If you cant find what you are look
Cal Poly - STAT - 330
12STAT 330Range Lists There are some handy shortcuts for listing variables Numbered range lists are for variables that start with the same characters and end with consecutive numbering Var6 Var7 Var8 is the same as Var6-Var8Lecture Set 5
Cal Poly - STAT - 330
Stat 330: Assignment 4 Due: Wednesday, January 28, 2009 at 11:59:59 pm 1. A company that works with various military contracts (too confidential to even be discussed in detail here) has a data file that contains basic contract initiation information
Cal Poly - STAT - 330
Stat 330: Assignment 2 Due: Wednesday, January 14, 2009 at 11:59:59 pm1. Data were collected on prices in cents per pound received by fishermen and vessel owners for various species of fish and shellfish in 1970 and 1980. The variables in this file
Cal Poly - STAT - 330
1STAT 330Lecture Set 6 Reading Data IIReading Multiple Lines of Data per Observation Typically data sets consist of one observation per line SAS will go to the next line if it runs out of data before it has read all variables from the input
Cal Poly - STAT - 330
1STAT 330Lecture Set 10 More Statistical PROCs2Keeping Track of Stats Can use out= option with most proc's to keep your statistics in a SAS data set useful for presentation or reporting can manipulate these statistics if need be in most
Cal Poly - STAT - 330
12STAT 330Administrative - GeneralLecture Set 1 Goals of course and SAS overview Website: http:/statweb.calpoly.edu/rottesen Office hours Homework Email to: STAT330HW@yahoo.com Labs Quizzes and Groupwork Projects Exams Working toge
Cal Poly - STAT - 330
1STAT 330Lecture Set 8 - Back to the DATA Step2SAS Automatic Variables We already know about automatic variables such as _N_ and _ERROR_ SAS can also create additional automatic variables under certain circumstances When using a by state
Cal Poly - STAT - 330
12STAT 330Drop/Keep It is not always necessary to keep all the variables in a data set Lecture Set 4 How Does SAS Process Data? and a do loop extravaganza sometimes better to only hang on to what is needed for reporting/analysis Use a dro
Cal Poly - STAT - 218
12STAT 218Sampling DistributionsLecture Set 4 Sampling Distributions Definition: Sampling Variability is the variability among random samples from the same population. A probability distribution that characterizes some aspect of sampling
Cal Poly - STAT - 218
Sampling Distributions Definition: Sampling Variability is the variability among random samples from the same population.A probability distribution that characterizes some aspect of sampling variability is calledThe Meta-Experiment A meta-experim
Cal Poly - STAT - 218
12STAT 218ApplicationLecture Set 8 More on the T Test and the Wilcoxon-Mann-Whitney TestExample: Nine observations of surface soil pH were made two different locations. Does the data suggest that the true mean soil pH values differ for the
Cal Poly - STAT - 218
12STAT 218Probability Probability is important to statistics because: study results can be influenced by variation it provides theoretical groundwork for statistical inferenceLecture Set 3 Random Sampling, Probability, and the Normal Dis
Cal Poly - STAT - 218
1STAT 218Statistical Inference Concerning 1 How can we use statistical inference in regression?2Lecture Set 13 More Regression Suppose we would like to investigate the relationship between X and Y If X is telling us nothing about Y, wha
Cal Poly - STAT - 218
12STAT 218Quantitative DataLecture Set 5 Sampling Distribution for the Mean and Introduction to Confidence Intervals More complex than dichotomous data Sample and populations for quantitative data can be described in various ways: mean, m
Cal Poly - STAT - 218
1STAT 218Planning a Study to Estimate It is important before you begin collecting data to consider whether the estimates will be sufficiently precise. Two factors to consider:2Lecture Set 6 Sample Size Calculations and Confidence Interval
Cal Poly - STAT - 218
Two Way ANOVA Name the type of two way ANOVA discussed in our text book.What is the point of blocking?Can blocks consist of more than two observations?Name the three parts that the total variability is split into.Completely RandomizedSS(betw
Cal Poly - STAT - 217
12STAT 217Measures of Center Recall that center is #2 of the BIG three. Measures of center for numerical data include: the mean the median the mode (the value with the highest frequency)Lecture Set 4 Center and Spread These measures a
Cal Poly - STAT - 217
Comparative Bar Chars What is the main purpose of a comparative bar chart?Example: According to the Cal Poly Fact Book 2003-2004 the enrollment by gender and college are as follows.Chart of Enrollment vs College, Gender90Percent of Enrollment
Cal Poly - STAT - 217
Name: STAT 217 HW addition Chapter 5 1. Consider the following data collected on the yield (in bushels) for 19 equal sized plots planted in tomatoes for that were sprayed with different amounts of fertilizer. Amount of Fertilizer (pounds per plot) 25
Cal Poly - STAT - 217
12STAT 217Planning a Study Before we analyze data, much thought and planning should go into how to collect the data Without quality information our conclusions will not mean muchLecture Set 2 Sampling and Design Six steps of the data ana
Cal Poly - STAT - 217
Choose the appropriate scenario: independent, paired, chi-square/goodness of fit, chi-square/contingency table, Oneway ANOVA, Two-way ANOVA, or regression. Is Friday the 13th an unusually unlucky day, or is this just superstition? How do superstition
Cal Poly - STAT - 217
Name: STAT 217 HW addition Chapter 8 1. The serum cholesterol of a population of 18-year-olds is approximately normally distributed with a mean of 176 mg/dLi and a standard deviation of 30 mg/dLi. What percentage of the 18-year-olds have a serum chol
Cal Poly - STAT - 217
Name: STAT 217 HW addition Chapter 151. The following ANOVA table is only partially completed. a. Complete the table.6Plot A5Individual Value PlotSource Treatment Error TotalDF 2SSMS 46.67F _P 0.0004330.00 1421 A B Cb.
Cal Poly - STAT - 217
Name: STAT 217 HW addition Chapters 3 and 41.For the following histograms, estimate the mean and median.Histogram of C114 12 10Histogram of C235 30 25 Frequency 20 15 10 5Frequency8 6 4 2 036424854 C1606672780012
Cal Poly - STAT - 217
Name: STAT 217 HW addition Chapter 4 1. Which of the boxplots below is represented by the histogram? Justify your answer.Boxplot of 1, 2, 3, 412 1 10 2 8 6 4 2 0Histogram3401020 Data304050Frequency01020 230402.Whic
Cal Poly - STAT - 217
Sampling Variability Do we expect that the sample mean x from a random sample will be exactly equal to the population mean ? ExplainWhat if we gathered another random sample of the same size, would this x be exactly equal to ? Explain.Would the
Cal Poly - STAT - 217
12STAT 217Basic Probability We typically think of probability as the chance that an event will occur Probability plays an important role in statistics because it allows us to make decisions with confidence We can think of probability as the
Cal Poly - STAT - 217
Statistics Example What do you think of when you hear statistics?What are three important reasons to learn statistics?Definition: Statistics is the scientific discipline that provides methods to help us make sense of data. To utilize statistics w
Cal Poly - STAT - 217
12STAT 217One Variable Categorical Data So far our experience with hypothesis testing and categorical data is limited to one dichotomous variable We need to expand our thinking to deal with more than two categories The Chi Square Goodness of
Cal Poly - STAT - 217
12STAT 217Correlation Analyze the relationship, if any, between variables x and y by examining the linear strength between them Correlation is a numerical measure of this linear relationshipLecture Set 5 Summary of Bivariate Data can be
Cal Poly - STAT - 217
DISCLAIMER: The size and question content of this review should not be considered an exact replication of the exam. The questions on the exam will be different. The purpose here is to supply you with more questions to practice with. In my opinion the
Cal Poly - STAT - 217
National paper company must purchase a new machine for producing cardboard boxes. The company must choose between two machines, machine 1 and machine 2. Since both machines produce boxes of equal quality, the company would like to choose the new mach
Cal Poly - STAT - 217
One Variable Categorical Data What does one sample data mean?What type of test would we use to compare a categorical variable with more than two groups?What does k stand for?Goodness of Fit Test What are the general forms of the hypotheses? Ho: