Notes 3 - Chapter 3 Producing Data

Chapter 3 - Producing Data

Some questions to ask before producing data:
What is the group of interest?
What information about the group are we interested in?
How do we collect this information?

3.1 Design Samples

Population, Sample:
The population in a statistical study is the entire group of individuals about which we want information.
-Often, it is too time-consuming or expensive to obtain data from the entire population.
A sample is a part of the population from which we actually collect information used to draw conclusions about the whole.
-We use the sample to draw conclusions about the entire population.
-We call this process inference.

Example 1: A department store mails a customer satisfaction survey to people who make credit card purchases at the store. This month, 45,000 people made credit card purchases. Surveys are mailed to 1000 of these people, chosen at random, and 137 people return the survey form. What is the population for this survey? What is the sample from which information was actually obtained?
Population:
Sample:

Example 2: A political scientist wants to know how college students feel about the Social Security system. She obtains a list of the 2356 undergraduates at her college and mails a questionnaire to 250 students selected at random. Only 104 questionnaires are returned.
Population:
Sample:

Collecting Data:
Can we collect information from the entire population easily/quickly/cheaply? If so, perform a census. If not, how should we sample from the population?

A Census: The collection of data from every member of a population. A census is the only way to find the true value of a parameter. A census is often very impractical, meaning we must use a sample instead.
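The difference between a census and a sample can be sketched with a short simulation. This is only an illustration under stated assumptions: the satisfaction scores are made up, only the sizes 45,000 and 1000 are borrowed from Example 1, and the sketch ignores the fact that not every mailed survey is returned.

```python
import random

rng = random.Random(3)  # fixed seed so the run is repeatable

# Hypothetical population: a satisfaction score (1 to 5) for each of
# 45,000 customers. The scores are invented for illustration.
population = [rng.randint(1, 5) for _ in range(45_000)]

# Census: use every member of the population.
# This gives the true value of the parameter (here, the mean score).
true_mean = sum(population) / len(population)

# Sample: 1000 customers chosen at random, without replacement.
sample = rng.sample(population, 1000)
sample_mean = sum(sample) / len(sample)

print(f"census mean (parameter): {true_mean:.3f}")
print(f"sample mean (estimate):  {sample_mean:.3f}")
```

The sample mean will usually land close to the census mean, but not exactly on it. Using the sample value to draw a conclusion about the population value is the inference step described above.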

Representative Sample: A sample in which the relevant characteristics are generally the same as the characteristics of the population. Such a sample will contain little or no bias.

Sampling Design: The sampling design is the method used to select a sample from a population. It is important to select the sample at random so that bias is not introduced into the responses. Samples should be as representative as possible. A biased sample is one that systematically favors certain outcomes.

Bad Sampling Methods: Sampling badly produces data that do not accurately represent the population they are meant to describe.

Bias: The design of a study is biased if it systematically favors certain outcomes.

Bad Sampling Method 1) Convenience Sampling: A very cheap and convenient way to collect data.
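What "systematically favors certain outcomes" looks like can be sketched with a small simulation. Everything here is a made-up illustration, not data from the notes: a hypothetical population of 10,000 students, an assumed 20% of whom are heavy library users, and a convenience sample modeled as asking only students who happen to be in the library.

```python
import random

rng = random.Random(5)  # fixed seed so the run is repeatable

# Hypothetical population: weekly library hours for 10,000 students.
# Assumption: 20% are heavy users (about 10 hours), the rest light (about 2).
population = []
for _ in range(10_000):
    heavy = rng.random() < 0.20
    hours = rng.gauss(10, 2) if heavy else rng.gauss(2, 1)
    population.append((heavy, max(hours, 0.0)))  # hours cannot be negative

true_mean = sum(h for _, h in population) / len(population)

# Random sample: every student is equally likely to be chosen.
random_sample = rng.sample(population, 200)
random_mean = sum(h for _, h in random_sample) / len(random_sample)

# Convenience sample: the first 200 students found in the library,
# modeled here as heavy users only.
in_library = [p for p in population if p[0]]
convenience_sample = in_library[:200]
convenience_mean = sum(h for _, h in convenience_sample) / len(convenience_sample)

print(f"population mean:         {true_mean:.2f}")
print(f"random sample mean:      {random_mean:.2f}")
print(f"convenience sample mean: {convenience_mean:.2f}")
```

Under these assumptions the random sample lands near the population mean, while the convenience sample overstates library use every time it is drawn. That consistent error in one direction is the bias.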
