HW1-1

Course: STATS 1181, Fall 2010
School: Langara
Word Count: 835

1. Descriptive Statistics and Graphs for PLACE

Frequency Table for PLACE

Class  Value    Frequency  Relative Frequency  Cumulative Frequency  Cum. Rel. Frequency
1      Africa    5         0.1000               5                    0.1000
2      America  15         0.3000              20                    0.4000
3      Asia     24         0.4800              44                    0.8800
4      Europe    6         0.1200              50                    1.0000

The StatAdvisor
This table shows the number of times each value of PLACE occurred, as well as percentages and cumulative statistics. For example, in 5 rows of the data file PLACE equaled Africa. This represents 10.0% of the 50 values in the file. The rightmost two columns give cumulative counts and percentages from the top of the table down.

[Figure: Barchart for PLACE. Categories: Africa, America, Asia, Europe; axis: frequency, 0 to 24.]

2. Define one statistic from the data on PLACE and calculate it.
- The mode is the category with the most occurrences. In this study, the mode is "Asia" because Asia holds the largest percentage of occurrences in the sample with 48%.
- For Asia: Frequency of Asia / Number of all outcomes = 24/50 = 0.48

3. Describe the distribution of PLACE by referring to either or both graphs.
The pie chart above graphically shows the geographical distribution of the 50 destructive earthquakes that were sampled. 10% of the sampled earthquakes occurred in Africa, 48% occurred in Asia, 30% occurred in America, and 12% of them occurred in Europe.

4. Produce descriptive statistics and graphs for the variable MAGN

Summary Statistics for MAGN
Count               50
Average             7.748
Median              7.8
Standard deviation  0.705732
Minimum             5.8
Maximum             8.9

The StatAdvisor
This table shows summary statistics for MAGN. It includes measures of central tendency, measures of variability, and measures of shape. Of particular interest here are the standardized skewness and standardized kurtosis, which can be used to determine whether the sample comes from a normal distribution.
Values of these statistics outside the range of -2 to +2 indicate significant departures from normality, which would tend to invalidate any statistical test regarding the standard deviation. In this case, the standardized skewness value is within the range expected for data from a normal distribution. The standardized kurtosis value is within the range expected for data from a normal distribution.

[Figure: Box-and-Whisker Plot of MAGN. Axis: MAGN, 5.8 to 9.8.]

Stem-and-Leaf Display for MAGN: unit = 0.1   1|2 represents 1.2

   2    5|89
   2    6|
   5    6|588
  15    7|0111113344
 (17)   7|55566777888889999
  18    8|0133333444
   8    8|56667799

The StatAdvisor
This display shows a frequency tabulation for MAGN. The range of the data has been divided into 7 intervals (called stems), each represented by a row of the table. The stems are labelled using one or more leading digits for the data values falling within that interval. On each row, the individual data values are represented by a digit (called a leaf) to the right of the vertical line. This results in a histogram of the data from which you can recover at least two significant digits for each data value. If there are any points lying far away from most of the others (called outside points), they are placed on separate high and low stems. In this case, there are no outside points. Outside points are illustrated graphically on the box-and-whisker plot, which you can access via the list of Graphical Options. The leftmost column of numbers are depths, which give cumulative counts from the top and bottom of the table, stopping at the row which contains the median.

[Figure: Histogram of MAGN. x-axis: MAGN, 5.5 to 9.5; y-axis: percentage, 0 to 30.]

5. The standard deviation measures dispersion around the mean, in the original units of the data. The value reported in the summary statistics is the sample standard deviation. The number 0.71 tells us that the sampled earthquakes typically differ from the average of 7.748 by about 0.71 units of Richter magnitude.

6. It is difficult to assess the aim of the study without more information on what the population of the study was. Our case study's sample was not randomly chosen to represent all earthquakes since 1905. The sample was taken from destructive earthquakes, where a judgement was made on the definition of "destructive". Therefore, I see the standard deviation as a statistic, because it describes a characteristic of the sample, not of the population. For the standard deviation to be considered a parameter, it would have to describe the population itself. A larger, random sample would be needed to get closer to the standard deviation of the population.

7.

8. Q1 = 7.3 and Q3 = 8.3
Q3 = 8.3 is the 75th percentile: about 75% of the magnitudes in the sample are at or below 8.3. A value above 8.3 is in the top quarter of the sample, but that alone does not make it an outlier. Under the usual 1.5 × IQR rule, with IQR = 8.3 - 7.3 = 1.0, a value is an outlier only if it is above 8.3 + 1.5 = 9.8 or below 7.3 - 1.5 = 5.8. No magnitude in the sample falls outside these limits.

9. From the stem-and-leaf display, the two 8 stems contain 10 + 8 = 18 earthquakes, taking 'great' to mean a magnitude of 8.0 or higher, which is what those two stems cover. The depth of 18 on the first 8 stem is a cumulative count that already includes the 8 values on the last stem, so the two depths should not be added together. The percentage of earthquakes considered great is 18/50 = 36%.

10. The histogram is negatively skewed, in other words, left skewed. This shape is consistent with the median being greater than the mean: the median is 7.8 and the mean is 7.748.

11. [Figure: Box-and-Whisker Plot of MAGN by PLACE. Groups: Africa, America, Asia, Europe; axis: MAGN, 5.8 to 9.8.]

12. The box-and-whisker plot shows the five-number summary for each region: the minimum and maximum magnitudes in the sample, the median, and the 1st and 3rd quartiles. The plot also shows that there are no outliers. The absence of outliers may be due to the non-random sampling.
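
The relative and cumulative frequencies in the frequency table for PLACE, and the mode reported in item 2, can be rechecked with a short calculation. The following is a minimal Python sketch, not part of the original Statgraphics output; the variable names are illustrative, and the only data used are the four counts from the table.

```python
from itertools import accumulate

# Counts copied from the frequency table for PLACE.
counts = {"Africa": 5, "America": 15, "Asia": 24, "Europe": 6}

total = sum(counts.values())

# Relative frequency = frequency / number of all outcomes.
relative = {place: n / total for place, n in counts.items()}

# Cumulative counts run from the top of the table down.
cumulative = dict(zip(counts, accumulate(counts.values())))
cumulative_relative = {place: c / total for place, c in cumulative.items()}

# The mode is the category with the largest frequency.
mode = max(counts, key=counts.get)

print(f"{'Value':<8} {'Freq':>4} {'Rel':>7} {'Cum':>4} {'CumRel':>7}")
for place in counts:
    print(
        f"{place:<8} {counts[place]:>4} {relative[place]:>7.4f} "
        f"{cumulative[place]:>4} {cumulative_relative[place]:>7.4f}"
    )
print(f"Mode: {mode} ({relative[mode]:.2f} of the sample)")
```

Keeping the counts in table order matters here, because the cumulative columns depend on the order of the rows.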
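
Because the stem-and-leaf display for MAGN lists every magnitude to one decimal place, the summary statistics, quartiles, and outlier limits can be recomputed from it. The sketch below is one way to do that in Python, under stated assumptions: the quartiles are taken as the medians of the lower and upper halves of the sorted data, the outlier limits use the common 1.5 × IQR rule, and the cutoff of 8.0 for a 'great' earthquake is an assumption, since the write-up does not state the threshold. Statgraphics may use different conventions internally.

```python
import statistics

# Stem-and-leaf rows copied from the display for MAGN (unit = 0.1).
rows = [
    (5, "89"),
    (6, ""),
    (6, "588"),
    (7, "0111113344"),
    (7, "55566777888889999"),
    (8, "0133333444"),
    (8, "56667799"),
]

# Work in integer tenths (7.8 -> 78) so that comparisons are exact.
tenths = sorted(stem * 10 + int(leaf) for stem, leaves in rows for leaf in leaves)
n = len(tenths)

mean = sum(tenths) / (10 * n)
median = statistics.median(tenths) / 10
sample_sd = statistics.stdev(tenths) / 10  # n - 1 in the denominator

# Quartiles as medians of the lower and upper halves (an assumed convention).
half = n // 2
q1_tenths = statistics.median(tenths[:half])
q3_tenths = statistics.median(tenths[n - half:])
iqr_tenths = q3_tenths - q1_tenths

# 1.5 * IQR limits; values strictly outside them count as outliers.
lower_tenths = q1_tenths - 1.5 * iqr_tenths
upper_tenths = q3_tenths + 1.5 * iqr_tenths
outliers = [x / 10 for x in tenths if x < lower_tenths or x > upper_tenths]

# ASSUMPTION: 'great' means magnitude 8.0 or higher.
GREAT_CUTOFF_TENTHS = 80
great_count = sum(1 for x in tenths if x >= GREAT_CUTOFF_TENTHS)
great_share = great_count / n

print(f"count={n} mean={mean:.3f} median={median} sd={sample_sd:.6f}")
print(f"Q1={q1_tenths / 10} Q3={q3_tenths / 10} IQR={iqr_tenths / 10}")
print(f"outlier limits: {lower_tenths / 10} to {upper_tenths / 10}; outliers: {outliers}")
print(f"great earthquakes: {great_count} of {n} ({great_share:.0%})")
print("median > mean (consistent with left skew):", median > mean)
```

Counting leaves row by row, rather than adding the depth column, avoids double counting, because each depth is already a cumulative count.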
