Shown 12 12 964 1 t10js99a tjs99a

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: 7 Fair2.0: 22 Fair : 43 Average: 829 BAvg2.5: 125 BAvg2.5: 42 Average: 685 AbvAvrg: 666 Avrg3.0: 439 Avrg3.0: 903 Good : 688 Good : 86 AAvg3.5: 447 AAvg3.5: 297 Excllnt: 106 Good4.0: 351 Good4.0: 227 VGd 4.5: 164 VGd 4.5: 81 Excl5.0: 51 Excl5.0: 33 Excl5.5: 12 Table 3: Summary statistics for a selection of ordinal variables in the Ames housing dataset. 12 There are a lot of interesting variables to explore in this dataset, but to save space, we will only focus on a few in the following plots. In Figure 1, histograms of the variables sale price and lot area are shown, along with a scatterplot of the two. Both of these variables are right skewed with some possible outliers in the upper tails. The bivariate distribution of the two variables is difficult to interpret, as there is not a clear-cut relationship. There is some odd behavior with lot areas smaller than 5,000 square feet, which could indicate different types of houses/properties being sold. The relationship is a little more straight-forward for houses greater than 5,000 square feet, though, as there is a clear positive association. Distribution of Sale Price Distribution of Lot Area (Outliers Surpressed) Sale Price by Lot Area (outliers surpressed) 350 $600,000 400 300 250 $400,000 150 Sale Price Count Count 300 200 200 Overall Quality Overall Condition Basement Quality Kitchen Quality Poor1.0: 1 Poor1.0: 3 None : 91 Poor : 1 Poor1.5: 8 Poor1.5: 7 Poor : 2 Fair : 33 Fair2.0: 17 Fair2.0: 22 Fair : 43 Average: 829 Sale Value Lot Area (SF) Lot Area (SF) BAvg2.5: 125 BAvg2.5: 42 Average: 685 AbvAvrg: 666 Figure 1: Univariate and bivariate distributions of: Sale Price ($) and : 86 Avrg3.0: 439 Avrg3.0: 903 Good 688 Good Lot Area (SF). Note: Two houses have more than 75,000 square feet lot AAvg3.5: and are surpressed in these images. Excllnt: 106 447 AAvg3.5: 297 area, Good4.0: 351 Good4.0: 227 VGd 4.5: 164 VGd 4.5: 81 Excl5.0: 51 Excl5.0: 33 Excl5.5: 12 Neighborhood Analysis Table 3: Summary statistics for a selection of ordinal variables in the 100 $200,000 100 50 0 0 $0 $200,000 $400,000 $600,000 0 10000 20000 30000 40000 50000 60000 70000 0 10000 20000 30000 40000 50000 60000 70000 One of the questions Ames housing dataset. answering relates to the di...
View Full Document

Ask a homework question - tutors are online