Cleaning Data and Creating New Variables
Recoding Variables
Example
o A summary statistic on the variable AGE returned a maximum value of 9999 and a minimum value of -1
o Why do we care?
o These values don't make sense, so check t
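
A minimal sketch of how the implausible AGE values in the example could be recoded as missing. The data, the plausible range of 0 to 120, and the helper name recode_age are all assumptions made for illustration; they are not taken from the notes or from any particular dataset.

```python
# Hypothetical AGE values; 9999 and -1 are the implausible codes from the example.
ages = [34, 9999, 27, -1, 58, 41]

# Assumed plausible range for age; in practice, take this from the codebook.
MIN_AGE, MAX_AGE = 0, 120


def recode_age(value):
    """Return the age if it is plausible, otherwise None (treated as missing)."""
    if MIN_AGE <= value <= MAX_AGE:
        return value
    return None


# Recode every observation, then keep only the non-missing values for summaries.
ages_clean = [recode_age(a) for a in ages]
valid = [a for a in ages_clean if a is not None]

print("before recoding: min =", min(ages), "max =", max(ages))
print("after recoding:  min =", min(valid), "max =", max(valid))
print("values recoded to missing:", ages_clean.count(None))
```

Recoding to missing, rather than dropping the rows, keeps the other variables for those observations available; whether that is appropriate depends on the analysis.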
7 October 2013
Chapter 2: Notes
Simple and Multiple Regressions
To calculate R² (the proportion of variation that is explained by our regression model), we use:
TSS: Total Sum of Squares, the total variation in the dependent variable (y)
o TSS = SSE + SSR
SSE: Explained Sum
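
The decomposition can be checked numerically with a small sketch. The data below are hypothetical, and the code assumes that SSE is the explained sum of squares and SSR is the residual sum of squares, so that R² = SSE / TSS = 1 - SSR / TSS. The notes do not define SSR explicitly, so that reading is an assumption.

```python
# Hypothetical data for a simple regression of y on x.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# OLS slope and intercept for a single regressor.
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# Fitted values from the estimated line.
y_hat = [intercept + slope * xi for xi in x]

# TSS: total variation in y around its mean.
tss = sum((yi - y_bar) ** 2 for yi in y)
# SSE: variation of the fitted values around the mean (explained, by assumption).
sse = sum((fi - y_bar) ** 2 for fi in y_hat)
# SSR: variation of y around the fitted values (residual, by assumption).
ssr = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))

r_squared = sse / tss

print("TSS:", round(tss, 3))
print("SSE + SSR:", round(sse + ssr, 3))
print("R squared:", round(r_squared, 3))
```

With an intercept in the model, TSS and SSE + SSR agree up to floating-point rounding, which is why the two ways of writing R² give the same number.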