University of Maryland University College

STAT200 - Assignment #1: Descriptive Statistics Data Analysis Plan

Identifying Information

Student (Full Name): Yesenia Vasquez Martinez

Class: Statistics 200 – Assignment #1: Descriptive Statistics Data Analysis Plan

Instructor: tdr

Date:

Scenario: Please write a few lines describing your scenario and the four variables (in addition to income) you have selected.

Use Table 1 to report the variables selected for this assignment. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.

Table 1. Variables Selected for the Analysis

Variable Name in the Data Set | Description (See the data dictionary for describing the variables.) | Type of Variable (Qualitative or Quantitative)
Variable 1: “Income” | Annual household income in USD. | Quantitative
Variable 2: | Total Amount of Annual Expenditure on Food | Quantitative
Variable 3: | |
Variable 4: | Marital Status of Head of Household | Qualitative
Variable 5: | | Quantitative

Reason(s) for Selecting the Variables and Expected Outcome(s):

1. Variable 1: “Income” - Income sets the upper limit of the budget, so it is an important factor in helping the household establish its budget.

The higher the income, the higher the planned budget will be, and the lower the income, the lower the planned budget will be.

2. Variable 2: “Food” - Food is an essential expense, and a budget cannot be completed without it. Food is therefore an essential part of budget planning for each household.

3. Variable 3: “ ” -

4. Variable 4: “Marital status” - The marital status of the head of the household indicates the household's routine expenses. A married head of household typically has higher expenses than a single person. Marital status is therefore one of the key factors that affect family budget planning.

5. Variable 5: “Family size” - The purpose of the data is to plan the household budget. More family members mean more expenses, so it is important to know the number of family members.

Data Set Description:

Proposed Data Analysis:

Measures of Central Tendency and Dispersion
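As a rough illustration of the three summaries proposed for “Income” in Table 2 (number of observations, median, and sample standard deviation), the sketch below uses Python's standard `statistics` module. The income values and the `summarize` helper are hypothetical and are not taken from the course data set; they include one deliberately large value to show why the median may be preferred when outliers are present.

```python
import statistics

# Hypothetical annual household incomes in USD (illustrative values only,
# not taken from the course data set). 250000 acts as an outlier.
incomes = [52000, 61000, 58500, 47000, 250000, 55000, 63000]


def summarize(values):
    """Return the three summaries proposed for Income in Table 2."""
    return {
        "n": len(values),                       # number of observations
        "median": statistics.median(values),    # middle value, resistant to outliers
        "sample_sd": statistics.stdev(values),  # sample SD (divides by n - 1)
    }


summary = summarize(incomes)
mean_income = statistics.mean(incomes)

print("n =", summary["n"])
print("median =", summary["median"])
print("mean =", round(mean_income, 2))
print("sample SD =", round(summary["sample_sd"], 2))
```

With these made-up values the single large income pulls the mean well above the median, which is the situation the rationale in Table 2 describes.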

Complete Table 2. Numerical Summaries of the Selected Variables and briefly explain why you chose those measurements. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.

Table 2. Numerical Summaries of the Selected Variables

Variable Name | Measures of Central Tendency and Dispersion | Rationale for Why Appropriate
Variable 1: “Income” | Number of Observations; Median; Sample Standard Deviation | I am using median for two reasons: (1) If there are any outliers or the data is not normally distributed, the median is the best measure of central tendency. (2) The variable is quantitative. I am using sample standard deviation for three reasons: (1) The data is a sample from a larger data set. (2) It is the most commonly used measure of dispersion.
Variable 2: Marital Status of Head of Household | |
Variable 3: | |
Variable 4: | |
Variable 5: | |

Graphs and/or Tables

Complete Table 3. Type of Graphs and/or Table for Selected Variables and briefly explain why you chose those graphs and/or tables. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
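The histogram proposed for “Income” in Table 3 groups the values into equal-width bins and counts how many fall in each. A minimal sketch of that counting step follows, printed as a text histogram. The income values, the bin width, the starting edge, and the `histogram_counts` helper are all assumptions chosen for illustration; in practice a spreadsheet or plotting tool would draw the bars.

```python
# Hypothetical annual household incomes in USD (illustrative values only,
# not taken from the course data set).
incomes = [52000, 61000, 58500, 47000, 98000, 55000, 63000, 71000, 49500, 84000]

BIN_WIDTH = 10000  # assumed bin width in USD
START = 40000      # assumed lower edge of the first bin (must be <= min value)


def histogram_counts(values, bin_width, start):
    """Count values in equal-width bins [start + k*w, start + (k+1)*w)."""
    n_bins = int((max(values) - start) // bin_width) + 1
    counts = [0] * n_bins
    for v in values:
        counts[int((v - start) // bin_width)] += 1
    return counts


counts = histogram_counts(incomes, BIN_WIDTH, START)

# Print one row per bin; the number of '#' marks is the bin's frequency.
for k, count in enumerate(counts):
    low = START + k * BIN_WIDTH
    print(f"{low:>6}-{low + BIN_WIDTH - 1:>6} | {'#' * count}")
```

The shape of the printed bars (symmetric, skewed, or with isolated bars far from the rest) is what the histogram is meant to reveal about the variable.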

Table 3. Type of Graphs and/or Tables for Selected Variables

Variable Name | Graph and/or Table | Rationale for Why Appropriate
Variable 1: “Income” | Graph: I will use a histogram to show the distribution of the data. | A histogram is one of the best plots for showing the shape of the distribution of quantitative data, including whether it is approximately normal.
Variable 2: | |
Variable 3: | |
Variable 4: | |

