STAT1303A Data Management
1. Introduction to Data Management
Before introducing the concept of data management, we first cover some
related concepts.
Statistics is the science of collecting, organizing, presenting, analyzing, and
interpreting data to assist in making more effective decisions.
Simply speaking, Statistics is based on numbers (data). For example,
The total number of postgraduate students decreased from 1,654 in 2004/5 to
1,406 in 2005/6.
The proportion of postgraduate students who are from the Faculty of
Science is 35.5%.
Based on a sample survey of 100 postgraduate students, the proportion of
students who are from the Faculty of Science is 36%.
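The arithmetic behind these examples can be checked with a short Python sketch. The variable names are illustrative, and the count of 36 Science students in the sample is an assumption inferred from the stated 36% of 100 respondents; it is not given directly in the notes.

```python
# Postgraduate totals quoted in the examples above.
total_2004 = 1654   # 2004/5
total_2005 = 1406   # 2005/6

# Absolute and relative decrease between the two years.
decrease = total_2004 - total_2005
pct_decrease = 100 * decrease / total_2004

# Sample survey: 100 students, of whom 36 are assumed to be from Science
# (inferred from the stated 36%).
sample_size = 100
science_in_sample = 36
sample_proportion = science_in_sample / sample_size

print(decrease)                 # absolute drop in student numbers
print(round(pct_decrease, 1))   # relative drop, in percent
print(sample_proportion)        # proportion observed in the sample
```

Note that the 35.5% figure describes all postgraduate students, while the 36% figure is computed from a sample of 100, so the two need not agree exactly.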
A variable is a characteristic that varies from one person or object to another.
Data are then the values observed for those variables.
Variables for a student - Age, Marks in STAT1303 and Gender.
Data for a student - For student A, we have his/her data of 20 yr, 67.5 and
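One way to picture the difference between variables and data is as a small record, sketched here in Python. The field names are hypothetical, and the gender value is left as None because it does not appear in the text above.

```python
# Variables: the characteristics recorded for every student.
variables = ("age", "marks_stat1303", "gender")

# Data: the values observed for one particular student (student A).
# The gender value is missing from the notes, so it is left unset here.
student_a = {
    "age": 20,               # in years
    "marks_stat1303": 67.5,  # marks in STAT1303
    "gender": None,          # not shown in the source text
}

# Each variable has exactly one observed value (possibly missing) per student.
for name in variables:
    print(name, student_a[name])
```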
Data Management and Data Management
Large-scale data collection and analysis are common today. For example, in academic
research, huge amounts of genetic data are investigated in biomedical research, and
survey data in social science research. In particular, the genetic data studied
in biomedical research can help to develop new methods to treat diseases. In
government research, enormous amounts of survey data are obtained from the census and
the general household survey at regular intervals. Social policies, e.g. on
transportation, can then be determined from the observed survey data, which indicate the
demographic differences among the various districts of Hong Kong. In the area of business
research, business opportunities can be discovered from the results of marketing
research and data mining after huge amounts of data about potential customers are
studied. Consequently, efficient data management can provide reliable and accurate
information for decision making in various areas.
HKU STAT1303A (2009-10, Semester 1)