o Push publishing deliver BI without request o Pull publishing requires user to

O push publishing deliver bi without request o pull

This preview shows page 9 - 14 out of 41 pages.

o Push publishing – deliver BI without request o Pull publishing – requires user to request BI results °°Data warehouse and data marts. What they are and their differences?
° ° Problems with operational data Dirty data Missing value Inconsistent data Data not integrated Wrong granularity o Too fine o Not fine enough Too much data o Too many attributes o Too many data points ° ° Reporting applications RFM (Recently frequently money) o Analyze and rank customers according to their purchasing patterns o R – recent orders o F – frequent orders o M – money (amount) of money spent OLAP o Online analytical processing
o Provides the ability to sum, count, average, and other simple arithmetic operations on groups of data o Dynamic hence the term ‘online’ o Dimension: a characteristic of a measure (e.g. purchase data, customer type, etc.) o Measure: a data item of interest. It is the item to be processed (e.g. total sales, average sales, etc.) ° ° Data mining applications Data mining is the application of statistical techniques to find patterns and relationships among data for classification and prediction Became popular due to large amount of data produced in 15 years (a result of cheap hardware) A convergence of many discipline Two broad categories: unsupervised and supervised Two typical techniques: market basket analysis and decision trees ° ° Supervised and unsupervised data mining Unsupervised: o Analyst does not start with a prior hypothesis or model o Hypothesized model created based on analytical results (later) to explain patterns found o Example: cluster analysis Supervised: o Uses a priori model to compute outcome of model o Prediction, such as regression analysis °
° Big data applications Big data is defined as: o Huge volume – petabyte and larger o Rapid velocity – generated rapidly o Great variety Structured data, free-form text, log files, graphics, audio, and video ° ° BI server Web server for publishing of BI MS SQL server mgr is most popular today Provides two major functions o Management (metadata of users, etc.) o delivery ° ° KM, CMS and expert systems KM (knowledge management) o Earliest KM system is expert system CM o Support management and delivery of documents, other expressions of employee knowledge o Challenges of content management Databases are huge
Content dynamic Documents do not exist in isolation Contents are perishable In many languages o In-house custom development Customer support department develops in house database applications to tract customers problems o Off the shelf Horizontal market products (share point) Vertical market application o Public search engine Google ° ° CHAPTER 10: °

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture