Tuesday, 16 February 2016

MGT300 Chapter 8 : Accessing Organizational Information- Data Warehouse

DATA WAREHOUSE FUNDAMENTALS

- A data warehouse is a logical collection of information-gathered from many different operational database-that supports business analysis activities and decision-making tasks.
- The primary purpose of a data warehouse is to aggregate information throughout an organization into a single repository in such a way that employees can make decisions and undertake business analysis activities.
- The data warehouse then send subsets of the information to data mart.
- A data mart contains a subsets of data warehouse information


*Figure above show compiles information from internal database or transactional database and external database through extraction, transformation and loading (ETL) which a process that extracts information from internal and external database, transforms the information using a common set of enterprise definitions, and loads the information into a data warehouse.


MULTIDIMENSIONAL ANALYSIS AND DATA MINING

Relational Database contains information in a series of two-dimensional tables.
- In a data warehouse and data mart, information is multidimensional, it contains layers of columns and rows
       
              Dimension – A particular attribute of information



  Cube – common term for the representation of multidimensional information



- Once a cube of information is created, users can begin to slice and dice the cube to drill down into the information.
- Users can analyze information in a number of different ways and with number of different dimensions.
- Data Mining > the process of analyzing data to extract information not offered by the raw data alone. Also known as “knowledge discovery” – computer-assisted tools and techniques for sifting through and analyzing vast data stores in order to finds trends, patterns and correlations that can guide decision making and increase understanding
- To perform data mining users need data-mining tools
-          Data-mining tool > uses a variety of techniques to finds patterns and relationships in large volumes of information. Eg: retailers and use knowledge of these patterns to improve the placement of items in the layout of a mail-order catalog page or Web page.


INFORMATION CLEANSING OR SCRUBBING

- Information cleansing or scrubbing is a process that weeds out and fixes or discards inconsistent, incorrect or incomplete information.
- It occur during ETL process and second on the information once if is in the data warehouse
Ø  Contract information in an operational system
Ø  Standardizing Customer  name from Operational Systems
Ø  Information cleansing activities
-          Missing Records or Attributes
-          Redundant Records
-          Missing Keys or Other Required Data
-          Erroneous Relationships or References
-          Inaccurate Data

Ø  Accurate and complete information



BUSINESS INTELLIGENCE

- Business Intelligence refers to application and technologies that are use to gather, provide access to, and analyze data and information to support decision-making efforts. 

Enabling Business Intelligence
- Technology
- People
- Culture

No comments:

Post a Comment