BUSINESS ANALYTICS CHAPTER 29 LECTURE OUTLINE Data warehouses - - PowerPoint PPT Presentation

business analytics
SMART_READER_LITE
LIVE PREVIEW

BUSINESS ANALYTICS CHAPTER 29 LECTURE OUTLINE Data warehouses - - PowerPoint PPT Presentation

BUSINESS ANALYTICS CHAPTER 29 LECTURE OUTLINE Data warehouses Comparison with operational databases Multi-dimensional schemas Functionality of a data warehouse 2 DATA WAREHOUSES Data warehouse A subject


slide-1
SLIDE 1

BUSINESS ANALYTICS

CHAPTER 29

slide-2
SLIDE 2

LECTURE OUTLINE

  • Data warehouses
  • Comparison with operational databases
  • Multi-dimensional schemas
  • Functionality of a data warehouse

2

slide-3
SLIDE 3

DATA WAREHOUSES

  • Data warehouse
  • “A subject-oriented, integrated, nonvolatile, time-variant collection
  • f data in support of management’s decisions.” [W.H.Inmon]
  • Data comes from multiple databases
  • Tools to make business decisions quickly and reliably based on

historical data

  • Supported applications
  • OLAP (Online Analytical Processing)
  • Analysis of complex data from data warehouse
  • DSS (Decision Support Systems)
  • Also known as EIS (Executive Information Systems)
  • Provides data and tools for complex decision-making
  • Data mining
  • Knowledge discovery: searching data for unanticipated new

knowledge

3

slide-4
SLIDE 4

INTEGRATED WAREHOUSE AND DB

  • Extract, Transform, and Load (ETL)
  • Extracted from multiple, heterogeneous sources.
  • Includes data cleaning to ensure validity and consistency
  • Back flushing: upgrading the data with cleaned data
  • Analyzed data fed back into operating DB and data management

Databases Data Warehouse Cleaning Reformatting Updates/New Data Back Flushing Other Data Inputs OLAP Data Mining Data Metadata DSSI EIS

4

slide-5
SLIDE 5

DATA WAREHOUSES VS. DBS

  • Operations
  • Data warehouses optimized to find data correlations and to support

trend analyses

  • Traditional databases are transactional: optimized for access,

update, and integrity assurance

  • Data warehouses are less volatile than operational DBs.
  • Data currency
  • Operational DBs required to maintain up-to-date, detailed data
  • Data warehouses characterized by historical data
  • Information in data warehouse is relatively coarse grained (“view

from 10,000 ft.”) and refresh policy is carefully chosen, usually incremental.

  • Data volume
  • Data warehouses may be exceptionally large (7 years of records)
  • Data warehouse can be interpreted as a (special) view of the data.

5

slide-6
SLIDE 6

MULTI-DIMENSIONAL SCHEMAS

  • Multi-dimensional schemas specified using:
  • Dimension table
  • Consists of tuples of attributes of the dimension.
  • Fact table
  • Each tuple is a recorded fact.
  • Some measured or observed variable(s) and references to

dimension tables.

6

slide-7
SLIDE 7

WAREHOUSE FUNCTIONALITY

  • Roll-up: Data is summarized with increasing generalization
  • E.g., going from daily or weekly reports to annual aggregations
  • Drill-Down: Increasing levels of detail are revealed
  • E.g., going from national sales to sales from a particular region
  • Pivot: Cross tabulation is performed from given perspective
  • Slice and dice: Select and project data wrt some dimensions
  • Plus traditional operations
  • Sorting by ordinal value.
  • Selection by value or range.
  • Derived attributes: Attributes are computed by operations on stored

derived values.

7

slide-8
SLIDE 8

DATA MODELING FOR WAREHOUSES

  • OLAP data cube

REGION REG1 REG2 REG3 P123 P124 P125 P126 : : P R O D U C T Two Dimensional Model

: :

Three dimensional data cube P r

  • d

u c t F i s c a l Q u a r t e r Q t r 1 Q t r 2 Q t r 3 Q t r 4 Reg 1

P123 P124 P125 P126

Reg 2 Reg 3 R e g i o n

8

roll up drill down

slide-9
SLIDE 9

LECTURE SUMMARY

  • Glimpse at DBMS support for business analytics
  • Role of data warehouses
  • OLAP cube
  • Model and operations

9