1
An Overview of Data Warehousing and OLAP Technology
Pre se nte r: Otto Disc ussion: Jim Ma r. 07 2006
2
- What is de c ision suppor
t
- What is a data war
e house
- Why we ne e d it, and how it diffe r
s fr
- m a
re gular RDBMS
- Diffe r
e nc e be twe e n OL AP and OL T P
- T
ypic al OL AP ar c hite c tur e
- Database De sign Me thodology
- Star
and Snowflake sc he ma s
- Challe nge s of mate r
ialize d vie ws
- Imple me ntation of the OL
AP Se r ve r
- Me tadata r
e quir e me nts
Outline
3
Motive for a Data Warehouse
- Busine sse s ha ve a lot of data, ope r
ational data and fac ts.
- T
his da ta is usually in diffe re nt da ta ba se s and in diffe r e nt physic al plac e s.
- De c ision make r
s ne e d to ac c e ss infor mation (data that ha s be e n summar ize d) vir tually on the single site .
- T
his ac c e ss ne e ds to be fa st re gar dle ss of the size of the data, and how old the data is.
4
What is decision support
- De c ision suppor
t syste ms are a c lass of c ompute r ize d infor mation syste ms that suppor t de c ision making ac tivitie s.
- De c ision support syste ms usually re quire
c onsolidating data form ma ny he te r
- ge ne ous sour
c e s: the se might inc lude e xte r nal sour c e s.
- Suc h us stoc k mar
ke t fe e ds.
5 Retains the history Major subject areas Data from different data sources. Changes as new data trickle in
What is data warehouse
- Data war
e housing pr
- vide s ar
c hite c tur e s and tools for busine ss e xe c utive s to syste matic ally or ganize , unde r stand a nd use the ir data to make str ate gic de c isions. – Jiawe i Han
- A data war
e house is a subje c t- or ie nte d, inte gr ate d, time - var iant, a nd non-vola tile c olle c tion of data in suppor t of manage me nt’s de c ision ma king pr
- c e ss.
6
Difference between OLAP and OLTP
Que r y thr
- ug hput
T ransac tion throughput
Me tr ic
100 GB- T B 100 MB-GB
DB size
Hundr e ds thousands
# use rs
Millions te ns
# re c ac c e sse d
Comple x que r y Shor t, simple tr ansa c tion
Unit of work
L
- ts of sc a ns
Re ad/ wr ite
Ac c e ss
Ad- hoc r e pe titive
Usage
Histor ic al, summar ize d, multidime nsional,… Cur re nt, up-to-date de taile d.
Data
Subje c t-or ie nte d Applic ation- or ie nte d
DB De sign
De c ision suppor t Day to day ope rations
F unc tion
Knowle dg e wor ke r Cle r k, IT pr
- fe ssional
Use r s OL AP OL T P