Data Mining: Exploring Data Lecture Notes for Chapter 3
Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Look for accompanying R code on the course web site.
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by - - PowerPoint PPT Presentation
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for accompanying R code on the course web site. Topics Exploratory Data Analysis Summary Statistics
Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Look for accompanying R code on the course web site.
tools
Engineering Statistics Handbook http://www.itl.nist.gov/div898/handbook/index.htm
http://www.ics.uci.edu/~mlearn/MLRepository.html
National Technical Center, Chester, PA. Courtesy of USDA NRCS Wetland Science Institute.
cases has a smaller value & 50% are larger
m
25th percentile – 1.5 IQR 25th percentile 75th percentile 50th percentile 75th percentile + 1.5 IQR
IQR
Q1 Q3 IQR Median Q3 + 1.5 × IQR Q1 − 1.5 × IQR −0.6745 σ 0.6745 σ 2.698 σ −2.698 σ 50% 24.65% 24.65% −4 σ −3 σ −2 σ −1 σ 0 σ 1 σ 3 σ 2 σ 4 σ
An example for Sea Surface Temperature (SST) is
Celsius
standard deviation
Setosa Versicolor Virginica