Statistics for Machine Learning Prof. Seungchul Lee Industrial AI - - PowerPoint PPT Presentation

โ–ถ
statistics for machine learning
SMART_READER_LITE
LIVE PREVIEW

Statistics for Machine Learning Prof. Seungchul Lee Industrial AI - - PowerPoint PPT Presentation

Statistics for Machine Learning Prof. Seungchul Lee Industrial AI Lab. Statistics and Probability statistics data model probability 2 Populations and Samples A population includes all the elements from a set of data A parameter is a


slide-1
SLIDE 1

Statistics for Machine Learning

  • Prof. Seungchul Lee

Industrial AI Lab.

slide-2
SLIDE 2

Statistics and Probability

data model statistics probability

2

slide-3
SLIDE 3

Populations and Samples

  • A population includes all the elements from a set of data
  • A parameter is a quantity computed from a population

โ€“ mean, ๐œˆ โ€“ variance, ๐œ2

  • A sample is a subset of the population.

โ€“ one or more observations

  • A statistic is a quantity computed from a sample

โ€“ sample mean, าง ๐‘ฆ โ€“ sample variance, ๐‘ก2 โ€“ sample correlation, ๐‘‡๐‘ฆ๐‘ง

3

slide-4
SLIDE 4

How to Generate Random Numbers

  • Data sampled from population/process/generative model

4

slide-5
SLIDE 5

Histogram

  • Graphical representation of data distribution

โ‡’ rough sense of density of data

... ...

bin counts/freq

5

slide-6
SLIDE 6

Inference

  • True population or process is modeled probabilistically
  • Sampling supplies us with realizations from probability model
  • Compute something, but recognize that we could have just as easily gotten a different set of

realizations

6

slide-7
SLIDE 7

Inference

7

slide-8
SLIDE 8

Inference

  • We want to infer the characteristics of the true probability model from our one sample.

8

slide-9
SLIDE 9

The Law of Large Numbers

  • Sample mean converges to the population mean as sample size gets large
  • True for any probability density functions

9

slide-10
SLIDE 10

Sample Mean and Sample Size

  • Sample mean and sample variance

10

slide-11
SLIDE 11

The Central Limit Theorem

  • Sample mean (not samples) will be approximately normally distributed as a sample size ๐‘› โ†’ โˆž
  • More samples provide more confidence (or less uncertainty)
  • Note: true regardless of any distributions of population

11

slide-12
SLIDE 12

Uniform Distribution: ๐’š~๐‘ฝ ๐Ÿ, ๐Ÿ

12

slide-13
SLIDE 13

Sample Size

13

slide-14
SLIDE 14

Variance Gets Smaller as ๐’ is Larger

  • Seems approximately Gaussian distributed
  • Numerically demonstrate that sample mean follows Gaussian distribution

14

slide-15
SLIDE 15

Multivariate Statistics

  • ๐‘› observations ๐‘ฆ ๐‘— , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘›

15

slide-16
SLIDE 16

Correlation of Two Random Variables

  • Correlation

โ€“ Strength of linear relationship between two variables, ๐‘ฆ and ๐‘ง

16

slide-17
SLIDE 17

Correlation of Two Random Variables

  • Assume

17

slide-18
SLIDE 18

Correlation Coefficient

  • +1 โ†’ close to a straight line
  • โˆ’1 โ†’ close to a straight line
  • Indicate how close to a linear line, but
  • No information on slope
  • Does not tell anything about causality

18

slide-19
SLIDE 19

Correlation Coefficient

19

slide-20
SLIDE 20

Correlation Coefficient

20

slide-21
SLIDE 21

Correlation Coefficient Plot

  • Plots correlation coefficients among pairs of variables
  • http://rpsychologist.com/d3/correlation/

21

slide-22
SLIDE 22

Covariance Matrix

22