Notion of mean point in the data Why bother about mean point? - - PowerPoint PPT Presentation

notion of mean point in the data why bother about mean
SMART_READER_LITE
LIVE PREVIEW

Notion of mean point in the data Why bother about mean point? - - PowerPoint PPT Presentation

Fundamentals of AI Introduction and the most basic concepts Notion of mean point in the data Why bother about mean point? Defining mean point can be considered as a simple application of unsupervised learning approach Calculating mean


slide-1
SLIDE 1

Introduction and the most basic concepts

Fundamentals of AI

Notion of mean point in the data

slide-2
SLIDE 2

Why bother about mean point?

  • Defining mean point can be considered as a simple

application of unsupervised learning approach

  • Calculating mean point is the extreme case of

dimensionality reduction: RN -> R0

  • In complex data spaces the definition of mean point is

non-trivial task

  • Definition of mean depends on the metrics of data space
  • General definition of mean leads to important

generalizations

slide-3
SLIDE 3

Notion of average (mean) point

Arithmetic mean

*ai can be vectors!

Geometric mean

* * arithmetic mean of logarithms

* **

Harmonic mean

slide-4
SLIDE 4

Notion of average (mean) point

  • In probability theory : ‘expected’ or ‘central’ value of the probability

distribution

  • The analytical formula depends on the type of probability distribution!
  • Can be non-existent
  • In geometrical approach: point m minimizing the mean squared distance

from all data points to m

  • this definition belongs to Maurice Fréchet (1878-1973)
  • depends on the metric structure of the

feature space

  • can be non-unique
slide-5
SLIDE 5

Notion of average (mean) point

  • In probability theory : ‘expected’ or ‘central’ value of the probability

distribution, first moment of the distribution

slide-6
SLIDE 6

Notion of average (mean) point

  • In geometrical approach: point m minimizing the mean squared

distance from all data points to m, ‘center of mass’

min

1 2 

 m i

point m

slide-7
SLIDE 7

Simple exercise: what is the mean point in Euclidean space?

slide-8
SLIDE 8

Simple exercise: what is the mean point in Euclidean space?

slide-9
SLIDE 9

Simple exercise: what is the mean point in Euclidean space?

slide-10
SLIDE 10

Simple exercise: what is the mean point in Euclidean space?

Arithmetic mean! 

slide-11
SLIDE 11

What is the mean point in L1 space?

slide-12
SLIDE 12

What is the mean point in L1 space?

slide-13
SLIDE 13

What is the mean point in L1 space?

slide-14
SLIDE 14

What is the mean point in L1 space?

slide-15
SLIDE 15

What is the mean point in L1 space?

This is definition of median value! Mean value in L1 space - medoid

slide-16
SLIDE 16

What is the mean point in L1 space?

This is definition of median value! Mean value in L1 space - medoid

For even number of data points, there is infinite number of L1- means

any point in this segment is L1-mean

slide-17
SLIDE 17

What is the mean point in L1 space?

For even number of data points, there is infinite number of L1- means

any point in this segment is L1-mean

Mean in Euclidean distance is unique For odd number of data points, L1-mean is also unique

L1-mean L2-mean

slide-18
SLIDE 18

Mean point on Rieman surface (e.g., sphere)

The distance is the length of the shortest path – of geodesics Formula still holds!

slide-19
SLIDE 19

Important generalizations of the mean point notion

  • Mean value = best approximation of the data point

cloud with single object of zero dimension (point)

  • Best approximation of the data point cloud with

multiple objects of zero dimension = k-means clustering (also called k principal points)

  • Best approximation of the data point cloud with

multiple objects of zero dimension in L1-space = k- medoids clustering

  • Best approximation of the data point cloud with

single object of dimension 1 = first principal component

min

1 2 

 m i