Automatic classification of galaxy morphology with ZEST+. Mariano - - PowerPoint PPT Presentation

automatic classification of galaxy morphology with zest
SMART_READER_LITE
LIVE PREVIEW

Automatic classification of galaxy morphology with ZEST+. Mariano - - PowerPoint PPT Presentation

Automatic classification of galaxy morphology with ZEST+. Mariano Ciccolini Institut fr Astronomie, ETHZ PSI-LTP Theory seminar ETH Zrich - IfA Automatic classification of galaxy morphology with ZEST+. Mariano Ciccolini Institut fr


slide-1
SLIDE 1

Automatic classification of galaxy morphology with ZEST+. Mariano Ciccolini

Institut für Astronomie, ETHZ

PSI-LTP Theory seminar ETH Zürich - IfA

slide-2
SLIDE 2

Automatic classification of galaxy morphology with ZEST+. Mariano Ciccolini

Institut für Astronomie, ETHZ

PSI-LTP Theory seminar ETH Zürich - IfA

Galaxies and their morphologies Support Vector Machines ZEST+

slide-3
SLIDE 3

PSI-LTP Theory seminar ETH Zürich - IfA 2

M109: A barred spiral galaxy

Dale Swanson/Adam Block/NOAO/AURA/NSF

Type: SB(rs)bc

slide-4
SLIDE 4

Early, purely descriptive systems

  • W. Herschel & J. Herschel (1780 -1860)
  • very faint
  • Faint
  • Bright
  • small (with definite borders)
  • exceedingly large
  • J. Dreyer (1888)
  • brightness
  • form
  • concentration

M. Wolf (1908)

  • 17 classes (g) - (w)

PSI-LTP Theory seminar ETH Zürich - IfA 3

slide-5
SLIDE 5

Early, purely descriptive systems

  • W. Herschel & J. Herschel (1780 -1860)
  • very faint
  • Faint
  • Bright
  • small (with definite borders)
  • exceedingly large
  • J. Dreyer (1888)
  • brightness
  • form
  • concentration

M. Wolf (1908)

  • 17 classes (g) - (w)

T.A.Rector and B.A.Wolpa/NOAO/AURA/NSF !!!eeB eL vmE

PSI-LTP Theory seminar ETH Zürich - IfA 3

slide-6
SLIDE 6

Early, purely descriptive systems

  • W. Herschel & J. Herschel (1780 -1860)
  • very faint
  • Faint
  • Bright
  • small (with definite borders)
  • exceedingly large
  • J. Dreyer (1888)
  • brightness
  • form
  • concentration

M. Wolf (1908)

  • 17 classes (g) - (w)

!eB eL R vgbM N Bill Schoening/NOAO/AURA/NSF

PSI-LTP Theory seminar ETH Zürich - IfA 3

slide-7
SLIDE 7

Early, purely descriptive systems

  • W. Herschel & J. Herschel (1780 -1860)
  • very faint
  • Faint
  • Bright
  • small (with definite borders)
  • exceedingly large
  • J. Dreyer (1888)
  • brightness
  • form
  • concentration

M. Wolf (1908)

  • 17 classes (g) - (w)

“Es gibt keine zwei Nebelflecken am Himmel, die sich gleichen. Trotzdem geht das Bestreben der Beobachter seit Herschel dahin, die kleinen Nebelflecken zu klassifizieren.”

PSI-LTP Theory seminar ETH Zürich - IfA 3

slide-8
SLIDE 8

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-9
SLIDE 9

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-10
SLIDE 10

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-11
SLIDE 11

Credit: STScI

NGC 1132 E5 : n = 10 (1-b/a)

Credit: NASA, ESA, and The Hubble Heritage Team (STScI/AURA)

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-12
SLIDE 12

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-13
SLIDE 13

Credit: STScI

M51 SAbc

Credit: NASA, ESA, and The Hubble Heritage Team (STScI/AURA)

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-14
SLIDE 14

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-15
SLIDE 15

Credit: STScI

NGC 1300

Credit: NASA, ESA, and The Hubble Heritage Team (STScI/AURA)

SB(s)bc

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-16
SLIDE 16

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-17
SLIDE 17

Credit: STScI

NGC 5866

Credit: NASA, ESA, and The Hubble Heritage Team (STScI/AURA)

S0

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-18
SLIDE 18

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-19
SLIDE 19

Credit: STScI

M82 I0

Credit: NASA, ESA, and The Hubble Heritage Team (STScI/AURA)

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-20
SLIDE 20

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

  • E. Hubble (1936): Tuning fork diagram
slide-21
SLIDE 21

Credit: STScI

PSI-LTP Theory seminar ETH Zürich - IfA 4

Early-type Late-type

  • E. Hubble (1936): Tuning fork diagram
slide-22
SLIDE 22

PSI-LTP Theory seminar ETH Zürich - IfA 5

Why classify?

“The ultimate purpose of the classification is to understand galaxy formation and evolution.” A. Sandage

HEP example: Eight-fold way Quark model & QCD Hadrons classification scheme

slide-23
SLIDE 23

PSI-LTP Theory seminar ETH Zürich - IfA 6

Sérsic profile:

Parametric approach

Light decomposition: Fit a 1D/2D model to surface brightness profile Other approaches:

  • Isophote fitting
  • Fourier decomposition
  • Shapelets

n=1 : Exponential profile n=4 : de Vaucouleurs profile

  • I(R) = I4(R) + I1(R)

⇒ B D =

0 I4(R)

0 I1(R)

I(R) = Ie exp

  • −bn

R

Re

1/n

− 1

  • Re

I(R) = 1 2

I(R) ⇒ Γ(2n) = 2γ(2n, bn)

slide-24
SLIDE 24

PSI-LTP Theory seminar ETH Zürich - IfA 7

Parametric approach limitations

Redshift Lookback time (Gy) Morphology

z < 0.3 < ~3.5 Hubble scheme applies in full detail. z ~ 0.5 ~5 Barred spirals rare Underdeveloped arms z > 0.6 ~6 Mergers and Irr increase z~1 : 30% off HS

slide-25
SLIDE 25

PSI-LTP Theory seminar ETH Zürich - IfA 7

Parametric approach limitations

Nearby galaxy (z ~ 0.001) Distant galaxy (z ~ 1.2)

COSMOS HST

  • ACS survey

Hillary Mathis, N.A.Sharp/NOAO/AURA/NSF

M94

slide-26
SLIDE 26

PSI-LTP Theory seminar ETH Zürich - IfA 8

Non-parametric approach: Concentration

r20 :

  • i:r(i)≤r20

Ii < 0.20 Itot r80 :

  • i:r(i)≤r80

Ii < 0.80 Itot C = log

r20

r80

slide-27
SLIDE 27

PSI-LTP Theory seminar ETH Zürich - IfA 8

Non-parametric approach: Concentration

r20 :

  • i:r(i)≤r20

Ii < 0.20 Itot r80 :

  • i:r(i)≤r80

Ii < 0.80 Itot C = log

r20

r80

slide-28
SLIDE 28

PSI-LTP Theory seminar ETH Zürich - IfA 9

Non-parametric approach: Asymmetry

A0 = 1 2

  • ij |I(i, j) − I180(i, j)|
  • ij |I(i, j)|

, A = A0 − Abkg

I( i , j ) I180( i , j ) I( i , j ) - I180( i , j )

slide-29
SLIDE 29

PSI-LTP Theory seminar ETH Zürich - IfA 9

Non-parametric approach: Asymmetry

A0 = 1 2

  • ij |I(i, j) − I180(i, j)|
  • ij |I(i, j)|

, A = A0 − Abkg

slide-30
SLIDE 30

PSI-LTP Theory seminar ETH Zürich - IfA 10

Non-parametric approach: Clumpiness

I( i , j ) IS( i , j ) I( i , j ) - IS( i , j )

S0 = 1 2

ij I(i, j) − IS(i, j)

  • ij |I(i, j)|

, S = S0 − Sbkg

slide-31
SLIDE 31

PSI-LTP Theory seminar ETH Zürich - IfA 10

Non-parametric approach: Clumpiness

S0 = 1 2

ij I(i, j) − IS(i, j)

  • ij |I(i, j)|

, S = S0 − Sbkg

slide-32
SLIDE 32

PSI-LTP Theory seminar ETH Zürich - IfA 11

Non-parametric approach: Gini coefficient

G = 1 2I N (N − 1)

N

  • i=1

N

  • j=1

|Ii − Ij|

slide-33
SLIDE 33

PSI-LTP Theory seminar ETH Zürich - IfA 12

Non-parametric approach: M20

xc =

  • i∈E xiIi
  • i∈E Ii

yc =

  • i∈E yiIi
  • i∈E Ii

Mi = Ii[(xi − xc)2 + (yi − yc)2] Mtot =

  • i∈E

Mi

slide-34
SLIDE 34

PSI-LTP Theory seminar ETH Zürich - IfA 12

Non-parametric approach: M20

xc =

  • i∈E xiIi
  • i∈E Ii

yc =

  • i∈E yiIi
  • i∈E Ii

M20 = log

  • i∈E20 Mi

Mtot

slide-35
SLIDE 35

PSI-LTP Theory seminar ETH Zürich - IfA 13

Non-parametric approach: Towards automatic classification

Can these results be generalized?

  • Parameter space dimensionality.
  • Automatic, non-linear, region detection.
  • Number of classes.
  • Prediction.
  • R. Abraham et al. (1996):

C-A plane separation

  • C. J. Concelice (2003):

C-A-S volume separation

  • J. Lotz et al. (2004):

C-A and G-M20 separation

slide-36
SLIDE 36

PSI-LTP Theory seminar ETH Zürich - IfA 13

Non-parametric approach: Towards automatic classification

Can these results be generalized?

  • Parameter space dimensionality.
  • Automatic, non-linear, region detection.
  • Number of classes.
  • Prediction.
  • R. Abraham et al. (1996):

C-A plane separation

  • C. J. Concelice (2003):

C-A-S volume separation

  • J. Lotz et al. (2004):

C-A and G-M20 separation

slide-37
SLIDE 37

PSI-LTP Theory seminar ETH Zürich - IfA 13

Non-parametric approach: Towards automatic classification

Can these results be generalized?

  • Parameter space dimensionality.
  • Automatic, non-linear, region detection.
  • Number of classes.
  • Prediction.
  • R. Abraham et al. (1996):

C-A plane separation

  • C. J. Concelice (2003):

C-A-S volume separation

  • J. Lotz et al. (2004):

C-A and G-M20 separation

slide-38
SLIDE 38

PSI-LTP Theory seminar ETH Zürich - IfA 14

Pattern Recognition: Supervised Learning

Patterns Equations & Graphics rate Label / Target Astronomer: +1 Particle Physicist:

  • 1

Data = Pattern + Label Algorithms:

  • Principal component analysis
  • Artificial neural networks
  • κ-nearest neighbours
  • Support

Vector Machines

Toy example:

slide-39
SLIDE 39

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

Training sample

Patterns (x1 , x2) Label (y) Equations Images 0.05 0.23 +1 0.08 0.33 +1 0.10 0.30 +1 0.13 0.20 +1 0.25 0.05

  • 1

0.23 0.18

  • 1

0.33 0.15

  • 1

0.33 0.05

  • 1

… … …

slide-40
SLIDE 40

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

slide-41
SLIDE 41

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

?

slide-42
SLIDE 42

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

?

slide-43
SLIDE 43

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

?

slide-44
SLIDE 44

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

?

slide-45
SLIDE 45

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 15

Pattern Recognition: Supervised Learning

?

x ∈ Π ⇔ w · x + b = 0 f(x) = sgn (w · x + b) (xi, yi) in training sample ⇒ yi = f(xi)

slide-46
SLIDE 46

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-47
SLIDE 47

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-48
SLIDE 48

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-49
SLIDE 49

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-50
SLIDE 50

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-51
SLIDE 51

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-52
SLIDE 52

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-53
SLIDE 53

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-54
SLIDE 54

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

slide-55
SLIDE 55

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

δ

slide-56
SLIDE 56

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 16

Pattern Recognition: Separating hyperplanes

δ

x ∈ Π ⇔ w · x + b = 0 x ∈ Π± ⇔ w · x + b = ±1 ⇒ Margin : δ = 2 w f(x) = sgn (w · x + b) (xi, yi) in training sample ⇒ yi = f(xi) w · xi + b ≤ −1 i : hep w · xj + b ≥ +1 j : astro

slide-57
SLIDE 57

PSI-LTP Theory seminar ETH Zürich - IfA 17

Optimal hyperplane: Formal approach

x ∈ Π ⇔ w · x + b = 0 x ∈ Π± ⇔ w · x + b = ±1 ⇒ Margin : δ = 2 w f(x) = sgn (w · x + b) (xi, yi) in training sample ⇒ yi = f(xi) w · xi + b ≤ −1 i : hep w · xj + b ≥ +1 j : astro

(x1, y1), . . . , (xm, ym) ∈ X × {±1} with X ⊂ ℜN (xi, yi) ← → P(x, y) f : X → {±1} : f(xi) = yi P(x, y) − → (x, y) ∈ X ⇒ f(x) = y

slide-58
SLIDE 58

PSI-LTP Theory seminar ETH Zürich - IfA 18

Optimal hyperplane: Formal approach

minimize τ(w) = 1 2w2 s.t. yi ((w · xi) + b) ≥ 1, i = 1, . . . , m αi ≥ 0 i = 1, . . . , m L(w, b, α) = 1 2w2 −

m

  • i=1

αi(yi ((w · xi) + b) − 1)

m

  • i=1

αi yi = 0, w =

m

  • i=1

αi yi xi ≤ αi ≤ (yi (w · xi + b) − 1) = αi (yi (w · xi + b) − 1) f(x) = sgn m

  • i=1

yi αi(x · xi) + b

  • Karush-Kuhn-Tucker
slide-59
SLIDE 59

PSI-LTP Theory seminar ETH Zürich - IfA 19

maximize W(α) =

m

  • i=1

αi − 1 2

m

  • i,j=1

αi αj yi yj (xi · xj) s.t. αi ≥ 0 i = 1, . . . , m and

m

  • i=1

αiyi = 0 f(x) = sgn m

  • i=1

yi αi(x · xi) + b

  • b

= 1 −

m

  • i=1

αi yi (xi · xj) with αj = 0

Wolfe dual problem

Optimal hyperplane: Formal approach

slide-60
SLIDE 60

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 20

Optimal hyperplane: Non-linear separable data

slide-61
SLIDE 61

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 20

Optimal hyperplane: Non-linear separable data

slide-62
SLIDE 62

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 20

Optimal hyperplane: Non-linear separable data

ξi ξj

slide-63
SLIDE 63

0.1 0.2 0.3 0.4 0.5 Equations rate 0.1 0.2 0.3 0.4 0.5 Graphics rate

hep astro

PSI-LTP Theory seminar ETH Zürich - IfA 20

Optimal hyperplane: Non-linear separable data

ξi ξj

w · xi + b ≤ −1 i : hep w · xj + b ≥ +1 j : astro yi (w · xi + b) ≥ +1 ξi ≥ 0 i = 1 . . . m yi (w · xi + b) ≥ 1 − ξi

slide-64
SLIDE 64

PSI-LTP Theory seminar ETH Zürich - IfA 21

minimize τ(w) = 1 2w2 + C

m

  • i=1

ξi s.t. ξ ≥ 0 and yi (w · xi + b) ≥ 1 − ξi, i = 1, . . . , m maximize W(α) =

m

  • i=1

αi − 1 2

m

  • i,j=1

αi αj yi yj (xi · xj) s.t. 0 ≤ αi ≤ C i = 1, . . . , m and

m

  • i=1

αiyi = 0 f(x) = sgn m

  • i=1

yi αi(x · xi) + b

  • b

= 1 −

m

  • i=1

αi yi (xi · xj) with 0 < αj < C ⇒ ξj = 0

Optimal hyperplane: Non-linear separable data

slide-65
SLIDE 65

PSI-LTP Theory seminar ETH Zürich - IfA 22 x 1 x 2 x 1 x 2

~ ~

Φ

Support vector machines: Mapping data

slide-66
SLIDE 66

PSI-LTP Theory seminar ETH Zürich - IfA 23

Support Vector Machines: Kernel methods

(x1, y1), . . . , (xm, ym) ∈ X × {±1} with X ⊂ ℜN (xi, yi) ← → P(x, y) f : X → {±1} : f(xi) = yi P(x, y) − → (x, y) ∈ X ⇒ f(x) = y k : X × X − → ℜ (x, x′) − → k(x, x′) If x, x′ ∈ ℜN ⇒ k(x, x′) = x · x′ Φ : X − → F ⊂ ℜN x − → x k(x, x′) ≡ x · x′ = Φ(x) · Φ(x′)

slide-67
SLIDE 67

PSI-LTP Theory seminar ETH Zürich - IfA 23

Support Vector Machines: Kernel methods

(x1, y1), . . . , (xm, ym) ∈ X × {±1} with X ⊂ ℜN (xi, yi) ← → P(x, y) f : X → {±1} : f(xi) = yi P(x, y) − → (x, y) ∈ X ⇒ f(x) = y k : X × X − → ℜ (x, x′) − → k(x, x′) If x, x′ ∈ ℜN ⇒ k(x, x′) = x · x′ Φ : X − → F ⊂ ℜN x − → x k(x, x′) ≡ x · x′ = Φ(x) · Φ(x′)

slide-68
SLIDE 68

PSI-LTP Theory seminar ETH Zürich - IfA 23

Support Vector Machines: Kernel methods

(x1, y1), . . . , (xm, ym) ∈ X × {±1} with X ⊂ ℜN (xi, yi) ← → P(x, y) f : X → {±1} : f(xi) = yi P(x, y) − → (x, y) ∈ X ⇒ f(x) = y k : X × X − → ℜ (x, x′) − → k(x, x′) If x, x′ ∈ ℜN ⇒ k(x, x′) = x · x′ Φ : X − → F ⊂ ℜN x − → x k(x, x′) ≡ x · x′ = Φ(x) · Φ(x′) maximize W(α) =

m

  • i=1

αi − 1 2

m

  • i,j=1

αi αj yi yj (xi · xj) s.t. 0 ≤ αi ≤ C i = 1, . . . , m and

m

  • i=1

αiyi = 0 f(x) = sgn m

  • i=1

yi αi(x · xi) + b

  • b

= 1 −

m

  • i=1

αi yi (xi · xj) with 0 < αj < C ⇒ ξj = 0

slide-69
SLIDE 69

PSI-LTP Theory seminar ETH Zürich - IfA 23

Support Vector Machines: Kernel methods

(x1, y1), . . . , (xm, ym) ∈ X × {±1} with X ⊂ ℜN (xi, yi) ← → P(x, y) f : X → {±1} : f(xi) = yi P(x, y) − → (x, y) ∈ X ⇒ f(x) = y k : X × X − → ℜ (x, x′) − → k(x, x′) If x, x′ ∈ ℜN ⇒ k(x, x′) = x · x′ Φ : X − → F ⊂ ℜN x − → x k(x, x′) ≡ x · x′ = Φ(x) · Φ(x′) maximize W(α) =

m

  • i=1

αi − 1 2

m

  • i,j=1

αi αj yi yj (xi · xj) s.t. 0 ≤ αi ≤ C i = 1, . . . , m and

m

  • i=1

αiyi = 0 f(x) = sgn m

  • i=1

yi αi(x · xi) + b

  • b

= 1 −

m

  • i=1

αi yi (xi · xj) with 0 < αj < C ⇒ ξj = 0

slide-70
SLIDE 70

PSI-LTP Theory seminar ETH Zürich - IfA 24

linear : K(xi, xj) = xi · xj RBF : K(xi, xj) = exp

  • −γ xi − xj2

polynomial : K(xi, xj) = (γ xi · xj + r0)d sigmoid : K(xi, xj) = tanh (γ xi · xj + r0)

Support Vector Machines: Kernel methods

slide-71
SLIDE 71

PSI-LTP Theory seminar ETH Zürich - IfA 24

linear : K(xi, xj) = xi · xj RBF : K(xi, xj) = exp

  • −γ xi − xj2

polynomial : K(xi, xj) = (γ xi · xj + r0)d sigmoid : K(xi, xj) = tanh (γ xi · xj + r0)

Support Vector Machines: Kernel methods

slide-72
SLIDE 72

PSI-LTP Theory seminar ETH Zürich - IfA 24

linear : K(xi, xj) = xi · xj RBF : K(xi, xj) = exp

  • −γ xi − xj2

polynomial : K(xi, xj) = (γ xi · xj + r0)d sigmoid : K(xi, xj) = tanh (γ xi · xj + r0)

Support Vector Machines: Kernel methods

slide-73
SLIDE 73

PSI-LTP Theory seminar ETH Zürich - IfA 24

linear : K(xi, xj) = xi · xj RBF : K(xi, xj) = exp

  • −γ xi − xj2

polynomial : K(xi, xj) = (γ xi · xj + r0)d sigmoid : K(xi, xj) = tanh (γ xi · xj + r0)

Support Vector Machines: Kernel methods

slide-74
SLIDE 74

PSI-LTP Theory seminar ETH Zürich - IfA 25

Remp [ f ] = 1 m

m

  • i=1

1 2|f(xi) − yi| , R [ f ] = 1 2|f(x) − y|dP(x, y) R [ f ] ≤ Remp [ f ] + φ h m, log(η) m

  • φ

h m, log(η) m

  • =
  • h
  • log 2m

h + 1

  • − log(η/4)

m

Capacity:

Support Vector Machines: Error constraints

slide-75
SLIDE 75

PSI-LTP Theory seminar ETH Zürich - IfA 26

2-class SVM generalization to k classes.

Multiclass Support Vector Machines

slide-76
SLIDE 76

PSI-LTP Theory seminar ETH Zürich - IfA 26

  • ne-versus-one:
  • Train k (k+1)/2 SVM
  • Classification: Poll

2-class SVM generalization to k classes.

Multiclass Support Vector Machines

slide-77
SLIDE 77

PSI-LTP Theory seminar ETH Zürich - IfA 26

  • ne-versus-all:
  • Train k SVM
  • Classification: max{d.f.}

2-class SVM generalization to k classes.

Multiclass Support Vector Machines

slide-78
SLIDE 78

PSI-LTP Theory seminar ETH Zürich - IfA 27

Support Vector Machines in HEP

A. Vaiciulis, SVM in analysis of Top quark production (Nucl. Instrum. Meth. A502 (2003) 492) P . Vannerem et al., Classifying LEP data with support vector algorithms (hep-ex/9905027) TMVA: Toolkit for Multivariate Analysis In e+ e- → q qbar:

  • Charm-tagging.
  • Muon identification.

ANN and SVM give consistent results Signal and background efficiency consistent with best set of cuts Integrated machine learning environment for CERN’s ROOT http://tmva.sourceforge.net

slide-79
SLIDE 79

PSI-LTP Theory seminar ETH Zürich - IfA 28

Non-parametric coefficients: C, A, S... Support Vector Machines: M = M(C, A, S, G, M20, . . .) Class M Early 1 Late 2 Irregular 3

Very large catalogue Training ? ? ? ?

SVM

Summary

slide-80
SLIDE 80

PSI-LTP Theory seminar ETH Zürich - IfA 29

The evolution of ZEST+ General Approach Details Applications: COSMOS Challenges ahead

ZEST+: The Zurich estimator of structural types

slide-81
SLIDE 81

PSI-LTP Theory seminar ETH Zürich - IfA 30

The evolution of ZEST+

ZEST+:

  • First C++ version, without classification (E. Weihs).
  • Further modifications (T. Bschorr).
  • Complete rewrite, new features, SVM classification (M.C.)

ZEST: C. Scarlata & M. Carollo (2007)

  • C, A, G, M20, ε, and Sérsic n.
  • PCA analysis: 3D classification grid.
  • IDL application, no public release.
slide-82
SLIDE 82

PSI-LTP Theory seminar ETH Zürich - IfA 31

ZEST+ Architecture

Initialisation Pre-processing Characterisation Classification Catalogue Coefficients Morphologies

slide-83
SLIDE 83

PSI-LTP Theory seminar ETH Zürich - IfA 32

ZEST+: Pre-processing

Pre-processing Image cleaning Segmentation refinement Basic segmentation

slide-84
SLIDE 84

PSI-LTP Theory seminar ETH Zürich - IfA 32

ZEST+: Pre-processing

Pre-processing Image cleaning Segmentation refinement Basic segmentation Basic segmentation

slide-85
SLIDE 85

PSI-LTP Theory seminar ETH Zürich - IfA 32

ZEST+: Pre-processing

Pre-processing Image cleaning Segmentation refinement Basic segmentation Image cleaning

slide-86
SLIDE 86

PSI-LTP Theory seminar ETH Zürich - IfA 32

ZEST+: Pre-processing

Pre-processing Image cleaning Segmentation refinement Basic segmentation Segmentation refinement

slide-87
SLIDE 87

ZEST+: Segmentation refinement

PSI-LTP Theory seminar ETH Zürich - IfA 33

Galaxy’s center: Center of asymmetry Galaxy’s size: Petrosian radius

η(R) = 2π

R

0 I(R′)dR′

πR2I(R) 1 η(Rαp) = α Rp ≡ R0.2

p

slide-88
SLIDE 88

Save results

PSI-LTP Theory seminar ETH Zürich - IfA 34

ZEST+: Characterisation

Characterisation Substructure analysis Diagnostics Structure analysis

slide-89
SLIDE 89

Save results

PSI-LTP Theory seminar ETH Zürich - IfA 34

ZEST+: Characterisation

Characterisation Substructure analysis Diagnostics Structure analysis Center iterations Rp roots Background Signal-to-noise Negative pixels Contamination

slide-90
SLIDE 90

Save results

PSI-LTP Theory seminar ETH Zürich - IfA 34

ZEST+: Characterisation

Characterisation Substructure analysis Diagnostics Structure analysis C, A, S, G, M20

slide-91
SLIDE 91

Save results

PSI-LTP Theory seminar ETH Zürich - IfA 34

ZEST+: Characterisation

Characterisation Substructure analysis Diagnostics Structure analysis Selection ε, C, A, G, M20

slide-92
SLIDE 92

Save results

PSI-LTP Theory seminar ETH Zürich - IfA 34

ZEST+: Characterisation

Characterisation Substructure analysis Diagnostics Structure analysis

Id … C A S M20 G

ε

C A G M20 Err.

slide-93
SLIDE 93

PSI-LTP Theory seminar ETH Zürich - IfA 35

ZEST+: Coefficients revisited

Differences from ideal case:

  • Negative pixels
  • Low signal-to-noise ratio
  • Background artefacts

A = A0 − Abkg S = S0 − Sbkg G = G(Ij) → G(|Ij|)

slide-94
SLIDE 94

PSI-LTP Theory seminar ETH Zürich - IfA 35

ZEST+: Coefficients revisited

Differences from ideal case:

  • Negative pixels
  • Low signal-to-noise ratio
  • Background artefacts

A = A0 − Abkg S = S0 − Sbkg G = G(Ij) → G(|Ij|)

slide-95
SLIDE 95

PSI-LTP Theory seminar ETH Zürich - IfA 35

ZEST+: Coefficients revisited

Differences from ideal case:

  • Negative pixels
  • Low signal-to-noise ratio
  • Background artefacts

A = A0 − Abkg S = S0 − Sbkg G = G(Ij) → G(|Ij|)

slide-96
SLIDE 96

PSI-LTP Theory seminar ETH Zürich - IfA 36

ZEST+: Substructure analysis

  • 1. Self-subtract smoothed image.
  • 2. Threshold and eliminate isolated pixels.
  • 3. Measure all morphological coefficients.
slide-97
SLIDE 97

ZEST+: Classification

PSI-LTP Theory seminar ETH Zürich - IfA 37

  • SVM stand-alone C applications.
  • SVM library
  • Kernels: Linear, polynomial, RBF, sigmoid and user provided.
  • Multiclass algorithm: one-versus-one.
  • Supports SVM probabilities.

Algorithm: Support Vector Machines Implementation: libSVM-3.88 (C. Chang & C. Lin 2001)

slide-98
SLIDE 98

ZEST+: Classification

PSI-LTP Theory seminar ETH Zürich - IfA 38

libSVM Preprocessing Characterization Interface Catalogue C, A, S, G, M20 External Data (z, B/D,etc.) SVM info. Morphologies

slide-99
SLIDE 99

Recent ZEST+ applications

PSI-LTP Theory seminar ETH Zürich - IfA 39

  • Automatic preprocessing of large datasets.
  • Mask preparation to use with external fitting programs.
  • Petrosian radius calculation in simulated galaxies.
  • Morphological data calculation for a recent

VLT proposal.

  • Star-forming E/S0 galaxies study.
  • Substructure identification for tidal features study.
  • Morphological analysis in the COSMOS survey.
slide-100
SLIDE 100

PSI-LTP Theory seminar ETH Zürich - IfA 40

  • HST Treasury Project with ACS
  • Largest HST survey
  • 2 square degrees equatorial field
  • 2 million objects IAB > 27 mag
  • Up to z ~ 5

NASA

Objective Probe galaxy formation and evolution as a function of z and LSS environment

slide-101
SLIDE 101

PSI-LTP Theory seminar ETH Zürich - IfA 41

Image courtesy of ESA Subaru Telescope, NAOJ ESO/H.Zodet NASA/JPL-Caltech

Chandra, UKIRT, NOAO,CFHT, and others

NASA/JPL-Caltech Image courtesy of NRAO/AUI

slide-102
SLIDE 102

PSI-LTP Theory seminar ETH Zürich - IfA 42

The COSMOS field

slide-103
SLIDE 103

HST - ACS camea

PSI-LTP Theory seminar ETH Zürich - IfA 43

slide-104
SLIDE 104

HST - ACS camea

PSI-LTP Theory seminar ETH Zürich - IfA 43

slide-105
SLIDE 105

HST - ACS camea

PSI-LTP Theory seminar ETH Zürich - IfA 43

slide-106
SLIDE 106

HST - ACS camea

PSI-LTP Theory seminar ETH Zürich - IfA 43

Visual ~ 4000 Å - 7000 Å

7000 8000 9000 10000 ! [Å] 0.2 0.4 0.6 0.8 1

Filter Throughput

F814W

slide-107
SLIDE 107

PSI-LTP Theory seminar ETH Zürich - IfA 44

HST - ACS Data

slide-108
SLIDE 108

PSI-LTP Theory seminar ETH Zürich - IfA 44

HST - ACS Data

slide-109
SLIDE 109

PSI-LTP Theory seminar ETH Zürich - IfA 45

HST

  • ACS data: From the instrument to ZEST+
slide-110
SLIDE 110

PSI-LTP Theory seminar ETH Zürich - IfA 45

HST

  • ACS data: From the instrument to ZEST+
slide-111
SLIDE 111

PSI-LTP Theory seminar ETH Zürich - IfA 45

HST

  • ACS data: From the instrument to ZEST+
slide-112
SLIDE 112

PSI-LTP Theory seminar ETH Zürich - IfA 46

ZEST+ results

N ~ 120 000

  • IAB > 24
  • 1-b/a < 0.85
  • Stamp size < 1200 px.
slide-113
SLIDE 113

PSI-LTP Theory seminar ETH Zürich - IfA 46

ZEST+ results

slide-114
SLIDE 114

PSI-LTP Theory seminar ETH Zürich - IfA 47

Challenges: Cleaning errors tolerance

Bright nearby objects

slide-115
SLIDE 115

PSI-LTP Theory seminar ETH Zürich - IfA 47

Challenges: Cleaning errors tolerance

Object truncation

slide-116
SLIDE 116

PSI-LTP Theory seminar ETH Zürich - IfA 47

Challenges: Cleaning errors tolerance

Object truncation

slide-117
SLIDE 117

PSI-LTP Theory seminar ETH Zürich - IfA 47

Challenges: Cleaning errors tolerance

Background features

slide-118
SLIDE 118

PSI-LTP Theory seminar ETH Zürich - IfA 47

Challenges: Cleaning errors tolerance

Background features

slide-119
SLIDE 119

PSI-LTP Theory seminar ETH Zürich - IfA 48

Challenges: Point Spread Function

˜ I(x, y) = I(x, y) F(x, y) + n(x, y)

No PSF With PSF

slide-120
SLIDE 120

PSI-LTP Theory seminar ETH Zürich - IfA 48

Challenges: Point Spread Function

˜ I(x, y) = I(x, y) F(x, y) + n(x, y)

No PSF With PSF

slide-121
SLIDE 121

PSI-LTP Theory seminar ETH Zürich - IfA 48

Challenges: Point Spread Function

˜ I(x, y) = I(x, y) F(x, y) + n(x, y)

No PSF With PSF

slide-122
SLIDE 122

PSI-LTP Theory seminar ETH Zürich - IfA 49

Challenges: Point Spread Function

Problem 1: PSF effects

  • Parametric Approach
  • Richardson-Lucy Deconvolution
  • Simulations

Problem I1: PSF determination

  • Analytic PSF
  • PSF fitting
  • HST: TinyTim
  • Blind deconvolution

˜ I(x, y) = I(x, y) F(x, y) + n(x, y)

slide-123
SLIDE 123

PSI-LTP Theory seminar ETH Zürich - IfA 50

Conclusions

ZEST+:

  • uses a comprehensive set of non-parametric morphology descriptors.
  • analyses substructure.
  • uses a state-of-the-art classification algorithm.
  • supports the use of ad-hoc external data.
  • is flexible and fast.
  • is fully documented and its source is freely available.
  • is a portable, modular, standalone application written in C++.

More information:

  • http://www.astro.phys.ethz.ch
  • http://cosmos.astro.caltech.edu
  • http://www.kernel-machines.org
  • http://www.csie.ntu.edu/~cjlin/libsvm
slide-124
SLIDE 124

PSI-LTP Theory seminar ETH Zürich - IfA 51

Top quark SVM analysis (A. Vaiciulis, 2003)

(GeV)

T

Jet 1 E

50 100 150 200

0.02 0.04 0.06 0.08 0.1 0.12 0.14

top WW

(GeV)

T

max lepton E

50 100 150 200

0.02 0.04 0.06 0.08 0.1 0.12

top WW

(GeV)

T

sum jet E

100 200 300

0.05 0.1 0.15

top WW

(GeV)

T

missing E

100 200

0.02 0.04 0.06 0.08 0.1 0.12

top WW

background eff.

0.2 0.4 0.6 0.8 1

signal eff.

0.2 0.4 0.6 0.8 1 SVM cuts

t t

q q

b b

W+ W- l+ , q

!l , q, l- , q, !

– l , q –

slide-125
SLIDE 125

Revised Hubble system

PSI-LTP Theory seminar ETH Zürich - IfA 52

  • Narrowing of the bins
  • Extension: Sd, Im, Am
  • Arm attachment: (r) and (s)
  • Rings
slide-126
SLIDE 126

Revised Hubble system

PSI-LTP Theory seminar ETH Zürich - IfA 52

  • Narrowing of the bins
  • Extension: Sd, Im, Am
  • Arm attachment: (r) and (s)
  • Rings

Modern Notation E - S0 - S0a - Sab - Sb - Sbc - Sc - Scd - Sd - Sd/m - Sm - Im

slide-127
SLIDE 127

Revised Hubble system

PSI-LTP Theory seminar ETH Zürich - IfA 52

  • Narrowing of the bins
  • Extension: Sd, Im, Am
  • Arm attachment: (r) and (s)
  • Rings

NGC 2523: Sbr NGC 1300: Sbs

Kormendy, J. 1979a, Ap.J., 227, 714.

slide-128
SLIDE 128

Revised Hubble system

PSI-LTP Theory seminar ETH Zürich - IfA 52

  • Narrowing of the bins
  • Extension: Sd, Im, Am
  • Arm attachment: (r) and (s)
  • Rings

NGC 1291: RSb0

Kormendy, J. 1979a, Ap.J., 227, 714.

slide-129
SLIDE 129

PSI-LTP Theory seminar ETH Zürich - IfA 53

Visualization of the revised system

slide-130
SLIDE 130

PSI-LTP Theory seminar ETH Zürich - IfA 53

Visualization of the revised system

Hodge (1966)

slide-131
SLIDE 131

PSI-LTP Theory seminar ETH Zürich - IfA 53

Visualization of the revised system

de Vaucouleurs (1959)

slide-132
SLIDE 132

PSI-LTP Theory seminar ETH Zürich - IfA 53

Visualization of the revised system

de Vaucouleurs (1959)

slide-133
SLIDE 133

The Universe large scale structure Text

Volker Springel, Carlos S. Frenk and Simon D. M. White Nature 440, 1137-1144 (27 April 2006)

slide-134
SLIDE 134

PSI-LTP Theory seminar ETH Zürich - IfA 55

Galaxies: The hierarchical formation model

  • R. G. Abraham, Science 293, 1273 (2001)
slide-135
SLIDE 135

PSI-LTP Theory seminar ETH Zürich - IfA 55

Galaxies: The hierarchical formation model

  • R. G. Abraham, Science 293, 1273 (2001)