Two new approaches to smoothing over complex regions David Lawrence - - PowerPoint PPT Presentation

two new approaches to smoothing over complex regions
SMART_READER_LITE
LIVE PREVIEW

Two new approaches to smoothing over complex regions David Lawrence - - PowerPoint PPT Presentation

Two new approaches to smoothing over complex regions David Lawrence Miller Mathematical Sciences University of Bath useR! 2009, Rennes Outline Smoothing over complex regions Intro Solutions Schwarz-Christoffel transform Multidimensional


slide-1
SLIDE 1

Two new approaches to smoothing over complex regions

David Lawrence Miller

Mathematical Sciences University of Bath

useR! 2009, Rennes

slide-2
SLIDE 2

Outline

Smoothing over complex regions Intro Solutions Schwarz-Christoffel transform Multidimensional Scaling Details Simulation Results Conclusions

slide-3
SLIDE 3

Outline

Smoothing over complex regions Intro Solutions Schwarz-Christoffel transform Multidimensional Scaling Details Simulation Results Conclusions

slide-4
SLIDE 4

Smoothing in 2 dimensions

◮ Have some geographical region and wish to find out

something about the biological population in it.

◮ Response is eg. animal distribution, wish to predict based

  • n (x, y) and other covariates eg. habitat, size, sex, etc.

◮ This problem is relatively easy if the domain is simple.

slide-5
SLIDE 5

Smoothing over complex domains

◮ Smoothing of complex domains makes this a lot more

difficult.

◮ Problem of leakage. ◮ Euclidean distance doesn’t always make sense. ◮ Models need to incorporate information about the intrinsic

structure of the domain.

−4 − 3 . 5 − 3 −2.75 − 2 . 5 − 2 −1.75 − 1 . 5 −1 −0.75 −0.5 0.5 . 7 5 1 1 . 2 5 1 . 5 1 . 7 5 2 2 . 2 5 2 . 5 2 . 7 5 3 3 . 2 5 3 . 5 3.75 4 − 4 −3.75 −3.25 −3 − 2 . 7 5 −2.5 − 2 . 2 5 −2 −1.75 −1.25 −1 −0.75 −0.5 −0.25 0.25 0.5 0.75 . 7 5 1 1.25 1 . 5 1 . 7 5 2 2 . 2 5 2.5 2.75 3 3.25 3.75 4

(modified) Ramsay test function Thin plate spline fit

slide-6
SLIDE 6

Smoothing with penalties

◮ Objective function takes the form: n

  • i=1

(zi − f(xi, yi; θ))2 + λ

Pf(x, y; θ)dΩ

◮ f is the function you want to estimate, made up of some

combination of basis functions.

◮ P is some squared derivative penalty operator, usually

P = ( ∂2

∂x2 + ∂2 ∂y2 )2. ◮ This can be generalized to an additive model or GAM.

slide-7
SLIDE 7

Possible solutions to leakage problems

◮ FELSPLINE (Ramsay, (2002).) ◮ Domain morphing (Eilers, (2006).) ◮ Within-area distance (Wang and Ranalli, (2007).) ◮ Soap film smoothers (Wood et al, (2008).)

slide-8
SLIDE 8

Why morph the domain?

◮ Takes into account within-area distance. ◮ Gives a known domain that is easier to smooth over. ◮ Potentially less computationally intensive.

However:

◮ Don’t maintain isotropy - distribution of points odd. ◮ Not clear what this does to the smoothness penalty.

−2 2 −2 −1 1 2 3 4 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 1 −4 −2 2 4 1 2 3 4 5 6 7 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1617 18 1

slide-9
SLIDE 9

Outline

Smoothing over complex regions Intro Solutions Schwarz-Christoffel transform Multidimensional Scaling Details Simulation Results Conclusions

slide-10
SLIDE 10

The Schwarz-Christoffel transform

◮ Take a polygon in some domain W and morph it to a new

domain W ∗.

◮ We then have a function for the mapping, ϕ(x, y). ◮ ϕ(x, y) is a conformal mapping. ◮ Do this by starting at the new domain and working back to

the polygon.

◮ Can draw a polygonal bounding box around some arbitrary

shape.

φ(x) φ (x)

  • 1

W W*

slide-11
SLIDE 11

The mapping

◮ Use a bounding box around the horseshoe.

1 4 8 5 2 3 6 7

◮ Morphing the horseshoe shape still gives a slightly odd

domain however, we are still doing better than before.

slide-12
SLIDE 12

Horseshoe plots

Truth

−4 −3.75 −3.5 − 3 . 2 5 −3 − 2 . 7 5 −2.5 − 2 . 2 5 −2 − 1 . 7 5 −1.5 − 1 . 2 5 −1 − . 7 5 −0.5 −0.25 0.25 . 5 0.75 1 1 . 2 5 1.5 1 . 7 5 2 2 . 2 5 2.5 2 . 7 5 3 3 . 2 5 3.5 3 . 7 5 4

SC+PS

−4 − 3 . 7 5 − 3 . 5 −3.25 − 3 −2.75 −2.5 −2 −1.75 −1.5 −1.25 − 1 − . 7 5 − . 5 − . 2 5 0.25 0.5 0.75 1 1.25 1.5 1.75 2 2 . 2 5 2.5 2.75 3 3 . 2 5 3.5 3.75 4

SC+TPRS

− 4 − 3 . 7 5 − 3 . 5 −3.25 −3 − 2 . 7 5 −2.5 − 2 −1.75 −1.5 −1.25 − 1 −0.75 −0.5 . 2 5 0.5 0.75 1 1.25 1 . 5 1 . 7 5 2 2 . 2 5 2 . 5 2.75 3 3.25 3.5 3 . 7 5 4

Soap film

−4 −3.75 −3.5 −3.25 −3 −2.75 −2.5 − 2 −1.75 − 1 . 5 −1.25 −1 −0.75 −0.5 . 2 5 0.5 . 7 5 1 1.25 1.5 1.75 2 2 . 2 5 2.5 2.75 3 3 . 2 5 3.5 3.75 4

slide-13
SLIDE 13

Problems

◮ Small:

◮ Implementation is Matlab+R. (YUCK!)

◮ BIG:

◮ Weird artifacts. ◮ Morphing of domain appears to cause features to be

smoothed over.

◮ Arbitrary selection of vertices.

slide-14
SLIDE 14

A more realistic domain

0.0 0.4 0.8 0.0 0.4 0.8

truth

x y 0.0 0.4 0.8 0.0 0.4 0.8

sc+tprs prediction

x y 0.0 0.4 0.8 0.0 0.4 0.8

tprs prediction

x y 0.0 0.4 0.8 0.0 0.4 0.8

soap prediction

x y

slide-15
SLIDE 15

A more realistic domain - what’s happening?

◮ Weird “crowding” effect. ◮ Different with each vertex choice. All bad.

slide-16
SLIDE 16

Outline

Smoothing over complex regions Intro Solutions Schwarz-Christoffel transform Multidimensional Scaling Details Simulation Results Conclusions

slide-17
SLIDE 17

Multidimensional scaling and within-area distances

◮ Idea: use MDS to to arrange points in the domain

according to their “within-domain distance.” Scheme:

◮ First need to find the within-area distances. ◮ Perform MDS on the matrix of within-area distances. ◮ Smooth over the new points.

slide-18
SLIDE 18

Multidimensional scaling refresher

◮ Double centre matrix of between point distances, D,

(subtract row and column means) then find DDT.

◮ Finds a configuration of points such that Euclidean

distance between points in new arrangement is approximately the same as distance in the domain.

◮ Already implemented in R by cmdscale.

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3 x y

  • −4

−2 2 4 6 −1 1 2 3 newcoords[,1] newcoords[,2]

slide-19
SLIDE 19

Finding within-area distances

◮ Use a new algorithm to find the within area distances.

1 2 3 4 5 6 1 2 3 4 x y 1 2 3 4 5 6 1 2 3 4 x y 1 2 3 4 5 6 1 2 3 4 x y 1 2 3 4 5 6 1 2 3 4 x y

slide-20
SLIDE 20

Ramsay simulations

−1 1 2 3 −1.0 −0.5 0.0 0.5 1.0

truth

x y −1 1 2 3 −1.0 −0.5 0.0 0.5 1.0

MDS

x y −1 1 2 3 −1.0 −0.5 0.0 0.5 1.0

tprs

x y −1 1 2 3 −1.0 −0.5 0.0 0.5 1.0

soap

x y

slide-21
SLIDE 21

A different domain

−3 −2 −1 1 2 3 −2 −1 1 2

truth

x y −3 −2 −1 1 2 3 −2 −1 1 2

mds

x y −3 −2 −1 1 2 3 −2 −1 1 2

tprs

x y −3 −2 −1 1 2 3 −2 −1 1 2

soap

x y

slide-22
SLIDE 22

Outline

Smoothing over complex regions Intro Solutions Schwarz-Christoffel transform Multidimensional Scaling Details Simulation Results Conclusions

slide-23
SLIDE 23

Conclusions

◮ Seems that the S-C transform does not have much utility. ◮ MDS shows more promise, easier to transfer to higher

dimensions.

◮ MDS does not impose strict boundary conditions so

leakage still possible.

◮ Pushing the data into more dimensions might be useful to

separate points.

◮ After initial “transform” calculation, both methods only use

the same computational time as a thin plate regression

  • spline. (Soap is expensive.)
slide-24
SLIDE 24

References

◮ S.N. Wood, M.V. Bravington, and S.L. Hedley. Soap film

  • smoothing. JRSSB, 2008

◮ H. Wang and M.G. Ranalli. Low-rank smoothing splines on

complicated domains. Biometrics, 2007

◮ T.A. Driscoll and L.N. Trefethen. Schwarz-Christoffel

  • Mapping. Cambridge, 2002

◮ T. Ramsay. Spline smoothing over difficult regions. JRSSB,

2001

◮ P

.H.C. Eilers. P-spline smoothing on difficult domains. University of Munich seminar, 2006

◮ J.C. Gower. Adding a point to vector diagrams in

multivariate analysis. Biometrika, 1968. Slides available at http://people.bath.ac.uk/dlm27