Novel Meshes for Multivariate Interpolation and Approximation - - PowerPoint PPT Presentation

novel meshes for multivariate interpolation and
SMART_READER_LITE
LIVE PREVIEW

Novel Meshes for Multivariate Interpolation and Approximation - - PowerPoint PPT Presentation

Novel Meshes for Multivariate Interpolation and Approximation Thomas C. H. Lux, Layne T. Watson, Tyler H. Chang, Jon Bernard, Bo Li, Xiaodong Yu, Li Xu, Godmar Back, Ali R. Butt, Kirk W. Cameron, Yili Hong, Danfeng Yao Virginia Polytechnic


slide-1
SLIDE 1

Novel Meshes for Multivariate Interpolation and Approximation

Thomas C. H. Lux, Layne T. Watson, Tyler H. Chang, Jon Bernard, Bo Li, Xiaodong Yu, Li Xu, Godmar Back, Ali R. Butt, Kirk W. Cameron, Yili Hong, Danfeng Yao

Virginia Polytechnic Institute and State University


 This work was supported by the National Science Foundation Grant CNS-1565314.

slide-2
SLIDE 2

Motivation

Regression and interpolation are problems of considerable importance that find applications across many fields of science.

!2

slide-3
SLIDE 3

Motivation

Regression and interpolation are problems of considerable importance that find applications across many fields of science. Pollution and air quality analysis
 Energy consumption management
 Student performance prediction

!2

slide-4
SLIDE 4

Motivation

Regression and interpolation are problems of considerable importance that find applications across many fields of science. Pollution and air quality analysis
 Energy consumption management
 Student performance prediction These techniques are applied here to:

!2

slide-5
SLIDE 5

Motivation

Regression and interpolation are problems of considerable importance that find applications across many fields of science. Pollution and air quality analysis
 Energy consumption management
 Student performance prediction These techniques are applied here to: High performance computing file input/output (HPC I/O)
 Parkinson's patient clinical evaluations 
 Forest fire risk assessment

!2

slide-6
SLIDE 6

Problem Formulation

Given underlying function f : ℝ

d ⇾ ℝ


data matrix X

n × d with row vectors x (i)

∈ℝ

d


response values f (x

(i)) for all x (i) 


matrix f (X) has rows f (x

(i))

!3

slide-7
SLIDE 7

Problem Formulation

Given underlying function f : ℝ

d ⇾ ℝ


data matrix X

n × d with row vectors x (i)

∈ℝ

d


response values f (x

(i)) for all x (i) 


matrix f (X) has rows f (x

(i))

Generate a function g: ℝ

d ⇾ ℝ such that:

!3

slide-8
SLIDE 8

Problem Formulation

Given underlying function f : ℝ

d ⇾ ℝ


data matrix X

n × d with row vectors x (i)

∈ℝ

d


response values f (x

(i)) for all x (i) 


matrix f (X) has rows f (x

(i))

Generate a function g: ℝ

d ⇾ ℝ such that:

Interpolation
 g(x

(i)) equals f (x (i)) for all x (i)

!3

slide-9
SLIDE 9

Problem Formulation

Given underlying function f : ℝ

d ⇾ ℝ


data matrix X

n × d with row vectors x (i)

∈ℝ

d


response values f (x

(i)) for all x (i) 


matrix f (X) has rows f (x

(i))

Generate a function g: ℝ

d ⇾ ℝ such that:

Interpolation
 g(x

(i)) equals f (x (i)) for all x (i)

Approximation
 g has parameters P and is the
 solution to minP ║ f (X) - g(X)║

!3

slide-10
SLIDE 10

Box Splines

Proposed by C. de Boor as an extension of B-Splines into multiple dimensions (without using tensor products).

!4

0.5 1.0 1.5 2.0 0.2 0.4 0.6 0.8 1.0

0.5 1.0 1.5 2.0 2.5 3.0 0.2 0.4 0.6

slide-11
SLIDE 11

Box Splines

Proposed by C. de Boor as an extension of B-Splines into multiple dimensions (without using tensor products).

!4

0.5 1.0 1.5 2.0 0.2 0.4 0.6 0.8 1.0

0.5 1.0 1.5 2.0 2.5 3.0 0.2 0.4 0.6


 
 
 Can be shifted and 
 scaled without losing smoothness.

slide-12
SLIDE 12

Box Splines

Proposed by C. de Boor as an extension of B-Splines into multiple dimensions (without using tensor products).

!4

0.5 1.0 1.5 2.0 0.2 0.4 0.6 0.8 1.0

0.5 1.0 1.5 2.0 2.5 3.0 0.2 0.4 0.6


 
 
 Can be shifted and 
 scaled without losing smoothness. Computationally tractable.

slide-13
SLIDE 13

Box Splines

Proposed by C. de Boor as an extension of B-Splines into multiple dimensions (without using tensor products).

!4

0.5 1.0 1.5 2.0 0.2 0.4 0.6 0.8 1.0

0.5 1.0 1.5 2.0 2.5 3.0 0.2 0.4 0.6


 
 
 Can be shifted and 
 scaled without losing smoothness. Computationally tractable.

slide-14
SLIDE 14

Max Box Mesh (MBM)

Properties Largest max norm distance from anchor to edge of support. Not always a covering for the space.
 Construction Complexity For each box (n) Distance to all (nd)
 For each dimension (2d) Sort distances (n log n)


퓞(n2d log n)

!5

slide-15
SLIDE 15

Iterative Box Mesh (IBM)

!6

Properties Built-in bootstrapping used to guide construction. Always a covering for the space by construction.
 Construction Complexity Until all points are added (n) Identify boxes containing new anchor to add (n) Shrink boxes containing new anchor along all dimensions (d)


퓞(n2 + nd)

slide-16
SLIDE 16

Voronoi Mesh (VM)

!7

Properties Naturally shaped geometric regions (not forcibly axis aligned) Always a covering for the space by construction. Construction Complexity

퓞(n2d)

Prediction Complexity For each cell anchor (n) For each other anchor,
 compute distance (nd)

퓞(n2d)

slide-17
SLIDE 17

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n.

!8

slide-18
SLIDE 18

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3).

!8

slide-19
SLIDE 19

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3). Bootstrapping

!8

slide-20
SLIDE 20

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3). Bootstrapping Initialize mesh only using the most central point

!8

slide-21
SLIDE 21

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3). Bootstrapping Initialize mesh only using the most central point Fit mesh and evaluate error at all other points

!8

slide-22
SLIDE 22

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3). Bootstrapping Initialize mesh only using the most central point Fit mesh and evaluate error at all other points Add (batch of) point(s) with largest error to mesh

!8

slide-23
SLIDE 23

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3). Bootstrapping Initialize mesh only using the most central point Fit mesh and evaluate error at all other points Add (batch of) point(s) with largest error to mesh If average error is not below error tolerance, repeat

!8

slide-24
SLIDE 24

Fitting and Bootstrapping

Fitting Evaluate all basis functions in the mesh at all points n. When “c” is the number of control points used for a mesh, using an (n × c) matrix A of basis function evaluations at all points, solve the least squares problem A x = f (X) with cost 퓞(nc2 + c3). Bootstrapping Initialize mesh only using the most central point Fit mesh and evaluate error at all other points Add (batch of) point(s) with largest error to mesh If average error is not below error tolerance, repeat Increased cost up to 퓞(n)

!8

slide-25
SLIDE 25

Testing and Evaluation: Data

!9

slide-26
SLIDE 26

Testing and Evaluation: Data

High Performance Computing File I/O
 n = 532, d = 4
 predicting file I/O throughput

!9

  • ×

× × × ×

slide-27
SLIDE 27

Testing and Evaluation: Data

High Performance Computing File I/O
 n = 532, d = 4
 predicting file I/O throughput

!9

  • ×

× × × ×


 Forest Fire
 n = 517, d = 12
 predicting area burned

slide-28
SLIDE 28

Testing and Evaluation: Data

High Performance Computing File I/O
 n = 532, d = 4
 predicting file I/O throughput

!9

  • ×

× × × ×


 Forest Fire
 n = 517, d = 12
 predicting area burned Parkinson’s Clinical Evaluation
 n = 468, d = 16
 predicting total clinical “UPDRS” score

slide-29
SLIDE 29

Time to Fit (y-axis)
 versus Error Tolerance (x-axis)

!10

  • /
slide-30
SLIDE 30

Time to Fit (y-axis)
 versus Error Tolerance (x-axis)

!10

  • /
  • Optimal Tolerance & Accuracy
slide-31
SLIDE 31

Average Relative Testing Error (y-axis)
 versus Relative Error Tolerance (x-axis)

!11

0.5 1.0 1.5 2.0

  • 2

2 4

Max Box Mesh on HPC I/O

0.5 1.0 1.5 2.0

  • 5

5 10

Iterative Box Mesh on HPC I/O

0.5 1.0 1.5 2.0 2 4 6 8 10 12

Voronoi Mesh on HPC I/O

0.5 1.0 1.5 2.0

  • 5

5 10 15

Max Box Mesh on Forest Fire

0.5 1.0 1.5 2.0 5 10 15

Iterative Box Mesh on Forest Fire

0.5 1.0 1.5 2.0

  • 40
  • 20

20

Voronoi Mesh on Forest Fire

0.5 1.0 1.5 2.0 0.5 1.0

Max Box Mesh on Parkinsons

0.5 1.0 1.5 2.0

  • 3
  • 2
  • 1

1 2 3

Iterative Box Mesh on Parkinsons

0.0 0.5 1.0 1.5 2.0 1.8 2.0 2.2 2.4 2.6 2.8 3.0

Voronoi Mesh on Parkinsons

slide-32
SLIDE 32

Histograms of Signed Relative Error

!12

  • /
  • /
  • /
slide-33
SLIDE 33

Summary of Contributions

Three techniques were proposed:
 Max Box Mesh, Iterative Box Mesh, and Voronoi Mesh Each is theoretically straightforward, flexible, and suitable for applications in many dimensions.

!13

slide-34
SLIDE 34

Summary of Contributions

Three techniques were proposed:
 Max Box Mesh, Iterative Box Mesh, and Voronoi Mesh Each is theoretically straightforward, flexible, and suitable for applications in many dimensions.

!13

Future Work

Alternative smoothing methods may be preferred that scale better with the number of data points. Further theoretical and empirical comparisons may be done against more widely used statistical / ML techniques.