MASSACHUSETTS GENERAL HOSPITAL RADIATION ONCOLOGY
Accelerating MI-Based B-Spline Registration Using CUDA Enabled GPUs
James Shackleford (1), Nagarajan Kandasamy (2), Gregory C. Sharp (1)
(1) Massachusetts General Hospital, Radiation Oncology
(2) Drexel University, Electrical and Computer Engineering

Slide 2 of 33
Introduction: What Is Deformable Registration?
[Figure: fixed image and moving image]

Slide 3 of 33
Introduction: What Is Deformable Registration?
[Figure: fixed image and moving image]

Slide 4 of 33
Introduction: What Is Deformable Registration?
[Figure: fixed image, deformation vector field, and moving image]

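Slide 5 of 33
B-Spline Grid Parameterization Method
[Figure: control-point grid showing the regional influence of one control point. Labels: parameter coefficients $P_X$, $P_Y$; parameter weights $\beta_X$, $\beta_Y$.]

Slide 6 of 33
$v_X = (\beta_X \beta_Y) P_X$
$v_Y = (\beta_X \beta_Y) P_Y$
[Figure: vector components $v_X$, $v_Y$ produced by one control point within its region of influence. Labels: parameter coefficients $P_X$, $P_Y$; parameter weights $\beta_X$, $\beta_Y$.]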

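Slide 7 of 33
16 Contributions
$v_X = \sum_{j=1}^{4} \sum_{i=1}^{4} (\beta_{X,i} \beta_{Y,j}) P_{X,i,j}$
$v_Y = \sum_{j=1}^{4} \sum_{i=1}^{4} (\beta_{X,i} \beta_{Y,j}) P_{Y,i,j}$
[Figure: 4 x 4 neighborhood of control points. Labels: parameter coefficients $P_X$, $P_Y$; parameter weights $\beta_X$, $\beta_Y$; regional influence.]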
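
The double sum on slide 7 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the slides do not spell out the basis functions, so the uniform cubic B-spline basis is assumed here (it is the usual choice that yields 4 x 4 = 16 contributions in 2D), and the names `bspline_basis` and `displacement` are hypothetical.

```python
def bspline_basis(u):
    """Uniform cubic B-spline weights for a normalized offset u in [0, 1).

    Assumption: the slides do not name the basis; cubic is the usual choice.
    """
    return [
        (1 - u) ** 3 / 6.0,
        (3 * u**3 - 6 * u**2 + 4) / 6.0,
        (-3 * u**3 + 3 * u**2 + 3 * u + 1) / 6.0,
        u**3 / 6.0,
    ]


def displacement(p_x, p_y, u, v):
    """Sum the 16 control-point contributions for one voxel.

    p_x[j][i], p_y[j][i]: 4x4 coefficients P_X and P_Y around the voxel.
    u, v: the voxel's normalized position inside its tile along x and y.
    Returns the deformation vector (v_x, v_y).
    """
    beta_x = bspline_basis(u)
    beta_y = bspline_basis(v)
    v_x = v_y = 0.0
    for j in range(4):
        for i in range(4):
            weight = beta_x[i] * beta_y[j]
            v_x += weight * p_x[j][i]
            v_y += weight * p_y[j][i]
    return v_x, v_y
```

Because the four basis weights sum to one, a grid whose coefficients are all equal produces that same constant displacement at every voxel, which makes a convenient sanity check.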

Slide 8 of 33
[Figure: registration loop over the fixed image F and moving image M. Stages: Decompress Vector Field (coefficients P to vectors v); Correspondence and Cost (F, M, v to cost C); Change in Cost w.r.t. Vectors ($\partial C / \partial v$); Change in Cost w.r.t. Coefficients ($\partial C / \partial P$); Quasi-Newtonian Optimizer (new P).]

Slide 9 of 33
[Figure: joint histogram with axes fixed image value (F) and moving image value (M); Venn diagram of H(F), H(M), H(F | M), H(M | F), and H(F,M), with C as the overlap]
$C = H(F) + H(M) - H(F,M)$
$C = \frac{1}{N} \sum_{j=1}^{B_M} \sum_{i=1}^{B_F} h_j(i,j) \ln \frac{N \times h_j(i,j)}{h_M(j) \, h_F(i)}$
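
As a rough illustration of the cost on slide 9, the sketch below evaluates C directly from unnormalized histograms. The function name and argument layout are assumptions made for this example; empty joint bins are skipped because their contribution tends to zero.

```python
import math


def mutual_information(h_joint, h_fixed, h_moving, n_voxels):
    """Evaluate C = (1/N) * sum h_j(i,j) * ln(N * h_j(i,j) / (h_M(j) * h_F(i))).

    h_joint[i][j]: joint histogram counts (i: fixed bin, j: moving bin).
    h_fixed[i], h_moving[j]: marginal histogram counts.
    n_voxels: N, the number of voxels that contributed to the histograms.
    """
    cost = 0.0
    for i, row in enumerate(h_joint):
        for j, h_ij in enumerate(row):
            if h_ij > 0:  # empty bins contribute nothing
                cost += h_ij * math.log(n_voxels * h_ij / (h_moving[j] * h_fixed[i]))
    return cost / n_voxels
```

With two bins per image and four voxels, statistically independent intensities give C = 0, while perfectly matched intensities give C = ln 2.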

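Slide 10 of 33
[Figure: a voxel of the static image F maps into the moving image M, where its four nearest neighbors (1 to 4) receive partial volumes (A to D). Histogram axes: intensity, # of voxels.]
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial h} \times \frac{\partial h}{\partial v} \times \frac{\partial v}{\partial P}$
$\frac{\partial C}{\partial v} = \sum_{n=1}^{4} \left. \frac{\partial C}{\partial h} \right|_{x_n} \times \left( \frac{\partial w_n}{\partial v} \right)$
$\left. \frac{\partial C}{\partial h} \right|_{x_n} = \ln \frac{N \times h_j(i_n, j_n)}{h_M(j_n) \times h_F(i_n)} - C$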
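
The partial volumes and the $\partial w_n / \partial v$ terms on slide 10 can be sketched for the 2D case drawn in the figure. This is a minimal sketch assuming bilinear (area-based) weights in voxel units; the neighbor ordering below is an assumption and need not match the 1 to 4 / A to D labeling of the figure, and `partial_volumes` is a hypothetical name.

```python
import math


def partial_volumes(x, y):
    """Split one voxel's contribution among its 4 nearest neighbors in 2D.

    (x, y): position in the moving image, in voxel units.
    Returns (neighbors, weights, dw_dx, dw_dy), where dw_dx and dw_dy are
    the derivatives of each weight with respect to the landing position.
    """
    x0, y0 = math.floor(x), math.floor(y)
    fx, fy = x - x0, y - y0  # fractional offsets inside the cell
    neighbors = [(x0, y0), (x0 + 1, y0), (x0, y0 + 1), (x0 + 1, y0 + 1)]
    weights = [(1 - fx) * (1 - fy), fx * (1 - fy), (1 - fx) * fy, fx * fy]
    dw_dx = [-(1 - fy), (1 - fy), -fy, fy]
    dw_dy = [-(1 - fx), -fx, (1 - fx), fx]
    return neighbors, weights, dw_dx, dw_dy
```

The weights always sum to one, so each voxel adds exactly one count to the moving and joint histograms, and the derivatives sum to zero because moving the landing point only redistributes that count among the neighbors.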

Slide 11 of 33
Serial Implementation: Following a Single Thread

Slide 12 of 33
Generate Histograms, for each voxel: compute vector; get corresponding voxels in moving image (nearest neighbors); compute partial volumes; use partial volumes for moving & joint histograms.
[Figure: voxel of F mapped into M with nearest neighbors 1 to 4 and partial volumes A to D; joint histogram with axes fixed image intensity and moving image intensity]

Slide 13 of 33
Generate Histograms: as on slide 12.
Compute Score: simply cycle through the histograms. A traditional serial CPU implementation is very fast (time required is negligible).
$C = \frac{1}{N} \sum_{j=1}^{B_M} \sum_{i=1}^{B_F} h_j(i,j) \ln \frac{N \times h_j(i,j)}{h_M(j) \, h_F(i)}$

Slide 14 of 33
Generate Histograms and Compute Score: as on slides 12 and 13.
Compute Gradient (change in cost as vector changes), for each voxel: get vector; get corresponding voxels in moving image (nearest neighbors); compute partial volume derivatives; NEXT.
$\frac{\partial C}{\partial v} = \sum_{n=1}^{4} \left. \frac{\partial C}{\partial h} \right|_{x_n} \times \left( \frac{\partial w_n}{\partial v} \right)$
$\left. \frac{\partial C}{\partial h} \right|_{x_n} = \ln \frac{N \times h_j(i_n, j_n)}{h_M(j_n) \times h_F(i_n)} - C$
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial v} \times \frac{\partial v}{\partial P}$

Slide 15 of 33
[Figure: registration loop, as on slide 8. Stages: Decompress Vector Field; Correspondence and Cost; Change in Cost w.r.t. Vectors ($\partial C / \partial v$); Change in Cost w.r.t. Coefficients ($\partial C / \partial P$); Quasi-Newtonian Optimizer (new P).]

Slide 16 of 33
Change in Cost w.r.t. Coefficients
[Figure: 4 x 4 grid of tiles numbered 1 to 16, with axes $\beta_X$ and $\beta_Y$, over the fixed image F]
$v_X = \sum_{j=1}^{4} \sum_{i=1}^{4} (\beta_{X,i} \beta_{Y,j}) P_{X,i,j}$
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial v} \frac{\partial v}{\partial P} = \frac{\partial C}{\partial v} \sum_{j=1}^{4} \sum_{i=1}^{4} \beta_{X,i} \beta_{Y,j}$

Slide 17 of 33
Change in Cost w.r.t. Coefficients
[Figure: 4 x 4 grid of tiles numbered 1 to 16, with axes $\beta_X$ and $\beta_Y$, over the fixed image F]
$v_X = \sum_{j=1}^{4} \sum_{i=1}^{4} (\beta_{X,i} \beta_{Y,j}) P_{X,i,j}$
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial v} \frac{\partial v}{\partial P} = \frac{\partial C}{\partial v} \sum_{j=1}^{4} \sum_{i=1}^{4} \beta_{X,i} \beta_{Y,j}$
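
One way to read the chain rule on slides 16 and 17 is that each voxel scatters its $\partial C / \partial v$ onto the 16 control points that influence it, weighted by $\beta_{X,i} \beta_{Y,j}$. The sketch below shows that reading for a single voxel and a single vector component. It is not taken from the authors' code: the cubic basis is an assumption, and `accumulate_coefficient_gradient` is a hypothetical name.

```python
def bspline_basis(u):
    """Uniform cubic B-spline weights for u in [0, 1) (assumed basis)."""
    return [
        (1 - u) ** 3 / 6.0,
        (3 * u**3 - 6 * u**2 + 4) / 6.0,
        (-3 * u**3 + 3 * u**2 + 3 * u + 1) / 6.0,
        u**3 / 6.0,
    ]


def accumulate_coefficient_gradient(dc_dp, dc_dv, u, v):
    """Scatter one voxel's dC/dv onto its 16 control points.

    dc_dp[j][i]: running 4x4 gradient dC/dP, updated in place.
    dc_dv: this voxel's change in cost w.r.t. one vector component.
    u, v: the voxel's normalized position inside its tile.
    """
    beta_x = bspline_basis(u)
    beta_y = bspline_basis(v)
    for j in range(4):
        for i in range(4):
            # dv/dP_{i,j} = beta_{X,i} * beta_{Y,j}
            dc_dp[j][i] += beta_x[i] * beta_y[j] * dc_dv
```

Calling this once per voxel and per vector component builds up the full gradient. Since several voxels update the same control point, a parallel version has to coordinate those writes, for example by giving each worker its own accumulation bins and merging them afterwards.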

Slide 18 of 33
Parallelization: Leveraging GPUs, OpenMP, etc.

Slide 19 of 33
What do we parallelize?
[Figure: registration loop, as on slide 8, annotated with four ✓ marks and two ✗ marks. Stages: Decompress Vector Field; Correspondence and Cost; Change in Cost w.r.t. Vectors ($\partial C / \partial v$); Change in Cost w.r.t. Coefficients ($\partial C / \partial P$); Quasi-Newtonian Optimizer (new P).]

Slide 20 of 33
[Figure: processing stages over F and M. Compute Vector from Coeff; Compute Histograms; Cycle Hist: Cost (MI); Compute Change in Cost w.r.t. Vector ($\partial C / \partial v$).]

Slide 21 of 33
Generate Histograms, for each voxel: compute vector; get corresponding voxels in moving image (nearest neighbors); compute partial volumes; use partial volumes for moving & joint histograms.
Compute Score: simply cycle through the histograms. A traditional serial CPU implementation is very fast (time required is negligible).
$C = \frac{1}{N} \sum_{j=1}^{B_M} \sum_{i=1}^{B_F} h_j(i,j) \ln \frac{N \times h_j(i,j)}{h_M(j) \, h_F(i)}$
Compute Gradient (change in cost as vector changes), for each voxel: get vector; get corresponding voxels in moving image (nearest neighbors); compute partial volume derivatives; NEXT.
$\frac{\partial C}{\partial v} = \sum_{n=1}^{4} \left. \frac{\partial C}{\partial h} \right|_{x_n} \times \left( \frac{\partial w_n}{\partial v} \right)$
$\left. \frac{\partial C}{\partial h} \right|_{x_n} = \ln \frac{N \times h_j(i_n, j_n)}{h_M(j_n) \times h_F(i_n)} - C$
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial v} \times \frac{\partial v}{\partial P}$

Slide 22 of 33
Change in Cost w.r.t. Coefficients
[Figure: two 4 x 4 grids of tiles numbered 1 to 16, with axes $\beta_X$ and $\beta_Y$, over the fixed image F; CPU 1 processes a tile and writes into rows of bins labeled 1, 2, 3, 4, 5, ..., 16]
$v_X = \sum_{j=1}^{4} \sum_{i=1}^{4} (\beta_{X,i} \beta_{Y,j}) P_{X,i,j}$
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial v} \frac{\partial v}{\partial P} = \frac{\partial C}{\partial v} \sum_{j=1}^{4} \sum_{i=1}^{4} \beta_{X,i} \beta_{Y,j}$

Slide 23 of 33
Change in Cost w.r.t. Coefficients
[Figure: two 4 x 4 grids of tiles numbered 1 to 16, with axes $\beta_X$ and $\beta_Y$, over the fixed image F; CPU 1 and CPU 2 each process a tile and write into rows of bins labeled 1, 2, 3, 4, 5, ..., 16]
$v_X = \sum_{j=1}^{4} \sum_{i=1}^{4} (\beta_{X,i} \beta_{Y,j}) P_{X,i,j}$
$\frac{\partial C}{\partial P} = \frac{\partial C}{\partial v} \frac{\partial v}{\partial P} = \frac{\partial C}{\partial v} \sum_{j=1}^{4} \sum_{i=1}^{4} \beta_{X,i} \beta_{Y,j}$

Slide 24 of 33
Constant Control Point Spacing: 15 x 15 x 15
16x speedup: 30 min → 1.8 min
J. Shackleford, N. Kandasamy, and G. Sharp, "Deformable Volumetric Registration using B-splines," GPU Computing Gems: Emerald Edition, Morgan Kaufmann Pub, 2011.
J. Shackleford, N. Kandasamy, and G. Sharp, "On developing B-spline registration algorithms for multi-core processors," Physics in Medicine and Biology, vol. 55, p. 6329, 2010.

Slide 25 of 33
Constant Volume Size: 256 x 256 x 256
J. Shackleford, N. Kandasamy, and G. Sharp, "Deformable Volumetric Registration using B-splines," GPU Computing Gems: Emerald Edition, Morgan Kaufmann Pub, 2011.
J. Shackleford, N. Kandasamy, and G. Sharp, "On developing B-spline registration algorithms for multi-core processors," Physics in Medicine and Biology, vol. 55, p. 6329, 2010.

Slide 26 of 33
Histogram Computation: Leveraging GPUs, OpenMP, etc.
[Figure: histogram computation under OpenMP and under CUDA. Labels: thread-level histograms (shared memory), summed (+) into block-level histograms (global memory), then into complete histograms (global memory).]

Slide 27 of 33
Histogram Computation: Leveraging GPUs, OpenMP, etc.
[Figure: histogram computation under OpenMP and under CUDA, with one CUDA block marked. Labels: thread-level histograms (shared memory), summed (+) into block-level histograms (global memory), then into the complete histogram (global memory).]
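
The two-level merge suggested by the labels on slides 26 and 27 can be emulated serially to show the data flow. This is a hedged sketch, not CUDA and not the authors' kernel: it counts whole voxels instead of partial volumes, the way work is split among blocks and threads is an assumption, and all function names are hypothetical.

```python
def partial_histogram(bin_indices, n_bins):
    """Private histogram built by one worker from its share of the voxels."""
    hist = [0] * n_bins
    for b in bin_indices:
        hist[b] += 1
    return hist


def merge(histograms):
    """Sum several equally sized histograms bin by bin."""
    return [sum(column) for column in zip(*histograms)]


def hierarchical_histogram(bin_indices, n_bins, n_blocks, threads_per_block):
    """Thread-level -> block-level -> complete histogram, emulated serially."""
    block_size = -(-len(bin_indices) // n_blocks)  # ceiling division
    block_hists = []
    for b in range(n_blocks):
        chunk = bin_indices[b * block_size:(b + 1) * block_size]
        # Each emulated thread takes a strided share of its block's voxels.
        thread_hists = [
            partial_histogram(chunk[t::threads_per_block], n_bins)
            for t in range(threads_per_block)
        ]
        block_hists.append(merge(thread_hists))  # block-level histogram
    return merge(block_hists)  # complete histogram
```

Giving each worker a private histogram avoids write conflicts on shared bins; the price is the extra memory for the partial histograms and the merge passes at the end.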
