SLIDE 5 9
compressive sensing level set methods fluid registration total variational algorithm
Application Domains: Medical Image Processing Pipeline Application Domains: Medical Image Processing Pipeline
denoising denoising registration registration segmentation segmentation analysis analysis reconstruction reconstruction
Navier-Stokes equations
non-iterative, highly parallel, local & global communication sparse linear algebra, structured grid, optimization methods parallel, global communication dense linear algebra, optimization methods local communication sparse linear algebra, n-body methods, graphical models local communication dense linear algebra, spectral methods, MapReduce iterative, local or global communication dense and sparse linear algebra, optimization methods
- These algorithms have diverse
These algorithms have diverse computation & computation & communication patterns communication patterns
- A single homogenous system
A single homogenous system can not perform very well on can not perform very well on all these algorithms all these algorithms
10
compressive sensing level set methods fluid registration total variational algorithm Navier-Stokes equations
Non-iterative, highly parallel, local & global communication sparse linear algebra, structured grid, optimization methods parallel, global communication dense linear algebra, optimization methods local communication sparse linear algebra, n-body methods, graphical models local communication dense linear algebra, spectral methods, MapReduce iterative, local or global communication dense and sparse linear algebra, optimization methods
Need of Customization for Medical Image Processing Pipeline Need of Customization for Medical Image Processing Pipeline
denoising denoising registration registration segmentation segmentation analysis analysis reconstruction reconstruction
- These algorithms have diverse
These algorithms have diverse computation & communication computation & communication patterns patterns
- A single, homogeneous system
A single, homogeneous system cannot perform very well on all cannot perform very well on all
- f these algorithms
- f these algorithms
- Need architecture
Need architecture customization and hardware customization and hardware-
software co-
- optimization
- ptimization
- Include many common
Include many common computation kernels ( computation kernels (“ “motifs motifs” ”) )
- Applicable to other domains
Applicable to other domains Bi Bi-
- harmonic registration (Using the same algorithm on all
harmonic registration (Using the same algorithm on all platforms) platforms)
CPU (Xenon 2.0 GHz) CPU (Xenon 2.0 GHz) 1x 1x ~100 W ~100 W GPU (Tesla C1060) GPU (Tesla C1060) 93x 93x ~150 W ~150 W FPGA (xc4vlx100) FPGA (xc4vlx100) 11x 11x ~5W ~5W
3D median filter: For each 3D median filter: For each voxel voxel, compute the median of , compute the median of the 3 x 3 x 3 neighboring the 3 x 3 x 3 neighboring voxels voxels
CPU (Xenon 2.0 GHz) CPU (Xenon 2.0 GHz) Quick select Quick select 1x 1x ~100 W ~100 W GPU (Tesla C1060) GPU (Tesla C1060) Median of medians Median of medians 70x 70x ~140 W ~140 W FPGA (xc4vlx100) FPGA (xc4vlx100) Bit Bit-
by-
bit majority voting 1200x 1200x ~3 W ~3 W