gpu data mining in neuroimaging genomics
play

GPU Data Mining in Neuroimaging Genomics Bob Zigon Beckman Coulter - PowerPoint PPT Presentation

GPU Data Mining in Neuroimaging Genomics Bob Zigon Beckman Coulter Indianapolis, Indiana May 10, 2017 1 / 20 Outline Background ANOVA for Voxels and SNPs VEGAS for Voxels and Genes High Speed GPU Monte-Carlo Simulator 2 / 20 What is


  1. GPU Data Mining in Neuroimaging Genomics Bob Zigon Beckman Coulter Indianapolis, Indiana May 10, 2017 1 / 20

  2. Outline Background ANOVA for Voxels and SNPs VEGAS for Voxels and Genes High Speed GPU Monte-Carlo Simulator 2 / 20

  3. What is Neuroimaging Genomics? 1 Neuroimaging Genomics is the fusion of brain imaging and genotyping data. 2 Study the influence of genetic variation on brain structure and function. 3 / 20

  4. MRI and Sequencing data MRI instrument MRI data Genotyping instrument Genotyping data 4 / 20

  5. Problem Definition Develop an interactive tool for studying Alzheimer’s Disease by coupling a 3D brain explorer with a genome explorer. Prior Art Our Goal 120 ROI’s ⇒ 1,000,000 voxels 20,000 SNP’s 1,000,000 SNP’s 5 / 20

  6. Problem Definition Brain with 120 Regions of Interest Brain with 1,000,000 voxels 6 / 20

  7. How We Do It – The UI

  8. How We Do It – The UI Brain Explorer

  9. How We Do It – The UI SNP Explorer Brain Explorer

  10. How We Do It – The UI SNP Explorer Brain Explorer Heat Map 7 / 20

  11. ANOVA ANOVA - Analysis of Variance Understand the relationship between the gray matter density from the MRI and the SNP genotype. ”Did the combination happen by chance or not?” 8 / 20

  12. ANOVA ANOVA - Analysis of Variance Understand the relationship between the gray matter density from the MRI and the SNP genotype. ”Did the combination happen by chance or not?” Computational complexity O ( N v ∗ N j ∗ N s ) N v - number of voxels N j - number of subjects N s - number of SNPS 8 / 20

  13. ANOVA Subjects   v 11 · · · v 1 M Voxels v 21 · · · v 2 M     . ... .  . .  . .   v N 1 · · · v NM SNPs   vs 11 · · · vs 1 M Voxels  vs 21 · · · vs 2 M    ANOVA . . ...   . . . .   vs N 1 · · · vs NM Subjects   s 11 · · · s 1 M SNPs s 21 · · · s 2 M     . . ...  . .  . .   s K 1 · · · s KM 9 / 20

  14. VEGAS VEGAS - VErsatile Gene based Association Study Understand the relationship between the gray matter density from the MRI and the collective effect of multiple SNPs within a gene. ”Did the combination happen by chance or not?” 10 / 20

  15. VEGAS VEGAS - VErsatile Gene based Association Study Understand the relationship between the gray matter density from the MRI and the collective effect of multiple SNPs within a gene. ”Did the combination happen by chance or not?” Computational complexity O ( N v ∗ N j ∗ N s + ) N v ∗ N g ∗ N i � �� � � �� � ANOVA component Monte-Carlo component N v - number of voxels N g - number of genes N i - number of Monte-Carlo iterations (10 2 , 10 3 , 10 4 , 10 5 , or 10 6 ) 10 / 20

  16. VEGAS Subjects   v 11 · · · v 1 M Voxels  v 21 · · · v 2 M    . ... .  . .  . .   v N 1 · · · v NM SNPs Genes     vs 11 · · · vs 1 M vg 11 · · · vg 1 G Voxels Voxels  vs 21 · · · vs 2 M   vg 21 · · · vg 2 G      ANOVA VEGAS . . . . ... ...     . . . . . . . .     vs N 1 · · · vs NM vg N 1 · · · vg NG Subjects   s 11 · · · s 1 M SNPs  s 21 · · · s 2 M    Monte Carlo Simulation . . ...   . . . .   s K 1 · · · s KM 11 / 20

  17. Video Video of the Interactive Neuroimaging Genomic Browser 12 / 20

  18. Lessons Learned How do you build a high speed Monte-Carlo Simulator for an N dimensional problem on a GPU? 13 / 20

  19. Lessons Learned One Dimensional for i = 1 to K do Choose X from N(0,1) Y = F ( X ) Make decision about Y end 14 / 20

  20. Lessons Learned One Dimensional N-Dimensional for i = 1 to K do for i = 1 to K do Choose n values from N(0,1) giving X n Choose X from N(0,1) Y n = F ( X n ) Y = F ( X ) Make decision about Y n Make decision about Y end end 14 / 20

  21. Lessons Learned First Attempt at N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do Choose n values from N(0,1) giving X n Y n = F ( X n ) Make decision about Y n end end end 15 / 20

  22. Lessons Learned First Attempt at N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do Choose n values from N(0,1) giving X n Y n = F ( X n ) Make decision about Y n end end end Slow Fast Theoretical Memory Bandwidth (gb/sec) 20 320 GFLOPS 20 10,000 15 / 20

  23. Lessons Learned - Solution N-Dimensional Ah-Ha In parallel, generate n × K values from N(0,1) giving X n × K Y n × K = F n × n × X n × K In parallel, decide about Y n × K 16 / 20

  24. Lessons Learned Slow N-Dimensional Fast N-Dimensional foreach voxel V in parallel do foreach voxel V sequentially do foreach gene G in parallel do foreach gene G sequentially do for i = 1 to K do In parallel, generate nK values Y n × K = F n × n × X n × K n values from N(0,1) Y n = F ( X n ) In parallel, decide about Y n × K Make decision about Y n end end end end X end X 17 / 20

  25. Lessons Learned Slow N-Dimensional Fast N-Dimensional foreach voxel V in parallel do foreach voxel V sequentially do foreach gene G in parallel do foreach gene G sequentially do for i = 1 to K do In parallel, generate nK values Y n × K = F n × n × X n × K n values from N(0,1) Y n = F ( X n ) In parallel, decide about Y n × K Make decision about Y n end end end end X end X 17 / 20

  26. Lessons Learned Slow N-Dimensional Fast N-Dimensional foreach voxel V in parallel do foreach voxel V sequentially do foreach gene G in parallel do foreach gene G sequentially do for i = 1 to K do In parallel, generate nK values Y n × K = F n × n × X n × K n values from N(0,1) Y n = F ( X n ) In parallel, decide about Y n × K Make decision about Y n end end end end X end X Slow Fast Theoretical Memory Bandwidth (gb/sec) 20 155 320 GFLOPS 20 2,000 10,000 800X improvement! 17 / 20

  27. Vegas Results 10 4 1-CPU 4-CPU Time in seconds GeForce Titan X 10 3 10 2 10 1 10 0 70x70 80x80 90x90 100x100 V-Voxels by G-Genes Figure 1: Execution times for 1 VEGAS run with K=10,000 Monte-Carlo iterations 18 / 20

  28. Acknowledgements 1 Professor Li Shen, Dept. of Radiology and Imaging Sciences, IU School of Medicine, NIH R01 EB022574, R01 LM011360, U01 AG024904, and IUPUI ITDP Program. 2 Professor Shiaofen Fang, Computer Science Dept. Chair, Indiana University-Purdue University of Indianapolis 3 Professor Mohammad Al Hasan, Associate Professor Computer Science, Indiana University-Purdue University of Indianapolis 4 Huang Li, PhD Candidate, Computer Science, Indiana University-Purdue University of Indianapolis 19 / 20

  29. Thank you robert.zigon@beckman.com 20 / 20

  30. 21 / 20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend