Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic - - PowerPoint PPT Presentation

hybrid cpu gpu acceleration of detection of 2 snp
SMART_READER_LITE
LIVE PREVIEW

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic - - PowerPoint PPT Presentation

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Jorge Gonzlez-Domnguez*, Bertil Schmidt*, Jan C. Kssens**, Lars Wienbrandt**


slide-1
SLIDE 1

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

Jorge González-Domínguez*, Bertil Schmidt*, Jan C. Kässens**, Lars Wienbrandt**

*Parallel and Distributed Architectures Group, Johannes Gutenberg University of Mainz, Germany {j.gonzalez,bertil.schmidt}@uni-mainz.de **Department of Computer Science, Christian-Albrechts-University of Kiel, Germany {jka,lwi}@informatik.uni-kiel.de

20th International Euro-Par Conference Euro-Par 2014

slide-2
SLIDE 2

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

1

Introduction

2

Methodology

3

Implementation

4

Experimental Evaluation

5

Conclusion

slide-3
SLIDE 3

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

1

Introduction

2

Methodology

3

Implementation

4

Experimental Evaluation

5

Conclusion

slide-4
SLIDE 4

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (I)

Analyses of genetic influence

  • n diseases
slide-5
SLIDE 5

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (I)

Analyses of genetic influence

  • n diseases

M individuals

slide-6
SLIDE 6

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (I)

Analyses of genetic influence

  • n diseases

M individuals

K cases

slide-7
SLIDE 7

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (I)

Analyses of genetic influence

  • n diseases

M individuals

K cases C controls

slide-8
SLIDE 8

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (I)

Analyses of genetic influence

  • n diseases

M individuals

K cases C controls

N genetic markers, Single Nucleotide Polymorphisms (SNPs). 3 genotypes:

Homozygous Wild (w, AA, 0) Heterozygous (h, Aa, 1) Homozygous Variant (v, aa, 2)

slide-9
SLIDE 9

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (II)

Cases Controls SNP 1 1 2 1 2 1 2 1 2 1 2 1 SNP 2 1 1 2 1 2 2 1 1 1 2 SNP 3 1 2 1 1 1 2 1 1 SNP 4 1 1 1 1 2 2 2 2 1 1 1 1 SNP 5 2 2 2 1 1 1 1 1 1 2 2 SNP 6 1 1 1 1 1 2 1 2 1 2 2 1

slide-10
SLIDE 10

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (II)

Cases Controls SNP 1 1 2 1 2 1 2 1 2 1 2 1 SNP 2 1 1 2 1 2 2 1 1 1 2 SNP 3 1 2 1 1 1 2 1 1 SNP 4 1 1 1 1 2 2 2 2 1 1 1 1 SNP 5 2 2 2 1 1 1 1 1 1 2 2 SNP 6 1 1 1 1 1 2 1 2 1 2 2 1

slide-11
SLIDE 11

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (II)

Cases Controls SNP 1 1 2 1 2 1 2 1 2 1 2 1 SNP 2 1 1 2 1 2 2 1 1 1 2 SNP 3 1 2 1 1 1 2 1 1 SNP 4 1 1 1 1 2 2 2 2 1 1 1 1 SNP 5 2 2 2 1 1 1 1 1 1 2 2 SNP 6 1 1 1 1 1 2 1 2 1 2 2 1

slide-12
SLIDE 12

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Genome-Wide Association Studies (and III)

Definition Two SNPs present epistasis or interaction if: Their joint genotype frequencies show a statistically significant difference between cases and controls which potentially explains the effect of the genetic variation leading to disease. The difference between cases and controls shown by the joint values is significantly higher than using only the individual SNP values.

slide-13
SLIDE 13

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

BOOST

BOolean Operation-based Screening and Testing Binary traits Exhaustive search Statistical regression Good accuracy (used by biologists) Returns a list of SNP pairs with high interaction probability Fastest available tool. Intel Core i7 3.20GHz:

40,000 SNPs and 3,200 individuals

About 800 million pairs 51 minutes

500,000 SNPs and 5,000 individuals

About 125 billion pairs (moderated size) Estimated 12 days

slide-14
SLIDE 14

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

GBOOST

CUDA version for GPUs Same accuracy as BOOST 40,000 SNPs and 6,400 individuals

About 800 million pairs 28 seconds on a GTX Titan

500,000 SNPs and 5,000 individuals

About 125 billion pairs (moderated size) 1 hour on a GTX Titan

slide-15
SLIDE 15

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

GBOOST

CUDA version for GPUs Same accuracy as BOOST 40,000 SNPs and 6,400 individuals

About 800 million pairs 28 seconds on a GTX Titan

500,000 SNPs and 5,000 individuals

About 125 billion pairs (moderated size) 1 hour on a GTX Titan

High-throughput genotyping technologies collect few million SNPs of an individual within a few minutes → Expected datasets with 5M SNPs and 10,000 individuals

slide-16
SLIDE 16

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Introduction

Goal of the Work

Development of EpistSearch, improving BOOST and GBOOST for GWAS Same accuracy CPU computation

Faster algorithm Multithreaded version

GPU computation

Faster algorithm Improvement of the CUDA kernel

CPU/GPU computation

Inter-task hybrid parallelism

slide-17
SLIDE 17

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

1

Introduction

2

Methodology

3

Implementation

4

Experimental Evaluation

5

Conclusion

slide-18
SLIDE 18

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (I)

For each SNP-pair → Number of occurrences of each combination of genotypes Cases SNP2=0 SNP2=1 SNP2=2 SNP1=0 n000 n010 n020 SNP1=1 n100 n110 n120 SNP1=2 n200 n210 n220 Controls SNP2=0 SNP2=1 SNP2=2 SNP1=0 n001 n011 n021 SNP1=1 n101 n111 n121 SNP1=2 n201 n211 n221

slide-19
SLIDE 19

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (II)

SNP 4 1 1 1 1 2 2 2 2 1 1 1 1 SNP 6 1 1 1 1 1 2 1 2 1 2 2 1 Cases SNP6=0 SNP6=1 SNP6=2 SNP4=0 4 SNP4=1 4 SNP4=2 Controls SNP6=0 SNP6=1 SNP6=2 SNP4=0 SNP4=1 2 2 SNP4=2 1 2

slide-20
SLIDE 20

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (III)

Boolean Representation of Genotype Data Applied in BOOST and GBOOST 6 strings per SNP

3 per cases and 3 per controls (one per genotype {0,1,2}) One bit per individual Represents whether the individual has the corresponding genotype

slide-21
SLIDE 21

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (IV)

SNP 1 1 2 1 2 1 2 1 2 1 2 1

slide-22
SLIDE 22

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (IV)

SNP 1 1 2 1 2 1 2 1 2 1 2 1 SNP 1 = 0; 1 1 1 SNP 1 = 1; 1 1 1 SNP 1 = 2; 1 1 SNP 1 = 0; 1 1 SNP 1 = 1; 1 1 1 SNP 1 = 2; 1 1 1

slide-23
SLIDE 23

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (and V)

Drawback 50% memory overhead

slide-24
SLIDE 24

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in (G)BOOST (and V)

Drawback 50% memory overhead Advantage More efficient creation of contingency tables Only logical AND computations

Strings packed in arrays of 32 bits Only m

32 32-bit AND operations per value of the table

nxy0 = (SNP 1=x) AND (SNP 2=y)

slide-25
SLIDE 25

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in EpistSearch (I)

Optimization in EpisSearch Only 8 values of the contingency table explicitly calculated with AND Only four strings per SNP Additional information with the total count of each genotype for cases and controls (6 integers)

Calculated once per SNP when loading data sum0,sum1,sum2,sum0,sum1,sum2

slide-26
SLIDE 26

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in EpistSearch (II)

Cases SNP2=0 SNP2=1 SNP2=2 SNP1=0 n000 − n020 SNP1=1 − − − SNP1=2 n200 − n220 Controls SNP2=0 SNP2=1 SNP2=2 SNP1=0 n001 − n021 SNP1=1 − − − SNP1=2 n201 − n221

slide-27
SLIDE 27

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in EpistSearch (II)

Cases SNP2=0 SNP2=1 SNP2=2 SNP1=0 n000 − n020 SNP1=1 − − − SNP1=2 n200 − n220 Controls SNP2=0 SNP2=1 SNP2=2 SNP1=0 n001 − n021 SNP1=1 − − − SNP1=2 n201 − n221 n010 = sum0-n000-n020

slide-28
SLIDE 28

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Contingency Tables in EpistSearch (and III)

Advantages Less memory requirements Faster calculation of the contingency tables

Only 8 values of the table need the m/32 32-bit AND

  • perations

The other values calculated with a few arithmetic operations

slide-29
SLIDE 29

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (I)

Measuring interaction via log-linear models

slide-30
SLIDE 30

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (I)

Measuring interaction via log-linear models Log-Linear Measure (I) ˆ LS − ˆ LH = N

  • ijk
  • ˆ

πijk log ˆ πijk ˆ pijk

  • ˆ

LS log-likelihood of the saturated regression model ˆ LH log-likelihood of the homogeneous association model ˆ πijk joint distribution obtained under the saturated model ˆ pijk distribution obtained under the homogeneous association model

slide-31
SLIDE 31

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (II)

Measuring interaction via log-linear models Log-Linear Measure (II) ˆ LS − ˆ LH = N

  • ijk
  • ˆ

πijk log ˆ πijk ˆ pijk

  • T the threshold for epistasis

If ˆ LS − ˆ LH > T ⇒ Epistasis

slide-32
SLIDE 32

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (II)

Measuring interaction via log-linear models Log-Linear Measure (II) ˆ LS − ˆ LH = N

  • ijk
  • ˆ

πijk log ˆ πijk ˆ pijk

  • T the threshold for epistasis

If ˆ LS − ˆ LH > T ⇒ Epistasis Computationally expensive

ˆ pijk computed through iterative methods

slide-33
SLIDE 33

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (III)

Kirkwood Superposition Approximation (KSA) ˆ LS − ˆ LKSA = N

ijk

  • ˆ

πijk log

  • ˆ

πijk ˆ pk

ijk

  • ˆ

pk

ijk = 1 η πij.πi.kπ.jk πi..π.j.π..k

η =

ijk πij.πi.kπ.jk πi..π.j.π..k

slide-34
SLIDE 34

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (III)

Kirkwood Superposition Approximation (KSA) ˆ LS − ˆ LKSA = N

ijk

  • ˆ

πijk log

  • ˆ

πijk ˆ pk

ijk

  • ˆ

pk

ijk = 1 η πij.πi.kπ.jk πi..π.j.π..k

η =

ijk πij.πi.kπ.jk πi..π.j.π..k

Upper bound: ˆ LS − ˆ LH ≤ ˆ LS − ˆ LKSA

slide-35
SLIDE 35

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (III)

Kirkwood Superposition Approximation (KSA) ˆ LS − ˆ LKSA = N

ijk

  • ˆ

πijk log

  • ˆ

πijk ˆ pk

ijk

  • ˆ

pk

ijk = 1 η πij.πi.kπ.jk πi..π.j.π..k

η =

ijk πij.πi.kπ.jk πi..π.j.π..k

Upper bound: ˆ LS − ˆ LH ≤ ˆ LS − ˆ LKSA ˆ LS − ˆ LKSA < T ⇒ No epistasis

slide-36
SLIDE 36

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (III)

Kirkwood Superposition Approximation (KSA) ˆ LS − ˆ LKSA = N

ijk

  • ˆ

πijk log

  • ˆ

πijk ˆ pk

ijk

  • ˆ

pk

ijk = 1 η πij.πi.kπ.jk πi..π.j.π..k

η =

ijk πij.πi.kπ.jk πi..π.j.π..k

Upper bound: ˆ LS − ˆ LH ≤ ˆ LS − ˆ LKSA ˆ LS − ˆ LKSA < T ⇒ No epistasis ˆ LS − ˆ LKSA is computationally simpler and faster

slide-37
SLIDE 37

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (and IV)

Pseudocode of (G)BOOST

slide-38
SLIDE 38

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (and IV)

Pseudocode of (G)BOOST For each SNP-pair P

1

Calculate Contingency Table of P

slide-39
SLIDE 39

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (and IV)

Pseudocode of (G)BOOST For each SNP-pair P

1

Calculate Contingency Table of P

2

v = KSA_Value(P)

slide-40
SLIDE 40

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (and IV)

Pseudocode of (G)BOOST For each SNP-pair P

1

Calculate Contingency Table of P

2

v = KSA_Value(P)

3

If v > T

1

v = LogLinear_Value(P)

slide-41
SLIDE 41

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in (G)BOOST (and IV)

Pseudocode of (G)BOOST For each SNP-pair P

1

Calculate Contingency Table of P

2

v = KSA_Value(P)

3

If v > T

1

v = LogLinear_Value(P)

2

If v > T include P in the output list as pair with epistasis

slide-42
SLIDE 42

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (I)

KSA’s Superposition Approximation (KSASA) DKL(E, O) =

ij πij1 log

πij1

πij0

  • E count of expected (control) studies

O count of observed (case) studies DKL is discrete Kullback-Leibler divergence

slide-43
SLIDE 43

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (I)

KSA’s Superposition Approximation (KSASA) DKL(E, O) =

ij πij1 log

πij1

πij0

  • E count of expected (control) studies

O count of observed (case) studies DKL is discrete Kullback-Leibler divergence Upper bound: ˆ LS − ˆ LH ≤ ˆ LS − ˆ LKSA ≤ N ∗ DKL(E, O)

slide-44
SLIDE 44

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (I)

KSA’s Superposition Approximation (KSASA) DKL(E, O) =

ij πij1 log

πij1

πij0

  • E count of expected (control) studies

O count of observed (case) studies DKL is discrete Kullback-Leibler divergence Upper bound: ˆ LS − ˆ LH ≤ ˆ LS − ˆ LKSA ≤ N ∗ DKL(E, O) Calculation of N ∗ DKL(E, O) even faster

slide-45
SLIDE 45

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (and II)

Pseudocode of EpistSearch For each SNP-pair P

slide-46
SLIDE 46

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (and II)

Pseudocode of EpistSearch For each SNP-pair P

1

Calculate Contingency Table of P

slide-47
SLIDE 47

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (and II)

Pseudocode of EpistSearch For each SNP-pair P

1

Calculate Contingency Table of P

2

v = KSASA_Value(P)

slide-48
SLIDE 48

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Methodology

Filtering Stage in EpistSearch (and II)

Pseudocode of EpistSearch For each SNP-pair P

1

Calculate Contingency Table of P

2

v = KSASA_Value(P)

3

If v > T

1

v = KSA_Value(P)

2

If v > T

1

v = LogLinear_Value(P)

2

If v > T include P in the output list as pair with epistasis

slide-49
SLIDE 49

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

1

Introduction

2

Methodology

3

Implementation

4

Experimental Evaluation

5

Conclusion

slide-50
SLIDE 50

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

Parallel Implementations

Each CPU/GPU core performs the whole calculation of different SNP-pairs

Calculation of the contingency table Filtering

slide-51
SLIDE 51

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

Parallel Implementations

Each CPU/GPU core performs the whole calculation of different SNP-pairs

Calculation of the contingency table Filtering CPU multicore: PThreads GPU: CUDA CPU&GPU: CUDA&PThreads

GPU computes much more SNP-pairs than CPUs

slide-52
SLIDE 52

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (I)

CUDA kernel Genotyping information loaded in device memory through pinned copies In each kernel many SNP-pairs are analyzed Each thread performs the whole calculation of independent SNP-pairs

slide-53
SLIDE 53

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (I)

CUDA kernel Genotyping information loaded in device memory through pinned copies In each kernel many SNP-pairs are analyzed Each thread performs the whole calculation of independent SNP-pairs Only one kernel for all the computation

slide-54
SLIDE 54

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (I)

CUDA kernel Genotyping information loaded in device memory through pinned copies In each kernel many SNP-pairs are analyzed Each thread performs the whole calculation of independent SNP-pairs Only one kernel for all the computation

Thread divergence: only few threads need to compute the KSA and Log-Linear filters

slide-55
SLIDE 55

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (I)

CUDA kernel Genotyping information loaded in device memory through pinned copies In each kernel many SNP-pairs are analyzed Each thread performs the whole calculation of independent SNP-pairs Only one kernel for all the computation

Thread divergence: only few threads need to compute the KSA and Log-Linear filters GBOOST solve it performing the Log-Linear filter on the CPUs

Contingency tables must be copied to host memory Less performance

slide-56
SLIDE 56

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (II)

Format of the Genotyping Information 4 strings of binary values per SNP

2 for controls and 2 for cases 1 bit per individual Represents whether the individual has the corresponding genotype ({0,2})

slide-57
SLIDE 57

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (II)

Format of the Genotyping Information 4 strings of binary values per SNP

2 for controls and 2 for cases 1 bit per individual Represents whether the individual has the corresponding genotype ({0,2})

For each string, information of 32 individuals packed in 32-bit arrays of length m/32

slide-58
SLIDE 58

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (and III)

slide-59
SLIDE 59

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (and III)

Increasing Coalescence Consecutive threads usually access to consecutive pairs

Stride of m/32 Bad coalescence

slide-60
SLIDE 60

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Implementation

CUDA Implementation (and III)

Increasing Coalescence Consecutive threads usually access to consecutive pairs

Stride of m/32 Bad coalescence

Entries of the arrays reordered when loading into device memory

slide-61
SLIDE 61

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Experimental Evaluation

1

Introduction

2

Methodology

3

Implementation

4

Experimental Evaluation

5

Conclusion

slide-62
SLIDE 62

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Experimental Evaluation

System Characteristics

Hex-core Intel Core i7 Sandy Bridge 3.20GHz 2 different NVIDIA GPUs (Kepler architecture):

Name Number of cores Core frequency Memory size GTX 650Ti 768 980MHz 2GB GTX Titan 2688 875.5MHz 6GB

slide-63
SLIDE 63

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Experimental Evaluation

Experiments for CPU

Table : Percentage of pairs that pass the KSASA and log-linear filters in the CPU experiments.

  • Num. Inds. →

800 1600 3200

  • Num. SNPs →

10K 40K 10K 40K 10K 40K KSASA 18.84 15.95 12.17 8.88 25.46 14.27 log-linear 11 × 10−4 6 × 10−4 27 × 10−4 8 × 10−4 170 × 10−4 19 × 10−4

0.5 1 1.5 2 2.5 3 3.5 800 1,600 3,200 Execution Time (min) Number of Individuals 10K SNPs

(2.20) (11.25) (2.03) (11.05) (1.75) (9.59)

BOOST EpistSearch-1Th EpistSearch-6Th 10 20 30 40 50 60 800 1,600 3,200 Execution Time (min) Number of Individuals 40K SNPs

(2.29) (10.92) (2.07) (10.90) (1.79) (10.07)

BOOST EpistSearch-1Th EpistSearch-6Th

slide-64
SLIDE 64

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Experimental Evaluation

Experiments for GPU (I)

Table : Percentage of pairs that pass the KSASA and log-linear filters in the GPU experiments.

  • Num. Inds. →

6400 12800 25600

  • Num. SNPs →

40K 160K 40K 160K 40K 160K KSASA 20.27 6.13 35.49 7.03 52.02 9.35 log-linear 110 × 10−4 6 × 10−4 800 × 10−4 7 × 10−4 4000 × 10−4 12 × 10−4

1 2 3 4 5 6 7 8 6,400 12,800 25,600 Execution Time (min) Number of Individuals 40K SNPs

(1.42) (1.54) (1.83) (1.96) (2.94) (3.12)

GBOOST EpistSearch EpistSearch-6Th 10 20 30 40 50 60 70 6,400 12,800 25,600 Execution Time (min) Number of Individuals 160K SNPs

(1.54) (1.65) (1.84) (1.95) (2.07) (2.20)

GBOOST EpistSearch EpistSearch-6Th

Figure : Execution times on the GTX 650 Ti GPU.

slide-65
SLIDE 65

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Experimental Evaluation

Experiments for GPU (II)

Table : Percentage of pairs that pass the KSASA and log-linear filters in the GPU experiments.

  • Num. Inds. →

6400 12800 25600

  • Num. SNPs →

40K 160K 40K 160K 40K 160K KSASA 20.27 6.13 35.49 7.03 52.02 9.35 log-linear 110 × 10−4 6 × 10−4 800 × 10−4 7 × 10−4 4000 × 10−4 12 × 10−4

0.5 1 1.5 2 2.5 3 3.5 4 6,400 12,800 25,600 Execution Time (min) Number of Individuals 40K SNPs

(1.48) (1.48) (2.09) (2.13) (5.27) (5.34)

GBOOST EpistSearch EpistSearch-6Th 5 10 15 20 25 6,400 12,800 25,600 Execution Time (min) Number of Individuals 160K SNPs

(1.60) (1.62) (1.82) (1.83) (1.95) (1.96)

GBOOST EpistSearch EpistSearch-6Th

Figure : Execution times on the GTX Titan GPU.

slide-66
SLIDE 66

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Experimental Evaluation

Experiments for GPU (and III)

Dataset with real information from the Wellcome Trust Case Control Consortium (WTCCC)

500,568 SNPs 2,005 cases with bipolar disorder 3,004 controls

Tool Architecture Time Speed (106 tests per second) EpistSearch GTX Titan + 6 Intel Core i7 42 m 49.81 EpistSearch GTX Titan 43 m 49.04 GBOOST GTX Titan 1 h 01 m 34.23 EpistSearch GTX 650Ti + 6 Intel Core i7 1 h 48 m 19.29 EpistSearch GTX 650Ti 1 h 57 m 17.81 GBOOST GTX 650Ti 2 h 41 m 12.97 GBOOST* GTX 285 2 h 43 m 12.81 EpiGPU* GTX 580 2 h 55 m 11.90 SHEsisEPI* GTX 285 27 h 1.29

slide-67
SLIDE 67

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Conclusion

1

Introduction

2

Methodology

3

Implementation

4

Experimental Evaluation

5

Conclusion

slide-68
SLIDE 68

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Conclusion

Summary

Development of EpistSearch Tool to search for epistasis between SNP-pairs in a fast manner taking advantage of CPU and GPU parallelism

Based on regression model

EpistSearch improves (G)BOOST

Faster calculation of the contingency tables Novel faster KSASA filter Multithreaded CPU version Log-linear filter also calculated on the GPU Memory accesses more coalesced Collaboration among CPU and GPU cores

Able to reach very high speedups over (G)BOOST

11.3 on CPU (with 6 cores) 5.3 on a GTX Titan GPU

Future work: multiGPU version

slide-69
SLIDE 69

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Conclusion

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

Jorge González-Domínguez*, Bertil Schmidt*, Jan C. Kässens**, Lars Wienbrandt**

*Parallel and Distributed Architectures Group, Johannes Gutenberg University of Mainz, Germany {j.gonzalez,bertil.schmidt}@uni-mainz.de **Department of Computer Science, Christian-Albrechts-University of Kiel, Germany {jka,lwi}@informatik.uni-kiel.de

20th International Euro-Par Conference Euro-Par 2014