Efficient all-against-all protein similarity matrix computation - PowerPoint PPT Presentation

Efficient all-against-all protein similarity matrix computation using OpenCL Genome-oriented bioinformatics lab - WS2013/2014 Uli Köhler & Anton Smirnov LMU & TUM Helmholtz-Zentrum München Supervisor: Mathias Walter February 24th, 2014

Introduction SIMAP SIMAP I Similarity Matrix of Proteins: p 1 p 2 p 3 Database of protein ... − 5 p 1 similarities ... ... − p 2 Compares all-against-all ... p 3 170 − Currently ~73 million protein sequences → 5 . 3 · 10 15 alignments BOINC-SIMAP: distributed computing Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 3 / 21

Introduction SIMAP SIMAP II Currently uses FASTA algorithm (fast, but suboptimal heuristics) For high-scoring hits, Smith-Waterman is currently in use Smith-Waterman provides better accuracy Requires efficient, parallelized implementation Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 4 / 21

Introduction Hardware Computational hardware CPU: ~1-12 cores, available anywhere GPU: 1000+ cores, good availability FPGA ( field programmable gate array ) Configurable number of cores Difficult to use Expensive Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 5 / 21

Parallelization and OpenCL OpenCL OpenCL Programming framework for parallel computing Top level abstraction for low level routines Runs on CPUs, GPUs & FPGAs without modification Driver optimizes code for specific devices Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 6 / 21

Parallelization and OpenCL Smith-Waterman Smith-Waterman parallelization Intra-task Inter-task Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 7 / 21

Parallelization and OpenCL Padding & sizeclasses Sequence length optimization Maximal efficiency of Smith-Waterman implementation: For many optimizations, we need sequences with equal length Equal length can boost performance by multiple magnitudes Pad sequence with ε Alignment score must not change → Substitution score: −∞ Problem: Padding increases matrix size → Large overhead Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 8 / 21

Parallelization and OpenCL Padding & sizeclasses Sizeclasses Solution: Extension sizeclasses / Adaptive binning A K L ε ε Divide sequence length ... ... ... A 0 0 into different classes ... ... ... C 0 0 Pad only within one ... ... ... M 0 0 sizeclass ... ... ... M 0 0 Multiple sizeclasses ... ... ... reduce overall padding L 0 0 Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 9 / 21

Parallelization and OpenCL Padding & sizeclasses SIMAP sequence length distribution 90000 Absolute frequency 60000 30000 0 0 500 1000 1500 2000 Sequence length [AA] Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 10 / 21

Results and benchmarks CLSW Implementation details CLSW: OpenCL Smith-Waterman Objective: Develop proof-of-concept score-only OpenCL Smith-Waterman Use inter-task parallelization All-against-all with affine gap costs Can be used to build vendor-independent fast Smith-Waterman implementation Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 11 / 21

Results and benchmarks Implementation Implementation aspects Written in pure C++11 & OpenCL 1.1 No external dependencies, compact binary Tested with SIMAP subset Verified using SeqAn library Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 12 / 21

Results and benchmarks Advantages Core advantages SWIPE : Integer ↔ CLSW : Floating point → Composition based score adjustment → Higher accuracy Concise codebase: < 1,000 C++ lines of code OpenCL Smith-Waterman: <50 lines of code (SWIPE: 10,000 lines of code) Existing implementations are based on CUDA → Only runs on NVidia GPUs Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 13 / 21

Results and benchmarks Outlook 1,000 x 1,000 sequences benchmark ; 1,000 AA (query) ; 1,000 AA (target) 250 200 Runtime [s] 150 100 50 0 ssearch36 swipe swipe−MT CLSW Program Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 14 / 21

Results and benchmarks Outlook 4000x1000 sequences benchmark, 20 AA (query), 1.000 AA (target) 60 Runtime [s] 40 20 0 ssearch36 swipe swipe−MT CLSW Program Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 15 / 21

Results and benchmarks Outlook Integration into SIMAP Since 2005, only CPU clients Since 2014, also ARM client for Android Users ask for GPU clients regularly since 2005 CLSW was built to be integratable into BOINC → Leverage huge amount of computing power Still, a lot of work needs to be done... Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 16 / 21

Results and benchmarks Outlook Other uses 3-4 times faster than SWIPE for short query sequences → Shotgun proteomics, NGS? Huge optimization potential → Reduce overhead, 5-10x speedup Platforms unsupported by SWIPE (e.g. 32 bit platforms) Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 17 / 21

Conclusion Conclusion CLSW: Portable, GPU-based Smith-Waterman Fast for small queries, can be optimized for large queries Floating point score calculation → Composition-based score adjustment GPU computing is underestimated in computational biology Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 18 / 21

Conclusion Acknowledgements Thank you for your attention! Special thanks to Mathias Walter & Thomas Rattei who made this project possible! Questions? Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 19 / 21

Advanced Topics Kernel sizeclasses 20 AA x 20 AA ; 4,000 x 4,000 alignment, with variable row buffer 20000 Runtime [s] 15000 10000 5000 0 500 1000 1500 2000 Buffer size Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 20 / 21

Advanced Topics Sizeclass mathematical background � sizeclass penalty ) + ( β · | sizeclass | ) Sizeclass: ( α · Difficult to determine optimal values for α and β Idea: Use population quantiles (e.g. q 0 . 01 % to q 100 % ) as sizeclass boundaries. Postprocessing: Divide sizeclasses with penalty > threshold Köhler U, Smirnov A (LMU, TUM) Efficient S/W using OpenCL February 24th, 2014 21 / 21

Efficient all-against-all protein similarity matrix computation - PowerPoint PPT Presentation

Efficient all-against-all protein similarity matrix computation using OpenCL Genome-oriented bioinformatics lab - WS2013/2014 Uli Khler & Anton Smirnov LMU & TUM Helmholtz-Zentrum Mnchen Supervisor: Mathias Walter February 24th,

Protein Sequence Analysis Protein Sequence Analysis Protein sequence motifs Protein sequence

Protein-Protein interactions Reducing the complexity Why are protein-protein interactions

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

RECOGNITION OF RECOGNITION OF PROTEIN FUNCTION PROTEIN FUNCTION USING THE LOCAL SIMILARITY

Animal protein production in a Animal protein production in a Animal protein production in a

DNA RNA Protein synthesis AMINO ACIDS PROTEIN Protein degradation FUNCTION Some properties

CSE182-L7 CSE182-L7 Protein structure Basics Protein structure Basics Protein sequencing via MS

Dynamics of Protein-Protein Interactions: A Probabilistic Model Toward Protein Function Amir

I/O-EFFICIENT SIMILARITY JOIN R. Pagh, N. Pham, F. Silvestri, M. Stckel Similarity Join R = Q

[3] The Matrix What is a matrix? Traditional answer Neo: What is the Matrix? Trinity: The answer

Matrix Multiplication Matrix Multiplication via Matrix-Vector Mult Defn. If matrix A is m n

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Time- -dependent Similarity Measure dependent Similarity Measure Time Time-dependent Similarity

Building an IoT Platform with Matrix matthew@matrix.org http://www.matrix.org What is Matrix?

Liberating Communication with Matrix matthew@matrix.org http://www.matrix.org What is Matrix?

blood, but against the rulers, against the authorities, against the powers of this dark world and

Life Sciences Identifiers. Life Sciences Identifiers. Finally? Finally? Presented by: Martin

NATIONAL MICROFINANCE BANK PLC NATIONAL MICROFINANCE BANK PLC INVESTOR PRESENTATION On the occasion

DEBT MANAGEMENT OFFICE NIGERIA OVERVIEW OF PROGRAMMATIC APPROACH FOR DEBT MANAGEMENT IN LICs: The

TRADE AND DEVELOPMENT BOARD Sixty-seventh executive session Geneva, 4 6 February 2019 NOTES

Offic e o f the Pre side nt Administra tive / Ope ra tio na l Suppo rt * Spe c ia l Pro je c ts

Strength and Con Conditioning for or boxing Pt1 Pt1 Santiago Nieva High Performance Director

GCSE Readiness Evening September 5 2019 Our vision and motto Vision statement: Valuing

IDENTITIES, DISCRIMINATION AND VIOLENCE IN AFRICA PROFESSOR SABELO J. NDLOVU-GATSHENI Department