Hex Modeling Protein Docking Using Polar Fourier Correlations Dave - PowerPoint PPT Presentation

Hex – Modeling Protein Docking Using Polar Fourier Correlations Dave Ritchie Team Orpailleur Inria Nancy – Grand Est

Outline Basic Principles of Docking Fast Fourier Transform (FFT) Docking Methods Hex Polar Fourier Correlation Method Explained The CAPRI Experiment Demo: Using Hex on Linux Practical: CAPRI Target 40 – API-A/Trypsin 2 / 29

Biological Importance of Protein-Protein Interactions Protein interactions (PPIs) are central to many biological systems Humans have about 30,000 proteins, each having about 5 PPIs Understanding PPIs could lead to immense scientific advances Protein-protein interactions as therapeutic drug targets Small “drug” molecules often inhibit or interfere with PPIs 3 / 29

Protein Docking – A Molecular Recognition Problem A six-dimensional puzzle – do these proteins fit together? 4 / 29

Protein Docking – A Molecular Recognition Problem A six-dimensional puzzle – do these proteins fit together? Yes, they fit! 4 / 29

Protein Docking – A Molecular Recognition Problem A six-dimensional puzzle – do these proteins fit together? Yes, they fit! It is mostly a rotational problem: ONE translation plus FIVE rotations... 4 / 29

Protein Docking – A Molecular Recognition Problem A six-dimensional puzzle – do these proteins fit together? Yes, they fit! It is mostly a rotational problem: ONE translation plus FIVE rotations... But proteins are flexible = > multi-dimensional space! 4 / 29

Protein Docking – A Molecular Recognition Problem A six-dimensional puzzle – do these proteins fit together? Yes, they fit! It is mostly a rotational problem: ONE translation plus FIVE rotations... But proteins are flexible = > multi-dimensional space! So, how to calculate whether two proteins recognise each other? 4 / 29

ICM Docking – Multi-Start Pseudo-Brownian Search Stick pins in protein surfaces at 15˚ A intervals For each pair of pins, find minimum energy (6 rotations for each): E = E HVW + E CVW + 2 . 16 E el + 2 . 53 E hb + 4 . 35 E hp + 0 . 20 E solv Often gives good results, but is computationally expensive Fern´ andez-Recio, Abagyan (2004), J Mol Biol, 335, 843–865 5 / 29

Protein Docking Using Fast Fourier Transforms Conventional approaches digitise proteins into 3D Cartesian grids... ...and use FFTs to calculated TRANSLATIONAL correlations: � C [∆ x , ∆ y , ∆ z ] = A [ x , y , z ] × B [ x + ∆ x , y + ∆ y , z + ∆ z ] x , y , z BUT for docking, have to repeat for many rotations – expensive! Conventional grid-based FFT docking = SEVERAL CPU-HOURS Katchalski-Katzir et al. (1992) PNAS, 89 2195–2199 6 / 29

Protein Docking Using Polar Fourier Correlations Rigid docking can be considered as a largely ROTATIONAL problem This means we should use ANGULAR coordinate systems With FIVE rotations, we should get a good speed-up? 7 / 29

Some Theory – 2D Spherical Harmonic Surfaces Spherical harmonics (SHs) are classical “special functions” z r=(r, θ,φ) θ r y φ x SHs are products of Legendre polynomials and circular functions: Real SHs: y lm ( θ, φ ) = P lm ( θ ) cos m φ + P lm ( θ ) sin m φ Y lm ( θ, φ ) = P lm ( θ ) e im φ Complex SHs: � � Orthogonal: y lm y kj d Ω = Y lm Y kj d Ω = δ lk δ mj j R ( l ) y lm ( θ ′ , φ ′ ) = � Rotation: jm ( α, β, γ ) y lj ( θ, φ ) 8 / 29

Spherical Harmonic Molecular Surfaces Use spherical harmonics (SHs) as orthogonal shape “building blocks” Reals SHs y lm ( θ, φ ) , and coeffcients a lm Encode distance from origin as SH series: L l � � r ( θ, φ ) = a lm y lm ( θ, φ ) l = 0 m = − l Calculate coefficients by numerical integration Good for shape-matching, not so good for docking... Ritchie and Kemp (1999), J. Comp. Chem. 20, 383–395 9 / 29

Docking Needs 3D Polar Fourier Representation Special orthonormal Laguerre-Gaussian radial functions, R nl ( r ) R nl ( r ) = N ( q ) nl e − ρ/ 2 ρ l / 2 L ( l + 1 / 2 ) ρ = r 2 / q , n − l − 1 ( ρ ); q = 20 . � � 1 ; r ∈ surface skin 1 ; r ∈ protein atom σ ( r ) = τ ( r ) = 0 ; otherwise 0 ; otherwise n − 1 N l � � � a σ Polar Fourier polynomial: σ ( r ) = nlm R nl ( r ) y lm ( θ, φ ) n = 1 l = 0 m = − l N T ( | m | ) a σ ′ � nl , n ′ l ′ ( R ) a σ Analytic translations: nlm = (1) n ′ l ′ m n ′ l ′ 10 / 29

SPF Protein Shape-Density Reconstruction N � a τ Interior density: τ ( r ) = nlm R nl ( r ) y lm ( θ, φ ) nlm Image Order Coeffs A Gaussians - B N = 16 1,496 C N = 25 5,525 D N = 30 9,455 Ritchie (2003), Proteins Struct. Funct. Bionf. 52, 98–106 11 / 29

Protein Docking Using SPF Density Functions � Favourable: ( σ A ( r A ) τ B ( r B ) + τ A ( r A ) σ B ( r B )) d V � Unfavourable: τ A ( r A ) τ B ( r B ) d V � Score: S AB = ( σ A τ B + τ A σ B − Q τ A τ B ) d V , Penalty Factor: Q = 11 � � � �� Orthogonality: S AB = a σ nlm b τ nlm + a τ b σ nlm − Qb τ nlm nlm nlm Search: 6D space = 1 distance + 5 Euler rotations: ( R , β A , γ A , α B , β B , γ B ) Ritchie and Kemp (2000), Proteins Struct. Funct. Bionf. 39, 178–194 12 / 29

Hex SPF Correlation Example – 3D Rotational FFTs Set up 3D rotational FFT as a series of matrix multiplications: t = − l R ( l ) ′ nlm = � l Rotate: mt ( 0 , β A , γ A ) a lt a kj T ( | m | ) nlm = � N ′′ ′ Translate: a nl , kj ( R ) a kjm ′′ nlt U ( l ) t b nlt U ( l ) Real to complex: A nlm = � t a tm , B nlm = � tm nl A ∗ nlm B nlv Λ um Multiply: C muv = � lv muv C muv e − i ( m α B + 2 u β B + v γ B ) 3D FFT: S ( α B , β B , γ B ) = � On one CPU, docking takes from 15 to 30 minute... 13 / 29

Exploiting Proir Knowledge in SPF Docking Knowing just one key residue can reduce search space enormously... This accelerates calculation and helps to reduce false-positives... 14 / 29

Docking Very Large Molecules Using Multi-Sampling Example: docking an antibody to the VP2 viral surface protein 15 / 29

The CAPRI Experiment CAPRI = “Critical Assessment of PRedicted Interactions” Predictor Software Algorithm T1 T2 T3 T4 T5 T6 T7 Abagyan ICM FF ** *** ** Camacho CHARMM FF * *** *** Eisenstein MolFit FFT * * *** Sternberg FTDOCK FFT * ** * Ten Eyck DOT FFT * * ** Gray MC ** *** Ritchie Hex SPF ** *** Weng ZDOCK FFT ** ** Wolfson BUDDA/PPD GH * *** Bates Guided Docking FF - - - *** Palma BIGGER GF - - ** * Gardiner GAPDOCK GA * * - - - - - Olson Surfdock SH * - - - - Valencia ANN * - - - - - - Vakser GRAMM FFT * - - - - ∗ low, ∗∗ medium, ∗ ∗ ∗ high accuracy prediction; − no prediction Mendez et al. (2003) Proteins Struct. Funct. Bionf. 52, 51–67 16 / 29

Hex Protein Docking Example – CAPRI Target 3 Example: best prediction for CAPRI Target 3 – Hemagglutinin/HC63 Ritchie and Kemp (2000), Proteins Struct. Funct. Bionf. 39, 178–194 Ritchie (2003), Proteins Struct. Funct. Genet. 52, 98–106 17 / 29

Best Hex Orientation for Target 6 – Amylase/AMD9 CAPRI “high accuracy” (Ligand RMSD ≤ 1˚ A) 18 / 29

Subsequent CAPRI Targets 8 – 19 Target Description Comments T8 Nidogen- γ 3 - Laminin U/U build from monomer – 12˚ T9 LiCT homodimer A RMS deviation build from monomer – 11˚ T10 TBEV trimer A RMS deviation T11 Cohesin - dockerin U/U; model-build dockerin T12 Cohesin - dockerin U/B SAG1 conformational change: 10˚ T13 SAG1 - antibody Fab A RMS T14 MYPT1 - PP1 δ U/U; model-build PP1 α → PP1 δ T18 TAXI - xylanase U/B T19 Ovine prion - antibody Fab model-build prion T15-T17 cancelled: solutions were on-line & found by Google !! T11, T14, T19 involved homology model-building step... 19 / 29

CAPRI Results: Targets 8–19 (2003 – 2005) Software T8 T9 T10 T11 T12 T13 T14 T18 T19 ICM ** * ** *** * *** ** ** PatchDock ** * * * * - ** ** * ZDOCK/RDOCK ** * *** *** *** ** ** FTDOCK * * ** * ** ** * RosettaDock - ** *** ** *** *** SmoothDock ** *** *** ** ** * RosettaDock *** - - ** *** ** Haddock - - ** ** *** *** ClusPro ** *** * * 3D-DOCK ** * * ** * MolFit *** * *** ** Hex ** *** * * Zhou - - - *** ** * * DOT *** *** ** ATTRACT ** - - - - *** ** Valencia * * * - - GRAMM - - - - - ** ** Umeyama ** * Kaznessis - - *** Fano - - * Mendez et al. (2005) Proteins Struct. Funct. Bionf. 60, 150-169 20 / 29

“Hex” and “HexServer” Hex: interactive docking ( ∼ 33,000 downloads) – http://hex.loria.fr/ Hexserver ( ∼ 1,000 docking jobs/month) – http://hexserver.loria.fr/ Ritchie and Kemp (2000), Proteins 39 178–194 ... Macindoe et al. (2010), Nucleic acids Research, 38, W445–W449 21 / 29

Hex Modeling Protein Docking Using Polar Fourier Correlations Dave - PowerPoint PPT Presentation

Hex Modeling Protein Docking Using Polar Fourier Correlations Dave Ritchie Team Orpailleur Inria Nancy Grand Est Outline Basic Principles of Docking Fast Fourier Transform (FFT) Docking Methods Hex Polar Fourier Correlation Method

Hex Modeling Protein Docking Using Polar Fourier Correlations The CAPRI Experiment Demo:

Protein Docking and 3D Ligand-Based Virtual Screening Schedule Lecture 1 Rigid Body

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

Chapter 4 Chapter 4 The Fourier Series and The Fourier Series and Fourier Transform Fourier

Chapter 4 Chapter 4 The Fourier Series and The Fourier Series and Fourier Transform Fourier

Research on polar bears at Norwegian Polar Tracking the Polar Bear Institute Dr. Jon Aars,

Integrative modeling of ! General aspects of docking ! Information-driven docking with HADDOCK

Protein Docking and 3D Ligand-Based Virtual Screening Modeling Protein Flexibility Using Elastic

Fourier Series Fourier Sine Series Fourier Cosine Series Fourier Series Convergence

Topic 5: Discrete-Time Fourier Transform (DTFT) o DT Fourier Transform o Overview of Fourier

Protein Sequence Analysis Protein Sequence Analysis Protein sequence motifs Protein sequence

Protein Docking Amit P. Singh Biochemistry 218/MIS 231 November 30, 1998 Why is Docking

Polar Tankers - ConocoPhillips Five Endeavour Class Tankers Built in 2001-2006 30,000

Polar Decomposition of a Matrix Garrett Buffington May 4, 2014 The Polar Decomposition SVD and

Topic 4: Continuous-Time Fourier Transform (CTFT) o Introduction to Fourier Transform o Fourier

KBDOCK A Case-Based Reasoning Approach for Protein Docking Dave Ritchie Team Orpailleur

TOTAL PANCREATECTOMY AND ISLET AUTOTRANSPLANTATION Disclosure AS TREATMENT FOR CHRONIC

Minor International PCL Minor International PCL 2Q06 Analyst Meeting Michael Sagild, COO Four

Webinar Overview ! ! Partnership between the Pennsylvania Department of Education and

Plutus: scalable secure file sharing on untrusted storage Mahesh Kallahalla HP Labs Joint work

PTAS for Euclidean Traveling Salesman and Other Geometric Problems Sanjeev Arora Journal of the

Traveling Salesman Problem Math 482, Lecture 35 Misha Lavrov May 1, 2020 The problem Subtour

21. Set cover and TSP Set covering Cutting problems and column generation Traveling

New Inapproximability Bounds for TSP Marek Karpinski, Michael Lampis and Richard Schmied ISAAC