how is python used in biomolecular sciences
play

How is python used in biomolecular sciences? Antonia Mey - PowerPoint PPT Presentation

How is python used in biomolecular sciences? Antonia Mey antonia.mey@ed.ac.uk @ppxasjsm L. Hedges C. Woods Europython 2018 Edinburgh 25/07/2018 1 What is computational biosomething? 42 x10 -10 m X-ray


  1. How is python used in biomolecular sciences? Antonia Mey antonia.mey@ed.ac.uk @ppxasjsm L. Hedges C. Woods Europython 2018 — Edinburgh 25/07/2018 1

  2. What is computational biosomething? 42 x10 -10 m X-ray structure of a protein: set of x,y,z spatial coordinates of atoms � 2

  3. Typical questions tackled with MD How well do small molecules inhibit a protein’s many potential … functionality? compounds How do proteins fold? How do proteins interact with each other? � 3

  4. Timescales in biology Timescales of processes NMR Molecular Dynamics Other experimental techniques: e.g. cryo EM � 4

  5. Generate timeseries to compare to experiments F113 200 ns of protein dynamics time trace of an angle � 5

  6. What is MD? box + integrator + + h A i ensemble = h A i time Physics engine usually written in C++, to integrate Newtons equation of motion using leap frog type algorithms. � 6

  7. Typical MD workflow Download pdb pdb api pdb2gmx Commercial Cresset suite Schrödinger suite OpenMM tools Prep for simulation Amber tools RDkit […] GUI, TCL, bash, python, Perl … get creative Simulation setup OpenMM Python API Gromacs NAMD DLPoly mostly C++ Run simulation protocol commandline Amber SOMD Charmm […] Time-series analysis MDAnalysis md-analysis-tools PyEMMA Python MDAnalysis MDTraj MMTools […] TCL, bash, python, Perl … get creative � 7

  8. Typical file formats coordinates .pdb .crd .gro .xyz .mol2 trajectories .dcd .xtc .trr forcefields .psf .parm7 .itp And of course all MD programs can read all these file formats? — No � 8

  9. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? visualise coordinates convert to pdb format use a setup tool run simulation � 9

  10. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? ppxasjsm::azuma { ~/Documents/ } -> vmd dAla.gro visualise coordinates convert to pdb format use a setup tool run simulation or, use one of the many other tools… � 10

  11. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? visualise coordinates convert to pdb format use a setup tool or run simulation or …. � 11

  12. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? ppxasjsm::azuma { ~/Documents/ } -> FESetup setup.in visualise coordinates convert to pdb format use a setup tool run simulation taken from AMBER tutorial pretty much all the arrows are bash scripts with input files � 12

  13. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? ppxasjsm::azuma { ~/Documents/ } -> FESetup setup.in [globals] visualise coordinates forcefield = amber, ff99SBildn, tip3p [protein] basedir = protein file.name = protein.pdb molecules = ala convert to pdb format box.type = rectangular box.length = 10.0 align_axes = True neutralize = yes use a setup tool min.nsteps = 1000 min.restr_force = 10.0 min.restraint = notsolvent md.heat.nsteps = 1000 run simulation md.heat.restr_force = 10.0 md.heat.restraint = notsolvent md.constT.nsteps = 1000 md.constT.restr_force = 10.0 md.constT.restraint = notsolvent md.press.T = 298.0 md.press.nsteps = 50000 md.press.p = 1.0 � 13

  14. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? visualise coordinates convert to pdb format use a setup tool run simulation � 14

  15. Scenario: Simulate .gro file with Amber I have a coordinate .gro (Gromacs) file I want to simulate with Amber, how do I do this? visualise coordinates And another command line tool… convert to pdb format pmemd -O -i ala.in -p ala.parm7 use a setup tool -c ala.rst7 -o myout.mdout -x myout.nc -e myout.mdene -r run simulation myout.rst7 � 15

  16. Zoo of applications leads to hacky workflows - Most tools have organically grown from academic software with poor software practices - In order for tools to work with each other in complicated workflows a lot of hacky bash scripting is used by academic users - Users need to be experts in many different software with command line interfaces to interlink them - Many tools can do similar things and there may not be obvious solutions for one problem (google trap: try suggestions until one works) -> Loss of focus on the science � 16

  17. The same example as before BioSimSpace � 17

  18. Cloud server demo — using BioSimSpace docker run chryswoods/biosimspace http://130.61.69.221/hub/tmplogin http://130.61.69.221/ � 18

  19. BioSimSpace phase I Commercially Download pdb available software some manual prep Prep for simulation Manual BioSimSpace Simulation setup ccpbiosim.org FESetup L. Hedges siremol.org Sire/SOMD Run simulation protocol Sire/analyse_freenrg based on pymbar Time-series analysis freenrgworkflows C. Woods � 19

  20. API overview Core: Sire — Molecular library in C++ Trajectory Gateway IO Process Protocol MD � 20

  21. What is BioSimSpace — summary System Setup Protein Software Protein + ligand BioSimSpace pdb2gmx ligand in water tleap Trajectory generation ligand in organic FESetup solvent … MD Software … Interoperable tool for MC biomolecular Amber Enhanced simulations Gromacs sampling OpenMM … Simulation Analysis HTMD Plumed MSM Software … Perturbation pyemma Map gmx wham Reweighting pymbar … https://github.com/michellab/BioSimSpace … � 21

  22. Conclusions - Python API to write workflow components for Biomolecular simulations - Allows to focus on science and not software: No need to BioSimSpace become an expert at different MD packages, setup or analysis tools - Ease of use in the cloud, with scalable computing resources and future academic fool proof pricing model - Planned support for Knime and CWL workflow managers - A workflow engine BioSimSpace - A top down approach by trying to reinvent the wheel again - A ‘finished’ piece of software: It is very much in an alpha development stage with a large list of features and capabilities to be added in the future � 22

  23. Questions � 23

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend