e-In Infrastructure for th the Multi- Sc Scale Complex Genomics - - PowerPoint PPT Presentation

e in infrastructure for th the multi sc scale complex
SMART_READER_LITE
LIVE PREVIEW

e-In Infrastructure for th the Multi- Sc Scale Complex Genomics - - PowerPoint PPT Presentation

e-In Infrastructure for th the Multi- Sc Scale Complex Genomics VRE Jose Josep Ll Ll. . Gelp Gelp BSC BSC - UB UB https://vre.multiscalegenomics.eu DI4R Brussels 30 Nov This project has received funding from the European Unions


slide-1
SLIDE 1

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 676556.

e-In Infrastructure for th the Multi- Sc Scale Complex Genomics VRE

Jose Josep Ll Ll. . Gelp Gelpí BSC BSC - UB UB

https://vre.multiscalegenomics.eu DI4R Brussels 30 Nov

slide-2
SLIDE 2

Why MuG: The two Genomics worlds

2

Structural Biology Biochemistry Biophysics,… Output  structures  ensembles  Chem. mechanisms Molecular Biology Cell Biology,… Output  Sequences  Images  Functions

slide-3
SLIDE 3

Why MuG: The dream

3

slide-4
SLIDE 4

The Problems

MuG VRE

Tools Visualization Data

4

Interoperability Fast-evolving->immature Usability Lack of standards No FAIR data Formats & Types Disconnected Undigested Unfriendly

Methods developers HPC facilities Biologists

slide-5
SLIDE 5

Users need to forget about the infrastructure …

State-of-the-art Tools and workflows Friendly environment. Known interfaces. Only scientific decisions needed Integrated workspace. Data and tools in a single portal Hidden (and scalable) infrastructure Stable and Sustainable ecosystem

5

slide-6
SLIDE 6

Design Guidelines

Flexible and easy to deploy research platform Software scheduler(s) to manage infrastructure and tools Multi-scale execution (Cluster to HPC) Web & Programmatic access, easy user access and support Procedure to integrate analysis, simlation, and visualization tools Compatible with European e-infrastructures

6

First release of MuG VRE 15th November

slide-7
SLIDE 7

User perspective: Authentication

7

slide-8
SLIDE 8

User perspective: the Workspace

8

File system layout Rich set of Data types and Formats Intuitive Toolkits for Data mngt. Analysis Visualization

slide-9
SLIDE 9

Prot DNA Complexes Binding sites

User perspective: Tools

9

CG NA Simulation Genome indexes ChipSeq Analysis HiC analysis Nucleosome positioning

slide-10
SLIDE 10

User perspective: Visualizing data

10

slide-11
SLIDE 11

MuG VRE Backend

11

Web Access Users Web Services Galaxy interface Resource Management COMPSs Stand- alone apps. Programming Model Execution Service SGE

Apps

HPC Tools Repository User Workspace Data Access API User Workspace Data Access API User support Public Repos metadata

slide-12
SLIDE 12

Tool execution life cycle (metadata driven)

12

slide-13
SLIDE 13

Tool execution life cycle

13

Execution scheduler Python wrappers to enclose tools

slide-14
SLIDE 14

Data management (aim)

14

Tool’s execution

?

Public Repos

Single virtual data space

slide-15
SLIDE 15

Linking to e-infrastructures

15 15

EBI Repos MuG @ BSC/IRB Shared storage MuG Repos User interface

slide-16
SLIDE 16

The MuG’s team

16

https://youtu.be/DRtf7b6M9WY Laia Codo (BSC) Genis Bayarri (IRB) Marco Pasi (UNot) Mark McDowall (EBI)

slide-17
SLIDE 17

Register now!!

17

slide-18
SLIDE 18

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 676556.

@MuG_genomics irbinfo.mug@irbbarcelona.org www.multiscalegenomics.eu

Thank you!