g-INFO portal Doan Trung Tung Aurlien BERNARD, Ana Lucia DA-COSTA, - - PowerPoint PPT Presentation

g info portal
SMART_READER_LITE
LIVE PREVIEW

g-INFO portal Doan Trung Tung Aurlien BERNARD, Ana Lucia DA-COSTA, - - PowerPoint PPT Presentation

g-INFO portal Doan Trung Tung Aurlien BERNARD, Ana Lucia DA-COSTA, Vincent BLOCH, Thanh-Hoa LE, Yannick LEGRE, Lydia MAIGNE, Jean SALZEMANN, Hong-Quang NGUYEN, Vincent BRETON 1 Outline Introduction Overview of g-INFO


slide-1
SLIDE 1

1

g-INFO portal

Doan Trung Tung

Aurélien BERNARD, Ana Lucia DA-COSTA, Vincent BLOCH, Thanh-Hoa LE, Yannick LEGRE, Lydia MAIGNE, Jean SALZEMANN, Hong-Quang NGUYEN, Vincent BRETON

slide-2
SLIDE 2

2

Outline

 Introduction  Overview of g-INFO  Implementation of g-INFO  Conclusions and perspectives

slide-3
SLIDE 3

3

Why g-INFO?

 H5N1 (avian flu)

262 deaths 436 cases

WHO - July 2009

287 deaths 486 cases

WHO – March 2010

slide-4
SLIDE 4

4

Influenza surveillance

 Data collection

  • BioHealthBase
  • NCBI
  • LosAlamos

 Data processing in batch mode

  • General phylogenetic

pipelines

  • Specific phylogenetic

pipelines  Deployment of phylogenetic tools on clusters / grids g-INFO: Grid-based International Network for Flu Oservation

slide-5
SLIDE 5

5

g-INFO’s overview Global Surveillance Network

slide-6
SLIDE 6

g-INFO’s goals

6

 Integration of influenza virus data sources into a federation of databases  Automatic phylogenetic pipelines  Specific molecular epidemiology studies

slide-7
SLIDE 7

7

Architecture of g-INFO system

  • Each data provider has

its own server(s) to store his data

  • Data provider export
  • nly selected data to a

data grid interface server

  • The data exported is

integrated in a common schema on the interface servers

  • Providers can keep the

privilege of granting access rights to their data

slide-8
SLIDE 8

8

Architecture of g-INFO system

  • Epidemiologic pipelines

will be deployed on the grid

 BLAST  Alignment  Phylogenetic trees  Visualisation  ... and more

slide-9
SLIDE 9

9

g-INFO’s implementation Phylogenetic workflow

slide-10
SLIDE 10

10

Data collection

>ABV25634 MKAILLVLLCAFAATNADTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLGGIAPLQLG KCNIAGWLLGNPECDLLLTVSSWSYIVETSNSDNGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKTSSW PNHETTRGVTAACPYAGASSFYRNLLWLVKKENSYPKLSKSYVNNKGKEVLVLWGVHHPPTSTDQQSLYQ NADAYVSVGSSKYDRRFTPEIAARPKVRGQAGRMNYYWTLLEPGDTITFEATGNLVAPRYAFALNRGSES GIITSDAPVHDCDTKCQTPHGAINSSLPFQNIHPVTIGECPKYVKSTKLRMVTGLRNIPSIQSRGLFGAI AGFIEGGWTGLIDGWYGYHHQNGQGSGYAADQKSTQNAIDGITNKVNSVIEKMNTQFTVVGKEFNNLERR IKNLNKKVDDGFLDVWTYNAELLVLLENERTLDFHDSNVKNLYEKARSQLRNNAKEIGNGCFEFYHKCDD ACMESVRNGTYDYPKYSEESKLNREEIDGVKLESMMVYQILAIYSTVASSLVLLVSLGAISFWMCSNGSL QCRICI

Grid DB

Daily updates

FTP NCBI

Metadata Sequences

Protein, Nucleotide, Coding region

IDs

slide-11
SLIDE 11

11

WISDOM Production Environment

slide-12
SLIDE 12

Integration of g-INFO into WPE

12 Data collection Data service Amga MUSCLE PhyML Gblocks BLAST Task Manager Job Manager WISDOM Information System Amga Job Submitter g-INFO Data Manager

slide-13
SLIDE 13

Automatic phylogenetic pipeline

13 g-INFO database g-INFO pipeline Task Manager Job Job Manager Job Job Wisdom IS g-INFO portal

slide-14
SLIDE 14

14

Automatic phylogenetic pipeline

> Run daily a phylogenetic workflow on the grid

AMGA

Metadata Sequences

Protein, Nucleotide, Coding region

IDs

Grid portal

Prepare Data in correct format

Alignment + Curation (Muscle + Gblocks)

NCBI

Phylogenetic Analysis (PhyML) Visualisation tool

slide-15
SLIDE 15

Manual phylogenetic workflow

15 g-INFO database MOTEUR web services Task Manager Job Job Manager Job Job Wisdom IS g-INFO portal MOTEUR desktop tool

slide-16
SLIDE 16

Workflow execution example

16

slide-17
SLIDE 17

17

g-INFO portal Phylogenetic workflow

slide-18
SLIDE 18

g-INFO portal

18

 Collaboration among IFI, IOIT and HPC:  IFI: web services to interact between the portal and the system  IOIT: design and develop the portal  HPC: visualization tool  Technologies:  JSF 2.0, Ajax, web services, Java aplet, …

slide-19
SLIDE 19

g-INFO portal

19 g-INFO database MOTEUR web services Task Manager Job Job Manager Job Job Wisdom IS g-INFO portal Intermediate web services JSF 2.0 Web services Ajax JDBC

slide-20
SLIDE 20

20

g-INFO portal – home page

slide-21
SLIDE 21

21

g-INFO portal - search

slide-22
SLIDE 22

22

g-INFO portal – search results

slide-23
SLIDE 23

23

g-INFO portal – search results

slide-24
SLIDE 24

24

g-INFO portal – working sessions

slide-25
SLIDE 25

25

g-INFO portal – define working session template

slide-26
SLIDE 26

26

g-INFO portal – define working session template

slide-27
SLIDE 27

27

g-INFO portal – define working session template

slide-28
SLIDE 28

28

g-INFO portal – define working session template

slide-29
SLIDE 29

29

g-INFO portal – run working session

slide-30
SLIDE 30

30

g-INFO portal – run working session

Pipeline01 WorkingSession Pipeline02 Pipeline03 Pipeline04 PipelineN Input01 Input02 Input03 inputN result01 result02 result03 result04 resultN

slide-31
SLIDE 31

31

g-INFO portal – visualization

slide-32
SLIDE 32

32

Conclusions

 A success in terms of international collaboration  An example of developping grid application in Vietnam A complementary service for the public health research community

slide-33
SLIDE 33

33

Perspectives

 Provide more tools and pipelines  Import other database resources  Improve system’s performance  We are expecting the research community to contribute with more useful tools  Can be applied for other emerging diseases

slide-34
SLIDE 34

Thank you!

34