Personal Genomes Project as a potential EGI community

Oct 19, 2023

SLIDE 1

Personal Genomes Project as a potential EGI community

Next Generation Federated HPC infrastructure to drive international genome discovery

Peter Walgemoed Carelliance & Dutch Health Hub

http://DHH-IPC.nl peterwalgemoed@gmail.com

Presented by Ad Emmen, Dutch Health Hub & Contrail

SLIDE 2

Community: Personal Genomes Project

http://www.personalgenomes.org

SLIDE 3

ARVADOS

Open source platform for managing and analyzing biomedical big data

Usage catching on in genome community

http://arvados.org

SLIDE 5

Challenges

1. Store and organize hundreds of terabytes of large files with multiple metadata schemas

2. Run informatics analyses that do distributed computations on very large datasets

3. Do real-time, high-performance queries on compact genome data (e.g. variants)

4. Ensure validity and maintain provenance on all data in the system over time

5. Make it easy to reproduce pipelines exactly as they were run in the past

6. Protect all data with flexible access control rules and strong encryption

7. Share large data sets between data centers and organizations without physically moving data
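Challenges 4 and 5 (validity, provenance, exact reproducibility) can be illustrated with content addressing, where every blob is stored under the hash of its own bytes. The slides do not say how these challenges are met, so the following is only a minimal sketch under that assumption: `ContentStore` and `record_run` are hypothetical names, the store is a toy in-memory dictionary, and SHA-256 is an arbitrary choice of hash.

```python
import hashlib
import json


class ContentStore:
    """Toy in-memory content-addressed store (hypothetical, for illustration)."""

    def __init__(self):
        self._blobs = {}

    def put(self, data: bytes) -> str:
        # The address of a blob is the SHA-256 digest of its bytes,
        # so identical content always gets the same address.
        digest = hashlib.sha256(data).hexdigest()
        self._blobs[digest] = data
        return digest

    def get(self, digest: str) -> bytes:
        # Re-hash on read: a mismatch means the stored bytes changed.
        data = self._blobs[digest]
        if hashlib.sha256(data).hexdigest() != digest:
            raise ValueError("stored data no longer matches its address")
        return data


def record_run(store: ContentStore, script: bytes, input_ids: list, output: bytes) -> str:
    """Store a provenance record linking a script and its inputs to an output."""
    record = {
        "script": store.put(script),
        "inputs": sorted(input_ids),
        "output": store.put(output),
    }
    # The record is itself content-addressed, so it cannot be altered silently.
    return store.put(json.dumps(record, sort_keys=True).encode("utf-8"))
```

Because the provenance record refers to the script, inputs, and output by hash, re-running the same script on the same inputs and getting the same output yields the same record address, while any change to a stored blob is detected on read.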

SLIDE 6

Arvados technology

“Keep”: Data Storage & Management

“Crunch”: Distributed Computation

“Lightning”: Real-Time Analysis of Genomic Data

Provenance, Analysis, Governance, Sharing

Application Framework (APIs and SDKs)

Arvados Apps: Cancer Diagnostics, Variant Visualization, etc.

Arvados Cloud: Cloud Operating System

Private Cloud, Public Cloud

SLIDE 7

Arvados technology

“Keep”: Data Storage & Management

“Crunch”: Distributed Computation

“Lightning”: Real-Time Analysis of Genomic Data

Provenance, Analysis, Governance, Sharing

Application Framework (APIs and SDKs)

Arvados Apps: Cancer Diagnostics, Variant Visualization, etc.

Arvados Cloud: Cloud Operating System

Private Cloud, Public Cloud, EGI Fed Cloud

Investigate integration with EGI Federated Cloud

SLIDE 8

Trusted Digital Repositories

E-Discovery Data & Information Services

Dutch Breast Cancer Data Collection Catalogue

Breast cancer (BC) data collections: UMC Data, Hospital Data, LRCB Data, General Practitioner Data, BVN Data, CBS Data, NBCA Data, HEBON Data, Personal Health Record Data, IKNL Data

SLIDE 9

Service Marketplaces

Dutch Health Hub start

Layers: Information, Data/app, IT-infra

Infrastructure-as-a-Service (IaaS) marketplace

SLIDE 10

Some Contrail tools for Dutch Health Hub Marketplace

HealthHub Data-as-a-Service layer

Marketplace Data Service Interface

Marketplace user interface

Portal (RESTful) APIs

Marketplace services (DHH Platform)

Gateway services for specific uses

Data enrichment services

IaaS services (computing, storage, networking, ...)

SLIDE 11

Some Contrail tools for Dutch Health Hub Marketplace

HealthHub Data-as-a-Service layer

Marketplace Data Service Interface

Marketplace user interface

Portal (RESTful) APIs

Marketplace services (DHH Platform)

Gateway services for specific uses

Data enrichment services

IaaS services (computing, storage, networking, ...)

Provider Federation

Federated identity management

SLA

XtreemFS Cloud file system

ConPaaS: Web, BoT, NoSQL, SQL, Hadoop

SLIDE 12

Next steps

1. Organise interest in community in Europe

2. Implement test environment as part of Dutch Health Hub

3. Investigate integration with EGI Federated Cloud

SLIDE 13

SLIDE 14

End