genomicsandhealth.org
CanDIG
Distributed na0onal analyses of locally- controlled genomic data h:p://distributedgenomics.ca
1
CanDIG Distributed na0onal analyses of locally- controlled genomic - - PowerPoint PPT Presentation
CanDIG Distributed na0onal analyses of locally- controlled genomic data h:p://distributedgenomics.ca 1 genomicsandhealth.org Canadian Distributed Infrastructure for Genomics (CanDIG) New (start date: this spring) 4-year funded Canadian project
genomicsandhealth.org
1
genomicsandhealth.org
Canadian Distributed Infrastructure for Genomics (CanDIG)
2
New (start date: this spring) 4-year funded Canadian project to enable batch and interac=ve analysis over na=onal cohorts with provincially controlled private genomic data - send analyses to data.
genomicsandhealth.org
CanDIG:
3
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
PlaBorm Goals - Fully Distributed:
to data, source of user requests
apps available, project membership, etc.
data
4
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
PlaBorm Goals - API access:
logging, audibility; no processes dropped in directory of files.
data stores (files, variant data bases, etc)
communica=ng internally via htsget (Large- Scale Genomics)
Pheno Data Capture)
5
Variants
Workflows
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
PlaBorm Goals - AAI:
Connect
based on remote ID and distributed role informa=on
amongst services
interoperability with DURI
6
? ! ? !
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - interac0ve analysis
thousand genomes figures across federated datasets - small regions for interac=vity
7
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - interac0ve analysis
thousand genomes figures across federated datasets - small regions for interac=vity
8
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - interac0ve analysis
thousand genomes figures across federated datasets - small regions for interac=vity
9
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - interac0ve analysis
thousand genomes figures across federated datasets - small regions for interac=vity
10
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - interac0ve analysis
performance
(e.g.) FORMAT fields
aggrega0on, filtering queries will be needed
11
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - differen0al privacy
for introducing (e.g.) differen=al privacy
data they might not otherwise
differen=al privacy over R&V API:
privacy model?
have different privacy requirements?
12
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Work so far - authen0ca0on
authen=ca=on for R&V server
13
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Current work - PROFYLE
project
simple analyses (joint variant calling at locus)
need con=nued work
14
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Current work - CaMPACT
explora=on
cBioPortal
15
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Coming months
& Pheno capture)
16
Canadian Distributed Infrastructure for Genomics (CanDIG)
genomicsandhealth.org
Longer-term work
just mapped loca=on
data use/authoriza=on
access models
17
Canadian Distributed Infrastructure for Genomics (CanDIG)