GNU Guix
Reproducible genomics analysis pipelines with
使用 可重复性的 基因组学 分析管道 提供
- R. Wurmus, B. Uyar, B. Osberg, V. Franke,
- A. Gosdschan, K. Wreczycka, J. Ronen, A. Akalin
https://doi.org/10.1093/gigascience/giy123
GNU Guix R. Wurmus , B. Uyar, B. Osberg, - - PowerPoint PPT Presentation
Reproducible genomics analysis pipelines with GNU Guix R. Wurmus , B. Uyar, B. Osberg, V. Franke, https://doi.org/10.1093/gigascience/giy123 A. Gosdschan, K. Wreczycka, J. Ronen, A. Akalin
使用 可重复性的 基因组学 分析管道 提供
https://doi.org/10.1093/gigascience/giy123
a b 笔记本
a = 10ml b = 30ml Supplier: ACME Temp: 22 deg C
How hard could this possibly be?
to the rescue?
lack transparency
strawberry? whale oil?
Automate genomics analyses
RNAseq
U C G G A C A C C C G U A A AChIPseq single cell BSseq
1
Improve read quality Trim-Galore Align reads Bowtie2 Call peaks MACS2 ChIP QC & reproducibility ChIPQC + IDR Peak annotation genomation Compute read coverage R Scripts Check sequencing quality FastQC Pan-sample quality check MultiQC
Simple user interface
Settings Sample sheet
interactive reports browser tracks alignments QC reports sample clustering
2
Easy to install reproducibly
guix package
3
Reproducible package manager Full environment declarations Builds software in isolation
source / binary transparency
higher order source description lower-level binary application bundles
90%
Status
not reproducible minor problems reproducible
all pipelines PiGx BSseq PiGx ChIPseq PiGx RNAseq PiGx scRNAseq
~98%
Constrain software variables Containers are not transparent (smoothies) Guix builds software reproducibly and transparently PiGx shows that Guix makes reproducibility easy PiGx brings analysis to non-bioinformaticians
2 3 4 1 5
http://bioinformatics.mdc-berlin.de/pigx/ https://hpc.guixsd.org https://gnu.org/s/guix
#guix on irc.freenode.net
ricardo.wurmus@mdc-berlin.de
Improve read quality Trim-Galore Align reads STAR Quantify expression STAR / Salmon Analyze differential expression DESeq2 Find enriched GO terms g:ProfileR Compute read coverage Bedtools Check sequencing quality FastQC Pan-sample quality check MultiQC
Improve read quality Trim-Galore Align reads Bismark Call methylation methylkit Differential methylation methylkit Annotate DMRs and segments genomation Check sequencing quality FastQC Pan-sample quality check MultiQC Methylation segmentation methylkit
Improve read quality Trim-Galore Align reads STAR Determine cell number Dropbead Dropout rate and QC Scater Dimension reduction tSNE + PCA Compute read coverage Bedtools Check sequencing quality FastQC Pan-sample quality check MultiQC
headers sources build tools libraries ...
headers sources build tools libraries ...
cabba9e-samtools-1.7/ bin samtools lib ...