EuroBioc 2019 – Brussels
Dario Righelli – PhD Istituto per le Applicazioni del Calcolo «M. Picone» – CNR - Napoli d.righelli@na.iac.cnr.it || dario.righelli@gmail.com
Epigenomic enrichment analysis using Bioconductor
drighelli
Epigenomic enrichment analysis using Bioconductor EuroBioc 2019 - - PowerPoint PPT Presentation
Epigenomic enrichment analysis using Bioconductor EuroBioc 2019 Brussels Dario Righelli PhD Istituto per le Applicazioni del Calcolo M. Picone CNR - Napoli d.righelli@na.iac.cnr.it || dario.righelli@gmail.com drighelli Whats
Dario Righelli – PhD Istituto per le Applicazioni del Calcolo «M. Picone» – CNR - Napoli d.righelli@na.iac.cnr.it || dario.righelli@gmail.com
drighelli
Compare methods and provide guidelines on epigenomic data analysis
Yijing Su et al. 2017 - Nature Neuroscience - Neuronal activity modifies the chromatin accessibility landscape in the adult brain
Before Fear Induction Condition (E0) After Fear Induction Condition (E1)
4 biological replicates 4 biological replicates } Catching differences in open chromatine regions
Home Cage Controls - Histon 3, Lysine 9 Acetilation (H3K9ac)
9 biological replicates} How many random differences are we able to catch inside a control dataset?
epigenomics data
ChIP-seq data coverages
plotCorrelation tool
coverages of the same samples on BWA and Bowtie2 bams have value of 1.
Peak Callers DESCan2 MACS2 CSAW Narrow Broad Peak Consensus & Matrices DESCan2 DiffBind CSAW Differential Enrichment edgeR
process
normalization
same results
normalization is required for this kind of data
ATAC-seq dataset
16652 11982 7956 7505 6597 6530 4491 976 976 654 599 523 514 344 282
5000 10000 15000
Intersection Size
DiffBindBroad CSAW DiffBindNarrow DEScan_Z10_K4_DARs 20000 40000
Set Size
ATAC−seq DARs
not-overlapping regions
regions by CSAW and DiffBind suggests a possible high-level of false positive regions detected.
H3K9ac ChIP-seq dataset
dataset of ChIP-seq H3K9ac samples
two groups
samples (126)
Differential Enriched Peaks on the random conditions.
found
2 4 6 8 D E S c a n 2 _ N
m D E S c a n 2 _ N
m D E S 2 _ M 2 _ B r
_ N
m D E S 2 _ M 2 _ B r
_ N
m D E S 2 _ M 2 _ N a r r _ N
m D E S 2 _ M 2 _ N a r r _ N
m D i f f B i n d _ N a r r _ N
m D i f f B i n d _ B r
d _ N
m
method nElem normalized
NO YES
On-going and future works
http://lists.moo.gs/mailman/listinfo/biocmeetup.naples napoli.r.bioc@gmail.com https://www.facebook.com/pg/NapoliRBiocMeetup
turn-over of attendees
Detection
Bowtie2 vs BWA
ATAC-seq dataset
73600 60794 24236 16120 13994 5854 4376 2879 2424 2122 1913 1857 1457 1046722 703 466 410 351 350 327 241 137 100 93 89 9 5 5 3 2 20000 40000 60000 80000
Intersection Size
DEScanMACSNarrow DEScanMACSBroad DiffBindMACSBroad DEScanZ10K4 DiffBindMACSNarrow 25000 50000 75000 100000 125000
Set Size
ATAC−seq Regions Nar/Broa & DEScan2
counts matrices show that there is no big differences between duplicates and no-duplicates samples
noDup_E0_1 noDup_E0_2 noDup_E0_3 noDup_E0_4 noDup_E1_1 noDup_E1_2 noDup_E1_3 noDup_E1_4 withDup_E0_1 withDup_E0_2 withDup_E0_3 withDup_E0_4 withDup_E1_1 withDup_E1_2 withDup_E1_3 withDup_E1_4
DEScan2
−1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1noDup_E0_1 noDup_E0_2 noDup_E0_3 noDup_E0_4 noDup_E1_1 noDup_E1_2 noDup_E1_3 noDup_E1_4 withDup_E0_1 withDup_E0_2 withDup_E0_3 withDup_E0_4 withDup_E1_1 withDup_E1_2 withDup_E1_3 withDup_E1_4
DiffBind
DiffBind
Dup_DEScan2 noDup_DEScan2 Dup_DiffBind noDup_DiffBind Final Peaks with/without Duplicates 10000 20000 30000 40000user-defined threshold
samples
in number of final peaks detected