Introduc)on to Chroma)n IP sequencing (ChIP-seq) data

Introduc)on ¡to ¡Chroma)n ¡IP ¡– ¡ sequencing ¡(ChIP-‑seq) ¡data ¡analysis ¡ Introduc)on ¡to ¡Bioinforma)cs ¡using ¡NGS ¡data ¡ ¡ Linköping, ¡21 ¡April ¡2016 ¡ ¡ Agata ¡Smialowska ¡ BILS ¡/ ¡NBIS, ¡SciLifeLab, ¡Stockholm ¡University ¡

Chroma)n ¡state ¡and ¡gene ¡expression ¡ PEV ¡ Posi)on ¡effect ¡ variega)on ¡ in ¡Drosophila ¡eye ¡ (nature.com) ¡ First ¡observed ¡by ¡ H. ¡Muller ¡ 1930 ¡ Juxtaposi)on ¡of ¡eye ¡colour ¡genes ¡with ¡heterochroma)n ¡results ¡in ¡the ¡“moWled” ¡eye ¡ coloura)on ¡(red ¡and ¡white). ¡ ¡ Proteins, ¡which ¡bind ¡heterochroma)n, ¡act ¡to ¡“spread” ¡the ¡silencing ¡signal ¡by ¡ providing ¡a ¡forward ¡feedback ¡loop. ¡ ¡ Heterochroma)n ¡Protein ¡1; ¡Histone ¡methyltransferase ¡Su(var)3-‑9; ¡H3K9 ¡ methyla)on ¡

Chroma)n ¡immunoprecipita)on ¡ RnDsystems ¡

Applica)ons ¡ General ¡transcrip)on ¡machinery ¡

Applica)ons ¡ Promoter-‑associated ¡ transcrip)on ¡factors ¡

Applica)ons ¡ Distal ¡enhancers ¡

Applica)ons ¡ Histone ¡modifica)ons ¡ and ¡variants ¡ Ac)va)on ¡states ¡ Co-‑factors ¡

Workflow ¡of ¡a ¡ChIP-‑seq ¡study ¡ design ¡study ¡ ¡ obtain ¡input ¡chroma)n ¡ ¡ perform ¡precipita)on ¡ ¡ construct ¡library ¡ ¡ sequence ¡library ¡ ¡ bioinforma1c ¡analysis ¡

ChIP-‑seq ¡workflow ¡ Liu, ¡PoW ¡and ¡Huss, ¡BMC ¡Biology ¡2010 ¡

Cri)cal ¡factors ¡ • An)body ¡selec)on ¡ • Library ¡cloning ¡and ¡sequencing ¡ • Algorithm ¡for ¡peak ¡detec)on ¡ • Proper ¡control ¡sample ¡(input ¡chroma)n ¡or ¡mock ¡IP) ¡ • Reproducibility ¡in ¡chroma)n ¡fragmenta)on ¡ • Cross-‑linker ¡choice ¡ • Enough ¡material ¡and ¡biological ¡replicates ¡

Experiment ¡design ¡ • Sound ¡experimental ¡design: ¡replica)on, ¡randomisa)on ¡and ¡ blocking ¡(R.A. ¡Fisher, ¡1935) ¡ • In ¡the ¡absence ¡of ¡a ¡proper ¡design, ¡it ¡is ¡essen)ally ¡impossible ¡ to ¡par))on ¡biological ¡varia)on ¡from ¡technical ¡varia)on ¡ • Sequencing ¡depth: ¡depends ¡on ¡the ¡structure ¡of ¡the ¡signal; ¡ cannot ¡be ¡linearly ¡scaled ¡to ¡genome ¡size ¡ • Single-‑ ¡vs. ¡paired-‑end ¡reads: ¡PE ¡improves ¡read ¡mapping ¡ confidence ¡and ¡gives ¡a ¡direct ¡measure ¡of ¡fragment ¡size, ¡which ¡ otherwise ¡has ¡to ¡be ¡modelled ¡or ¡es)mated ¡

Experiment ¡design ¡ Ideal ¡design: ¡ ¡ ChIP ¡ input ¡ library/sequencing ¡ replicates ¡ X ¡ Each ¡sample ¡has ¡a ¡matched ¡input ¡ Input ¡sequenced ¡to ¡a ¡comparable ¡depth ¡ ¡ as ¡IP ¡sample ¡ ¡ ChIP ¡ ≥2 ¡biological ¡replicates ¡for ¡site ¡iden)fica)on ¡ input ¡ replicates ¡ library/sequencing ¡ ≥3 ¡biological ¡replicates ¡for ¡differen)al ¡binding ¡ X ¡ ChIP ¡ under-‑sequenced ¡input ¡ ChIP ¡ input ¡ library/sequencing ¡ replicates ¡ ✓ ¡ ChIP ¡ well-‑sequenced ¡input ¡

Importance ¡of ¡biological ¡replicates ¡ libraries ¡ sequencing ¡ X ¡ sample ¡ technical ¡replicates ¡are ¡generally ¡a ¡waste ¡of ¡)me ¡ and ¡money ¡ ¡ samples ¡ replicates ¡ libraries ¡ sequencing ¡ X ¡ many ¡studies ¡do ¡not ¡account ¡for ¡batch ¡ origin ¡ effects ¡ experiment ¡ i. )me ¡ ii. origin ¡ so ¡if ¡you ¡care ¡about ¡reproducibility ¡ ✓ ¡ experiment1 ¡ experiment2 ¡ Experiment3… ¡ libraries, ¡sequencing, ¡etc ¡ )me ¡-‑-‑-‑-‑-‑-‑-‑> ¡

Importance ¡of ¡sequencing ¡depth ¡ actual ¡replicates ¡ pooled ¡data ¡ X ¡ ✓ if ¡you ¡need ¡to ¡pool ¡your ¡data, ¡then ¡it ¡is ¡under-‑sequenced ¡ under-‑sequenced ¡data ¡ pooled ¡data ¡

Sequencing ¡depth ¡depends ¡on ¡data ¡type ¡ Chroma)n ¡ ¡ Transcrip)on ¡ Chroma)n ¡ ¡ Remodellers ¡ Factors ¡ Remodellers ¡ ¡ Histone ¡marks ¡ ¡ Histone ¡marks ¡ ¡ RNA ¡polymerase ¡II ¡ point-‑source ¡ mixed ¡signal ¡ broad ¡signal ¡ TF: ¡20 ¡M ¡ ? ¡ Human: ¡ ? ¡ H3K4me3: ¡25 ¡M ¡ H3K27me3: ¡40 ¡M ¡ H3K36me3: ¡35 ¡M ¡ H3K9me3: ¡>55 ¡ ¡M ¡ No ¡clear ¡guidelines ¡for ¡mixed ¡and ¡broad ¡type ¡of ¡peaks ¡ Source: ¡The ¡ENCODE ¡consor)um; ¡ ¡Jung ¡et ¡al, ¡NAR ¡2014 ¡

The ¡ ENCODE ¡(Encyclopedia ¡of ¡DNA ¡Elements) ¡Consor)um ¡and ¡the ¡ Roadmap ¡Epigenomics ¡ Consor)um ¡are ¡a ¡vast ¡resource ¡of ¡various ¡ kinds ¡of ¡func)onal ¡genomics ¡data ¡(as ¡well ¡as ¡RNA-‑seq ¡data). ¡ ¡

• ChIP ¡– ¡sequencing: ¡introduc)on ¡from ¡a ¡ bioinforma)cs ¡point ¡of ¡view ¡ ¡ • Principles ¡of ¡analysis ¡of ¡ChIP-‑seq ¡data ¡ • ChIP-‑seq: ¡downstream ¡analyses ¡ • Resources ¡ • Exercise ¡overview ¡

Chroma)n ¡= ¡DNA ¡+ ¡proteins ¡ Park, ¡Nature ¡Rev ¡Gene)cs, ¡2009 ¡ ¡

Data ¡analysis ¡

Profile ¡of ¡protein ¡binding ¡sites ¡vs. ¡input ¡ Chromator ¡( Drosophila ) ¡– ¡protein ¡binding ¡ methylated ¡histones ¡ Park, ¡Nature ¡Rev ¡Gene)cs, ¡2009 ¡ ¡

design ¡study ¡ Workflow ¡of ¡a ¡ChIP-‑seq ¡study ¡ ¡ obtain ¡input ¡chroma)n ¡ ¡ perform ¡precipita)on ¡ ¡ construct ¡library ¡ ¡ sequence ¡library ¡ ¡ library ¡quality ¡control ¡ filter ¡sequences ¡ ¡ align ¡sequences ¡ ¡ filter ¡alignments ¡ Itera)ve ¡process ¡ ¡ iden1fy ¡peaks ¡/ ¡regions ¡of ¡enrichment ¡ ¡ assess ¡data ¡quality ¡ ¡ understand ¡the ¡data ¡/ ¡results ¡ ¡ downstream ¡analyses ¡

• ChIP ¡– ¡sequencing: ¡introduc)on ¡from ¡a ¡ bioinforma)cs ¡point ¡of ¡view ¡ ¡ • Principles ¡of ¡analysis ¡of ¡ChIP-‑seq ¡data ¡ • ChIP-‑seq: ¡downstream ¡analyses ¡ • Resources ¡ • Exercise ¡overview ¡

Two ¡ques)ons ¡to ¡address ¡ • 1. ¡Did ¡the ¡ChIP ¡part ¡of ¡the ¡ChIP-‑seq ¡ experiment ¡work? ¡Was ¡the ¡enrichment ¡ successful? ¡ • 2. ¡Where ¡are ¡the ¡binding ¡sites ¡(of ¡the ¡protein ¡ of ¡interest)? ¡

Word ¡of ¡cau)on! ¡ ChIP-‑seq ¡experiments ¡are ¡more ¡unpredictable ¡ than ¡RNA-‑seq! ¡ Error ¡sources: ¡ ¡chroma)n ¡structure ¡ ¡PCR ¡over-‑amplifica)on ¡ ¡non-‑specific ¡an)body ¡ ¡other ¡things? ¡

ChIP-‑seq ¡QC: ¡did ¡the ¡ChIP ¡work? ¡ • 1. ¡Inspect ¡the ¡signal ¡(mapped ¡reads, ¡coverage ¡ profiles) ¡in ¡genome ¡browser ¡ • 2. ¡Compute ¡peak-‑independent ¡quality ¡metrics ¡ (cross ¡correla)on, ¡cumula)ve ¡enrichment) ¡ • 3. ¡Assess ¡replicate ¡consistency ¡(correla)ons ¡ between ¡replicates ¡of ¡the ¡same ¡condi)on) ¡

tag ¡density ¡distribu)on ¡ reproducibility ¡ similarity ¡of ¡coverage ¡ signal ¡at ¡known ¡sites ¡ … ¡ Sposng ¡inconsistencies ¡ Confounding ¡factors ¡ Under-‑sequenced ¡libraries ¡ … ¡

How ¡do ¡I ¡know ¡my ¡data ¡is ¡of ¡good ¡quality? ¡ Library ¡complexity ¡ Marinov ¡et ¡al, ¡G3 ¡2013 ¡ ¡

Quality ¡control: ¡tag ¡uniqueness ¡– ¡library ¡complexity ¡ metric ¡ Sequence ¡duplica)on ¡level ¡> ¡80% ¡(low ¡complexity ¡library) ¡ FastQC ¡ Babraham ¡Ins)tute ¡ NRF: ¡Non-‑redundant ¡frac)on ¡(of ¡reads): ¡propor)on ¡of ¡unique ¡tags ¡/ ¡total ¡ ¡ less ¡than ¡20% ¡of ¡reads ¡should ¡be ¡duplicates ¡for ¡10 ¡million ¡reads ¡sequenced ¡(ENCODE) ¡

Introduc)on to Chroma)n IP sequencing (ChIP-seq) data - PowerPoint PPT Presentation

Introduc)on to Chroma)n IP sequencing (ChIP-seq) data analysis Introduc)on to Bioinforma)cs using NGS data Linkping, 21 April 2016 Agata

A Brief Introduction to Chroma S. Collins, University of Regensburg Outline What is chroma?

Methods for Analyzing ChIP-Seq data Introduction to the ChIP-Seq server at SIB Lausanne Public

Introduction to Chromatin IP sequencing (ChIP-seq) data analysis Workshop on ChIP-seq data

ChIP-seq data analysis 04-05-12 Outlook Friday 04-05-12: Next-generation sequencing

Importing data Peter Humburg Statistician, Macquarie University DataCamp ChIP-seq Workflows in

Jen Grenier Director, TREx Facility Announcements New and Improved Project Submission Form

Learning Chroma-n States from ChIP-seq data Luca Pinello

RNA-seq basics: From reads to differential expression COMBINE RNA-seq Workshop RNA sequencing

The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu

Scaling normalisation for ChIP-seq with exogenous chromatin Workshop on ChIP-seq data analysis

Introduction to RNA-Seq Mary Piper Bioinformatics Consultant and Trainer DataCamp RNA-Seq

Introduc)on to the Analysis of RNA-seq Data Lecture

Predicting Chroma from Luma using Frequency Domain Intra Prediction in Codecs Based on Lapped

CS4405 JPEG Transform Coding JPEG Compression Workflow RGB Optional Chroma Subsample

Introduction to differential binding Peter Humburg Statistician, Macquarie University DataCamp

Outline Health Care Reform and Environment Changing Culture for Success in a White-

Introduction to Coaching - Enhancing Mentorship with Coaching Skills Aims & Objectives

R E T U R N O R I E N T E D P R O G R A M E V O L U T I O N with R O P E R Olivia Lucca

Computing and Big Data M Goals, Basics, and Models Antonio Corradi Academic year 2017/2018

Getting Started with Afterschool Meals May 15, 2019 3:00 PM ET Housekeeping Listen-only mode

Equip yourself for career success Introduction to the Student Employability Program Clare

Back to School Night Sample Title Sample Subtitle Weyanoke Elementary School September 21, 2020

Aug. 16, 2012 Yale LANS Live Streaming is a Major Internet App Yale LANS Poor Performance After

Sambuz

Useful Links

Newsletter

Mail Us