ChIP-seq analysis Morgane Thomas-Chollier Computa)onal - PowerPoint PPT Presentation

ChIP-‑seq ¡analysis ¡ Morgane ¡Thomas-‑Chollier ¡ ¡ Computa)onal ¡systems ¡biology ¡-‑ ¡IBENS ¡ mthomas@biologie.ens.fr ¡ ¡ M2 ¡– ¡Computa6onal ¡analysis ¡of ¡cis-‑regulatory ¡sequences ¡2013/2014 ¡ Denis ¡Thieffry, ¡Jacques ¡van ¡Helden ¡and ¡Carl ¡Herrmann ¡kindly ¡shared ¡some ¡of ¡their ¡slides. ¡ ¡

The ¡ChIP-‑seq ¡era ¡ Pubmed hits per year for "ChiP-Seq" 300 250 200 150 100 50 0 2005 2006 2007 2008 2009 2010 2011 2012 2013

Aim ¡of ¡the ¡course ¡ 1 ¡-‑ ¡From ¡reads ¡to ¡peaks ¡(= ¡primary ¡analysis) ¡ ¡ ¡ ¡ ¡ ¡ ¡ 2 ¡-‑ ¡Secondary ¡analysis ¡ ¡-‑ ¡mo6f ¡discovery ¡in ¡peaks ¡ ¡-‑ ¡func6onal ¡annota6on ¡of ¡peaks ¡

in ¡vivo ¡experimental ¡methods ¡to ¡iden6fy ¡binding ¡sites ¡ ChIP ¡(=Chroma6n ¡Immuno-‑Precipita6on) ¡ differences ¡in ¡ methods ¡to ¡detect ¡ the ¡ bound ¡DNA ¡ ¡ ¡ ¡ -‑ small-‑scale: ¡PCR ¡/ ¡qPCR ¡ ¡ ¡ ¡ -‑ ¡large-‑scale: ¡ ¡ ¡ -‑ ¡microarray ¡= ¡ ChIP-‑on-‑chip ¡ -‑ ¡sequencing ¡= ¡ ChIP-‑seq ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ h9p://www.chip-‑an)bodies.com/ ¡ ¡

ChIP-‑seq ¡ aim: ¡ find ¡ all ¡ regions ¡bound ¡by ¡a ¡specific ¡transcripIon ¡factor ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡by ¡histones ¡bearing ¡a ¡specific ¡modificaIon ¡ ¡in ¡a ¡given ¡ experimental ¡condi)on ¡ (cell ¡type, ¡developmental ¡stage,...) ¡ ¡ Mardis. ¡Nat ¡Methods ¡(2007) ¡ and ¡then ¡what ¡???? ¡

ChIP-‑seq ¡ Experimental ¡approach ¡ BioinformaIc ¡approach ¡ and ¡then ¡what ¡???? ¡

Different ¡ChIP ¡profiles ¡ Park, ¡Nature ¡reviews ¡2009 ¡

Modelling ¡noise ¡levels ¡ ChIP-seq dataset (=treatment) = signal + How do we estimate the noise ? background noise

Modelling ¡noise ¡levels ¡ ● noise ¡is ¡ not ¡uniform ¡(chromaIn ¡conformaIon, ¡local ¡biases, ¡ mappability) ¡ ● input ¡dataset ¡is ¡ mandatory ¡for ¡reliable ¡local ¡esImaIon ¡! ¡ ¡ (although ¡some ¡algorithms ¡do ¡not ¡require ¡it ¡… ¡:-‑( ¡ ¡) ¡ treatment ? input

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ sequences ¡(reads ¡length ¡36 ¡bp) ¡ ¡ from ¡Illumina ¡

FASTQ ¡format ¡ @ SRR002012.1 Oct4:5:1:871:340 > SRR002012.1 Oct4:5:1:871:340 GGCGCACTTACACCCTACATCCATTG GGCGCACTTACACCCTACATCCATTG + > SRR002012.2 Oct4:5:1:804:348 IIIIG1?II;IIIII1IIII1%.I7I GTCTGCATTATCTACCAGCACTTCCC @ SRR002012.2 Oct4:5:1:804:348 > SRR002012.3 Oct4:5:1:767:334 GTCTGCATTATCTACCAGCACTTCCC GCTGTCTTCCCGCTGTTTTATCCCCC + > SRR002012.4 Oct4:5:1:805:329 IIIIIIIII'I2IIIII:)I2II3I0 GTAGTTTACCTGTTCATATGTTTCTG @ SRR002012.3 Oct4:5:1:767:334 GCTGTCTTCCCGCTGTTTTATCCCCC + III8IIIIIII3III6II%II*III3 @ SRR002012.4 Oct4:5:1:805:329 GTAGTTTACCTGTTCATATGTTTCTG + IIIIIII9IIIIII?IIIIIIII7II adapted ¡from ¡Wikipedia ¡ SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS ..................................................... ..........................XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX...................... ...............................IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII...................... .................................JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJ...................... !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHI JKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~ | | | | | | 33 59 64 73 104 126 0 40 S - Sanger Phred+33, raw reads typically (0, 40) X - Solexa Solexa+64, raw reads typically (-5, 40) I - Illumina 1.3+ Phred+64, raw reads typically (0, 40) J - Illumina 1.5+ Phred+64, raw reads typically (3, 40) with 0=unused, 1=unused, 2=Read Segment Quality Control Indicator (bold)

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ sequences ¡(reads ¡length ¡36 ¡bp) ¡ ¡ from ¡Illumina ¡ quality ¡check ¡ FASTQC ¡

h]p://www.bioinformaIcs.bbsrc.ac.uk/projects/fastqc/ ¡ ¡ h]p://bioinfo-‑core.org/index.php/9th_Discussion-‑28_October_2010 ¡ h]p://bioinfo.cipf.es/courses/mda11/lib/exe/fetch.php?media=ngs_qc_tutorial_mda_val_2011.pdf ¡

modEncode Kni Drosophila

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ sequences ¡(reads ¡length ¡30/34 ¡bp) ¡ ¡ from ¡Illumina ¡ quality ¡check ¡ FASTQC ¡ if ¡necessary ¡only ¡!!! ¡ remove ¡adapter ¡sequences ¡ cutadapt ¡ h]p://code.google.com/p/cutadapt/ ¡ quality ¡check ¡ FASTQC ¡ FASTQ ¡ FASTQ ¡

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ FASTQC ¡ if ¡necessary ¡only ¡!!! ¡ cutadapt ¡ BED ¡ BAM ¡ SAM ¡ FASTQC ¡ mapping ¡ BED ¡ BAM ¡ FASTQ ¡ FASTQ ¡ BowIe ¡ SAM ¡ Langmead, Genome Biol 10:R25 (2009)

Mapping ¡ h]p://bifx-‑core.bio.ed.ac.uk:8080/galaxy/u/shaun%20webb/p/ngs-‑workshop ¡ ¡BowIe ¡and ¡Colourspace ¡BowIe ¡ ¡BWA ¡ ¡LastZ ¡ ¡ ¡Tophat ¡… ¡

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ FASTQC ¡ if ¡necessary ¡only ¡!!! ¡ cutadapt ¡ BED ¡ BAM ¡ SAM ¡ FASTQC ¡ mapping ¡ quality ¡check ¡ BED ¡ BAM ¡ FASTQ ¡ FASTQ ¡ BowIe ¡ Samstat ¡ SAM ¡ Lassmann ¡et ¡al. ¡ Bioinforma)cs ¡ (2010) ¡ Langmead, Genome Biol 10:R25 (2009)

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ FASTQ ¡ FASTQ ¡ GR FASTQC ¡ FASTQC ¡ if ¡necessary ¡only ¡!!! ¡ Input cutadapt ¡ cutadapt ¡ FASTQC ¡ FASTQC ¡ mapping ¡ quality ¡check ¡ visualiza6on ¡ BED ¡ BAM ¡ FASTQ ¡ FASTQ ¡ BowIe ¡ Samstat ¡ SAM ¡ Lassmann ¡et ¡al. ¡ Bioinforma)cs ¡ (2010) ¡ Langmead, Genome Biol 10:R25 (2009)

mapping ¡ peak-‑calling ¡ Valouev ¡Nat ¡Methods ¡(2008), ¡Jothi, ¡NAR ¡(2008) ¡

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ experiment ¡ ¡ ¡ ¡Input ¡ FASTQ ¡ FASTQ ¡ FASTQ ¡ FASTQ ¡ GR FASTQC ¡ FASTQC ¡ if ¡necessary ¡only ¡!!! ¡ Input cutadapt ¡ cutadapt ¡ FASTQC ¡ FASTQC ¡ mapping ¡ quality ¡check ¡ visualiza6on ¡ BED ¡ BAM ¡ FASTQ ¡ FASTQ ¡ BowIe ¡ Samstat ¡ SAM ¡ Lassmann ¡et ¡al. ¡ Bioinforma)cs ¡ (2010) ¡ Langmead, Genome Biol 10:R25 (2009)

From ¡sequence ¡reads ¡to ¡peaks ¡ experiment ¡ ¡ ¡ ¡Input ¡ experiment ¡ ¡ ¡ ¡Input ¡ MACS ¡ ¡ treatment ¡vs ¡control ¡ FASTQ ¡ FASTQ ¡ FASTQ ¡ FASTQ ¡ peaks FASTQC ¡ FASTQC ¡ if ¡necessary ¡only ¡!!! ¡ Cut-‑off ¡FDR ¡(2%) ¡ peak ¡calling ¡ MACS ¡ cutadapt ¡ cutadapt ¡ Zhang, ¡ Genome ¡Biol ¡(2008) ¡ ¡ FASTQC ¡ FASTQC ¡ visualiza6on ¡ BED ¡ BED ¡ BAM ¡ BAM ¡ FASTQ ¡ FASTQ ¡ Samstat ¡ BowIe ¡ SAM ¡ SAM ¡

mapping ¡ peak-‑calling ¡ Valouev ¡Nat ¡Methods ¡(2008), ¡Jothi, ¡NAR ¡(2008) ¡

ChIP-seq analysis Morgane Thomas-Chollier Computa)onal - PowerPoint PPT Presentation

ChIP-seq analysis Morgane Thomas-Chollier Computa)onal systems biology - IBENS mthomas@biologie.ens.fr M2 Computa6onal analysis of cis-regulatory

Jen Grenier Director, TREx Facility Announcements New and Improved Project Submission Form

Importing data Peter Humburg Statistician, Macquarie University DataCamp ChIP-seq Workflows in

Methods for Analyzing ChIP-Seq data Introduction to the ChIP-Seq server at SIB Lausanne Public

Introduction to RNA-Seq Mary Piper Bioinformatics Consultant and Trainer DataCamp RNA-Seq

ChIP-seq data analysis 04-05-12 Outlook Friday 04-05-12: Next-generation sequencing

Introduction to Chromatin IP sequencing (ChIP-seq) data analysis Workshop on ChIP-seq data

The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu

Scaling normalisation for ChIP-seq with exogenous chromatin Workshop on ChIP-seq data analysis

Introduction to differential binding Peter Humburg Statistician, Macquarie University DataCamp

Re-analysis of a CD4 ChIP-Seq data set with csaw Ryan C. Thompson Salomon Lab The Scripps

RNA-seq Data Analysis Introduction to RNA-seq data analysis June, 2018 1 Luigi Grassi < lg

Calibration des Microroc (II) Alex, Cyril, Giom, Jean, Max 09 Mai 2011, Annecy 1 Reminder 2

Genome-wide supervised ChIP-seq peak detection Toby Dylan Hocking toby.hocking@mail.mcgill.ca

Introduction to ChIP-seq Joanna Krupka CRUK Summer School in Bioinformatics

Overview of the DE analysis Mary Piper Bioinformatics Consultant and Trainer DataCamp RNA-Seq

RNA-seq: filtering, quality control and visualisation COMBINE RNA-seq Workshop QC and

HTPMD High Throughput Parallel Molecular Dynamics Steve Cox RENCI Engagement Overview

human protein kinase CK2 Christian Nienberg 1, *, Anika Retterath 1 , Kira Sophie Becher 2 ,

Hands-on Exercises C H I P S T E R A N D F E D E R A T E D C L O U D Slides and Exercises m

Overview Overview Processors Interconnect Look at the 3 Japanese HPCs Examine the

Data Mining in Bioinformatics Day 6: Classification in Next Generation Sequencing Data Analysis

The microRNAs of Caenorhabditis elegans (Lim et al . Genes & Development 2003) Vertebrate

Interprtation abstraite de modles de voies de signalisation intracellulaire Jrme Feret

Introduction to Genomics Atul Butte, MD atul_butte@harvard.edu Childrens Hospital Informatics

Sambuz

Useful Links

Newsletter

Mail Us

ChIP-seq analysis Morgane Thomas-Chollier Computa)onal - PowerPoint PPT Presentation

ChIP-seq analysis Morgane Thomas-Chollier Computa)onal systems biology - IBENS mthomas@biologie.ens.fr M2 Computa6onal analysis of cis-regulatory

Jen Grenier Director, TREx Facility Announcements New and Improved Project Submission Form

Importing data Peter Humburg Statistician, Macquarie University DataCamp ChIP-seq Workflows in

Methods for Analyzing ChIP-Seq data Introduction to the ChIP-Seq server at SIB Lausanne Public

Introduction to RNA-Seq Mary Piper Bioinformatics Consultant and Trainer DataCamp RNA-Seq

ChIP-seq data analysis 04-05-12 Outlook Friday 04-05-12: Next-generation sequencing

Introduction to Chromatin IP sequencing (ChIP-seq) data analysis Workshop on ChIP-seq data

The Epigenome Tools 2: ChIP-Seq and Data Analysis Chongzhi Zang zang@virginia.edu

Scaling normalisation for ChIP-seq with exogenous chromatin Workshop on ChIP-seq data analysis

Introduction to differential binding Peter Humburg Statistician, Macquarie University DataCamp

Re-analysis of a CD4 ChIP-Seq data set with csaw Ryan C. Thompson Salomon Lab The Scripps

RNA-seq Data Analysis Introduction to RNA-seq data analysis June, 2018 1 Luigi Grassi &lt; lg

Calibration des Microroc (II) Alex, Cyril, Giom, Jean, Max 09 Mai 2011, Annecy 1 Reminder 2

Genome-wide supervised ChIP-seq peak detection Toby Dylan Hocking toby.hocking@mail.mcgill.ca

Introduction to ChIP-seq Joanna Krupka CRUK Summer School in Bioinformatics

Overview of the DE analysis Mary Piper Bioinformatics Consultant and Trainer DataCamp RNA-Seq

RNA-seq: filtering, quality control and visualisation COMBINE RNA-seq Workshop QC and

HTPMD High Throughput Parallel Molecular Dynamics Steve Cox RENCI Engagement Overview

human protein kinase CK2 Christian Nienberg 1, *, Anika Retterath 1 , Kira Sophie Becher 2 ,

Hands-on Exercises C H I P S T E R A N D F E D E R A T E D C L O U D Slides and Exercises m

Overview Overview Processors Interconnect Look at the 3 Japanese HPCs Examine the

Data Mining in Bioinformatics Day 6: Classification in Next Generation Sequencing Data Analysis

The microRNAs of Caenorhabditis elegans (Lim et al . Genes &amp; Development 2003) Vertebrate

Interprtation abstraite de modles de voies de signalisation intracellulaire Jrme Feret

Introduction to Genomics Atul Butte, MD atul_butte@harvard.edu Childrens Hospital Informatics

Sambuz

Useful Links

Newsletter

Mail Us

RNA-seq Data Analysis Introduction to RNA-seq data analysis June, 2018 1 Luigi Grassi < lg

The microRNAs of Caenorhabditis elegans (Lim et al . Genes & Development 2003) Vertebrate