SLIDE 1
Nature of ChIP-Seq data
Short tag sequences of about 30 bp. Sequence correspond to 5’ and 3’ ends of ChIP-fragments DNA fragments have characteristic length resulting from sonication nuclease treatment Tags have some error rates (quality scores available) Not all tags can be uniquely mapped to the genome because of Recent repetitive elements SNPs and other types of genetic variations Tandem repeats (satellite DNA)
What can we do with ChIP-seq data:
- 1. Viewing the data in a genome browser environment:
Interesting to virtually every biologist Methods, upload of BED or WIG files
- 2. Statistical analysis of count distribution over the genome:
Example: correlation plot
- 3. Data reduction and interpretation
Partitioning: finding segments of signal-rich and signal-poor region. Peak recognition: Finding peaks of predefined size.
- 4. Higher-level (downstream) analysis: