Chromatin structure and 3C-like data Davide Ba & Franois Serra - - PowerPoint PPT Presentation

chromatin structure and 3c like data
SMART_READER_LITE
LIVE PREVIEW

Chromatin structure and 3C-like data Davide Ba & Franois Serra - - PowerPoint PPT Presentation

Chromatin structure and 3C-like data Davide Ba & Franois Serra Genome Biology Group (CNAG) Structural Genomics Group (CRG) The role of chromatin structure It can give insights into how distant genomic elements interacts with each


slide-1
SLIDE 1

Chromatin structure and 3C-like data

Davide Baù & François Serra

Genome Biology Group (CNAG) Structural Genomics Group (CRG)

slide-2
SLIDE 2

The role of chromatin structure

It can give insights into how distant genomic elements interacts with each other

Compact Loose

It helps to understand the compartmentalization

  • f chromosomes within the nucleus

It is essential to understand the mechanisms that regulate the cell

slide-3
SLIDE 3

Chromatin definition

Chromatin is composed of DNA complexed with histones and

  • ther proteins

Chromatin formation enables the genome to be hierarchically packaged or condensed so that it can fit inside the nuclear space The compaction allows to modulate gene transcription, DNA repair, recombination, and replication Chromatin structure is considered highly dynamic

slide-4
SLIDE 4

Resolution gap

μ 10 10 10 Resolution s Time 10 10 10 10 10 10 10 10 μm Volume 10 10 10 10 10 DNA length nt 10 10 10 10

Knowledge

IDM INM

slide-5
SLIDE 5

Chromatin structures

slide-6
SLIDE 6

The nuclear organization of DNA

Chromosome Chromatin fibre Nucleosome

Adapted from Richard E. Ballermann, 2012

slide-7
SLIDE 7

The nucleosome

Gene Histone Histone tail Methyl group Acetyl group DNA Histone proteins

slide-8
SLIDE 8

Histone modification effects

Type of modification H3K4 H3K9 H3K14 H3K27 H3K79 H4K20 H2BK5 mono- methylation activation activation activation activation activation activation di-methylation activation repression repression activation tri-methylation activation repression repression activation, repression repression acetylation activation activation

slide-9
SLIDE 9

The chromatin compaction levels

Several nucleosomes in a row form what is often referred to as a beads-on-a-string fiber (the 11 nm fiber) When histones H1 or H5, referred to as linker histones, are added to the 11-nm fiber, the condensed 30 nm fiber is formed The 30 nm fibers form the next level of compaction by forming loops

1 2 3 4 5 1 3 5 2 4 1 2 3 5

zigzag linker DNA)

11 nm fiber

30-nm fiber (secondary level)

30 nm fiber

slide-10
SLIDE 10

The chromatin compaction levels

Odd-numbered nucleosome Even-numbered nucleosome Plane of nucleosome layers DNA Protein scaffold Chromatin loop Metaphase chromosome

1 2 3 4 5 1 3 5 2 4 1 2 3 5

f Organization of whole

chromosomes inside the nucleus (quaternary level)

d Loops of 30-nm

fiber (tertiary level)

e Interdigitating layers of

irregularly organized nucleosomes (tertiary level)

a 11-nm fiber

(primary level)

b Nucleosome stacking

(folded 11-nm fiber with zigzag linker DNA)

c 30-nm fiber

(secondary level)

Nucleus

Adapted from Annu. Rev. Genomics Hum. Genet. 2012, 13:59-82

slide-11
SLIDE 11

Euchromatin and heterochromatin

Euchromatin: chromatin that is located away from the nuclear lamina, is generally less densely packed, and contains actively transcribed genes Heterochromatin: chromatin that is near the nuclear lamina, tightly condensed, and transcriptionally silent

Electron microscopy

slide-12
SLIDE 12

Complex genome organization

Takizawa, T., Meaburn, K. J. & Misteli, T. The meaning of gene positioning. Cell 135, 9–13 (2008).

 

Chromosome size Gene density Expression

slide-13
SLIDE 13

Lamina-genome interactions

to neural/glial The poising’’ ), in promoters here and architec-

  • ver-

large step

nuclear membrane nuclear lamina internal chromatin

(mostly active)

lamina-associated domains

(repressed)

Genes mRNA

AC

“Unlocking” gene Stemcell genes Cell-cycle gene Neuronal gene

Most genes in Lamina Associated Domains are transcriptionally silent, suggesting that lamina-genome interactions are widely involved in the control of gene expression

Adapted from Molecular Cell 38, 603-613, 2010

slide-14
SLIDE 14

Complex genome organization

Cavalli, G. & Misteli, T. Functional implications of genome topology. Nat Struct Mol Biol 20, 290–299 (2013).

DNA Chromatin domains Superdomains Chromosome territories Lamina Transcription hub Centromere cluster Nuclear pore Inactive Active Non- coding Nucleus Marina Corral

slide-15
SLIDE 15

Chromatin loops

Complex Simple

Loops bring distal genomic regions in close proximity to one another This in turn can have profound effects on gene transcription Enhancers can be thousands of kilobases away from their target genes in any direction (or even on a separate chromosome)

slide-16
SLIDE 16

Main approaches

slide-17
SLIDE 17

Restrain based modeling (IMP)

slide-18
SLIDE 18

5C technology

http://my5C.umassmed.edu

Job Dekker

Dostie et al. Genome Res (2006) vol. 16 (10) pp. 1299-309

slide-19
SLIDE 19

Hi-C data and genomic tracks data

  • RefSeq genes
  • Interaction

enrichment Interaction depletion Mouse chromosome 18 20 Mb

  • DNase I sensitivity

Dekker, J., Marti-Renom, M. A. & Mirny, L. A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat Rev Genet 14, 390–403 (2013).

slide-20
SLIDE 20

Complex genome organization

A compartments 20 Mb 2 Mb B compartments Interaction preference TADs Compartments

Dekker, J., Marti-Renom, M. A. & Mirny, L. A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat Rev Genet 14, 390–403 (2013).

Human chr 14

slide-21
SLIDE 21

1 Mb

b

100 Mb 101 Mb 102 Mb

B F G H I E D C A T A D s 2 98% of max Median count in 30-kb window

chrX 99 Mb 100 Mb 101 Mb 102 Mb 99 Mb 103 Mb 103 Mb

F G H I E D

TADs

Topologically associating domains (TADs) can be made of up to hundreds of kb in size Loci located within TADs tend to interact more frequently with each other than with loci located outside their domain The human and mouse genomes are each composed of over 2,000 TADs, covering over 90% of the genome

Topologically Associating Domains (TADs)

slide-22
SLIDE 22

Human α-globin domain

ENm008 genomic structure and environment

RAB11FIP3 DECR2 LOC1001134368 HS8 HS10 HS40 HS33 HS46 HS48 TMEM8 MRPL28 AXIN1 PDIA2 ARHGDIG RGS11 ITFG3 LUC7L HB HB1 HB2 HB HB C16ORF35 500000| 450000| 400000| 350000| 0| 300000| 250000| 200000| 150000| 100000| 50000| POLR3K SNRNP25 RHBDF1 MPG

p13.3 13.2 12.3 p12.1 16p11.2 11.1 q11.2 q12.1 13 16q21 22.1 q23.1

GM12878

250 500 750 >1,000 GM12878 cells 5C counts

b

250 500 750 >1,000 Forward fragments Reverse fragments K562 cells 5C counts

a

Forward fragments Reverse fragments

  • GM12878

K562

ENCODE Consortium. Nature (2007) vol. 447 (7146) pp. 799-816

slide-23
SLIDE 23

The genome of Caulobacter Crescentus

Toy interaction matrix

Ori Ter Ter Ori Ori Ori

0.0 0.5 1.1 1.7 2.1 2.5 3.0 3.5 4.0

Minus Probe Genome Position (mbp)

0.0 0.5 1.1 1.6 2.1 2.5 3.1 3.6 4.0

Plus Probe Genome Position (mbp)

  • 7.5 x 10
  • 1
  • 2.81 x 10-1

1.88 x 10-1 6.56 x 10-1 1.12 x 10 1.59 x 100 2.06 x 10 2.53 x 100 3 x 10

5C interaction Z-scores

slide-24
SLIDE 24

Ori Ter Ter Ori Ori Ori

0.0 0.5 1.1 1.7 2.1 2.5 3.0 3.5 4.0

Minus Probe Genome Position (mbp)

0.0 0.5 1.1 1.6 2.1 2.5 3.1 3.6 4.0

Plus Probe Genome Position (mbp)

  • 7.5 x 10
  • 1
  • 2.81 x 10-1

1.88 x 10-1 6.56 x 10-1 1.12 x 10 1.59 x 100 2.06 x 10 2.53 x 100 3 x 10

5C interaction Z-scores

= - Strand = + Strand

Terminus Origin

The genome of Caulobacter Crescentus

Toy interaction matrix

slide-25
SLIDE 25

Ori Ter Ter Ori Ori Ori

0.0 0.5 1.1 1.7 2.1 2.5 3.0 3.5 4.0

Minus Probe Genome Position (mbp)

0.0 0.5 1.1 1.6 2.1 2.5 3.1 3.6 4.0

Plus Probe Genome Position (mbp)

  • 7.5 x 10
  • 1
  • 2.81 x 10-1

1.88 x 10-1 6.56 x 10-1 1.12 x 10 1.59 x 100 2.06 x 100 2.53 x 100 3 x 10

5C interaction Z-scores

The genome of Caulobacter Crescentus

Real interaction matrix

slide-26
SLIDE 26

Ori Ter Ter Ori Ori Ori

0.0 0.5 1.1 1.7 2.1 2.5 3.0 3.5 4.0

Minus Probe Genome Position (mbp)

0.0 0.5 1.1 1.6 2.1 2.5 3.1 3.6 4.0

Plus Probe Genome Position (mbp)

  • 7.5 x 10
  • 1
  • 2.81 x 10-1

1.88 x 10-1 6.56 x 10-1 1.12 x 10 1.59 x 100 2.06 x 100 2.53 x 100 3 x 10

5C interaction Z-scores

Ori Ter

0.5 1 1.5 2 2.5 3 3.5 4 4.5
  • 5
0.5 1 1.5 2 2.5 3 Genome Position (mbp) Contact Frequency

The genome of Caulobacter Crescentus

Real interaction matrix

slide-27
SLIDE 27

Chromatin = DNA + (histone) proteins The genome is well organized and hierarchically packaged Histone modifications affect chromatin structure and activity 3C-like data measure the frequency of interaction between distant loci

Take home message

slide-28
SLIDE 28

How DNA is packaged

slide-29
SLIDE 29

The Integrative Modeling Platform

http://integrativemodeling.org

slide-30
SLIDE 30

Installing IMP

Move to the IMP directory and compile the code

Install the required libraries: sudo apt-get install cmake sudo apt-get install libboost1.49-all-dev sudo apt-get install libhdf5-dev sudo apt-get install swig sudo apt-get install libcgal-dev sudo apt-get install python-dev Download the IMP tarball file from http://salilab.org/imp/ and uncompress it: wget http://salilab.org/imp/get.php?pkg=2.0.1/download/imp-2.0.1.tar.gz -O imp-2.0.1.tar.gz tar xzvf imp-2.0.1.tar.gz

slide-31
SLIDE 31

Compiling IMP

LD_LIBRARY_PATH="/SOMETHING/imp-2.0.1/lib:/SOMETHING/imp-2.0.1/lib:/SOMETHING/imp-2.0.1/src/ dependency/RMF/:$LD_LIBRARY_PATH" export LD_LIBRARY_PATH PYTHONPATH="/SOMETHING/imp-2.0.1/lib:/SOMETHING/imp-2.0.1/lib:/SOMETHING/imp-2.0.1/src/dependency/ RMF/:$PYTHONPATH" export PYTHONPATH Once the compilation has finished, open the file setup_environment.sh in your IMP directory and copy the first lines into your ~/.bashrc file (if this file in not present in your home directory, create it). These lines should look like:

cd imp-2.0.1 cmake . -DCMAKE_BUILD_TYPE=Release -DIMP_MAX_CHECKS=NONE -DIMP_MAX_LOG=SILENT make -j4 >> Do not copy the lines above, copy them from setup_environment.sh, where SOMETHING is replaced by your real path to IMP <<

slide-32
SLIDE 32

Installing Chimera

http://www.cgl.ucsf.edu/chimera/

slide-33
SLIDE 33

Chimera commands

Align match.sh #1 #0 Select select #model:particles Measure distance #0:1-2 angle #0:1-2 Display vdwdefine #radius shape tube #0 radius 1 bandLength 3 segmentSubdivisions 10 shape tube #0 rad 1 band 3 seg 10 Surface molmap #all 80 color transparency