3D folding of chromosomal domains in relation to gene expression - - PowerPoint PPT Presentation

3d folding of chromosomal domains in relation to gene
SMART_READER_LITE
LIVE PREVIEW

3D folding of chromosomal domains in relation to gene expression - - PowerPoint PPT Presentation

3D folding of chromosomal domains in relation to gene expression Marc A. Marti-Renom http://sgu.bioinfo.cipf.es Structural Genomics Unit Bioinformatics & Genomics Department Prince Felipe Research Center (CIPF), Valencia, Spain Thursday,


slide-1
SLIDE 1

Marc A. Marti-Renom

http://sgu.bioinfo.cipf.es

Structural Genomics Unit Bioinformatics & Genomics Department Prince Felipe Research Center (CIPF), Valencia, Spain

3D folding of chromosomal domains in relation to gene expression

Thursday, November 25, 2010

slide-2
SLIDE 2

2

Complex genomes Simple genomes

Aim

Can we relate structure and expression?

Thursday, November 25, 2010

slide-3
SLIDE 3

Resolution

Limited knowledge...

3

Adapted from: Langowski and Heermann. Semin Cell Dev Biol (2007) vol. 18 (5) pp. 659-67

μ 10-1 10-2 10-3 Resolution s Time 103 102 100 10-2 10-4 10-6 10-8 10-10 μm3 Volume 103 100 10-3 10-6 10-9 DNA length nt 109 106 103 100

Knowledge

Thursday, November 25, 2010

slide-4
SLIDE 4

4

Experiments Computation

Integrative and iterative approach

Thursday, November 25, 2010

slide-5
SLIDE 5

Structure determination

Integrative Modeling Platform

http://www.integrativemodeling.org

5

Alber et al. Nature (2007) vol. 450 (7170) pp. 683-94

Biomolecular structure determination 2D-NOESY data Chromosome structure determination 5C data

Thursday, November 25, 2010

slide-6
SLIDE 6

6

5C “copies” the 3C library into a 5C library containing only ligation junctions Performed at high levels of multiplexing: 2,000 primers detect 1,000,000 unique interactions in 1 reaction

5C technology

Detecting up to millions of interactions in parallel

http://my5C.umassmed.edu

Dostie et al. Genome Res (2006) vol. 16 (10) pp. 1299-309

Thursday, November 25, 2010

slide-7
SLIDE 7

Human α-globin domain

ENm008 genomic structure and environment

7

ENCODE Consortium. Nature (2007) vol. 447 (7146) pp. 799-816

The ENCODE data for ENm008 region was obtained from the UCSC Genome Browser tracks for: RefSeq annotated genes, Affymetrix/CSHL expression data (Gingeras Group at Cold Spring Harbor), Duke/NHGRI DNaseI Hypersensitivity data (Crawford Group at Duke University), and Histone Modifications by Broad Institute ChIP-seq (Bernstein Group at Broad Institute of Harvard and MIT).

RAB11FIP3 DECR2 LOC1001134368 HS8 HS10 HS40 HS33 HS46 HS48 TMEM8 MRPL28 AXIN1 PDIA2 ARHGDIG RGS11 ITFG3 LUC7L HB HB1 HB2 HB HB C16ORF35 500000| 450000| 400000| 350000| 0| 300000| 250000| 200000| 150000| 100000| 50000| POLR3K SNRNP25 RHBDF1 MPG

p13.3 13.2 12.3 p12.1 16p11.2 11.1 q11.2 q12.1 13 16q21 22.1 q23.1

K562 GM06990 DNaseI K562 GM12878 H3K4me3 CTCF K562 GM12878 K562 GM12878 RNA diff CTCF

K562 GM12878

Thursday, November 25, 2010

slide-8
SLIDE 8

8

RAB11FIP3 DECR2 LOC1001134368 HS8 HS10 HS40 HS33 HS46 HS48 TMEM8 MRPL28 AXIN1 PDIA2 ARHGDIG RGS11 ITFG3 LUC7L HB HB1 HB2 HB HB C16ORF35 500000| 450000| 400000| 350000| 0| 300000| 250000| 200000| 150000| 100000| 50000| POLR3K SNRNP25 RHBDF1 MPG

p13.3 13.2 12.3 p12.1 16p11.2 11.1 q11.2 q12.1 13 16q21 22.1 q23.1

GM12878

α-globin genes enhancer

Human α-globin domain

ENm008 genomic structure and environment

ENCODE Consortium. Nature (2007) vol. 447 (7146) pp. 799-816

K562 cells: α-globin genes active

Thursday, November 25, 2010

slide-9
SLIDE 9

Integrative Modeling

http://www.integrativemodeling.org

9

P1 P2

Thursday, November 25, 2010

slide-10
SLIDE 10

Representation

10

i i+2 i+1 i+n

Harmonic Lower Bound Harmonic Upper Bound Harmonic

Hi, j = k di, j " di, j

( )

2

if di, j " di, j

0 ;

lbHi, j = k di, j # di, j

( )

2

if di, j > di, j

0 ;

lbHi, j = 0 $ % & ' & if di, j " di, j

0 ;

ubHi, j = k di, j # di, j

( )

2

if di, j < di, j

0 ;

ubHi, j = 0 $ % & ' &

Thursday, November 25, 2010

slide-11
SLIDE 11

Scoring

11

Harmonic Harmonic Lower Bound Harmonic Upper Bound

GM12878 K562

70 fragments 1,520 restraints 70 fragments 1,049 restraints

Thursday, November 25, 2010

slide-12
SLIDE 12

Optimization

CREATE PARTICLES ADD RESTRAINTS SIMULATED ANEALING MONTE-CARLO LOCAL CONJUGATE GRADIENT LOWEST OBJECTIVE FUNCTION start end 500 rounds 5 steps

50 100 150 200 250 300 350 400 450 500 Iteration 0.0E+00 1.00E+06 2.00E+06 3.00E+06 4.00E+06 5.00E+06 6.00E+06 7.00E+06 IMP Objective function

12

Thursday, November 25, 2010

slide-13
SLIDE 13

13

Clustering

Thursday, November 25, 2010

slide-14
SLIDE 14

Not just one solution

14

GM12878 K562

Thursday, November 25, 2010

slide-15
SLIDE 15

Not just one solution

and we can de-convolute them!

15

Thursday, November 25, 2010

slide-16
SLIDE 16

K562 GM12878

Cluster #2 314 model 232,673 IMP OF Cluster #1 2780 model 910,280 IMP OF

16

Consistency

Consistency (%)

20 40 60 80 100

50 nm 75 nm 100 nm 125 nm 150 nm

K562

150 nm

20 40 60 80 100

50 nm 75 nm 100 nm 125 nm

GM15878

Fragment

Thursday, November 25, 2010

slide-17
SLIDE 17

K562 GM12878

Cluster #2 314 model 232,673 IMP OF Cluster #1 2780 model 910,280 IMP OF

16

Regulatory elements

200 400 600 800 1,000 Distance (nm)

K562 GM12878

K562 GM12878

Relative abundance

Promoters Active genes No-active genes DNaseI sites CTCF sites H3K4me3 sites

Distance to center (nm) <400 <300 <350 <250 <200 <150 <100 <50

K562 GM12878

Distance to center (nm) <400 <300 <350 <250 <200 <150 <100 <50 Relative abundance

Promoters Active genes No-active genes DNaseI sites CTCF sites H3K4me3 sites

.0 .5 1.0 1.5 2.0 2.5 .00 .50 1.00 1.50 2.00 2.50

Thursday, November 25, 2010

slide-18
SLIDE 18

K562 GM12878

Cluster #2 314 model 232,673 IMP OF Cluster #1 2780 model 910,280 IMP OF

16

Compactness

K562 GM12878

40 50 60 70 80 90 100 110 Density (bp/1nm)

K562 GM12878

Fragment

Thursday, November 25, 2010

slide-19
SLIDE 19

K562 GM12878

Cluster #2 314 model 232,673 IMP OF Cluster #1 2780 model 910,280 IMP OF

16

Multi-loops

Path length (nm)

Distance between anchoring points (nm) 73 117 161 205 >=250

700 600 500 400 300 300 400 500 600 700

44Kb 20Kb 68Kb 52Kb 64Kb 45Kb 35Kb 65Kb 50Kb 55Kb 30Kb 68Kb 63Kb 69Kb 64Kb 55Kb

K562 GM12878

Thursday, November 25, 2010

slide-20
SLIDE 20

K562 GM12878

Cluster #2 314 model 232,673 IMP OF Cluster #1 2780 model 910,280 IMP OF

16

Expression

Increased in K562 Increased in GM12878 =

Thursday, November 25, 2010

slide-21
SLIDE 21

K562 GM12878

Cluster #2 314 model 232,673 IMP OF Cluster #1 2780 model 910,280 IMP OF

16

K562 GM12878

100 200 300 400 500 Distance (nm)

GM12878 K562 FISH M

  • d

e l s ( 2 D )

FISH validation

Thursday, November 25, 2010

slide-22
SLIDE 22

Summary

5C data results in comprehensive interaction matrices to build a consistent 3D model

17

Thursday, November 25, 2010

slide-23
SLIDE 23

18

Summary

Models allow for 5C data de-convolution

Thursday, November 25, 2010

slide-24
SLIDE 24

Summary

Models allow for 5C data de-convolution

20

Thursday, November 25, 2010

slide-25
SLIDE 25

Summary

Selected models reproduce known (and new) interactions

20

Thursday, November 25, 2010

slide-26
SLIDE 26 100 nm 100 nm

α-globin- Enhancer looping

  • nly in K562

Looping interaction In GM12878 cells

Summary

Large-scale changes in conformation correlate with gene expression of resident genes

21

K562 GM12878 RNA diff RAB11FIP3 DECR2 LOC1001134368 HS8 HS10 HS40 HS33 HS46 HS48 TMEM8 MRPL28 AXIN1 PDIA2 ARHGDIG RGS11 ITFG3 LUC7L HB HB1 HB2 HB HB C16ORF35 POLR3K SNRNP25 RHBDF1 MPG

Thursday, November 25, 2010

slide-27
SLIDE 27 100 nm 100 nm

Summary

The models have been partially validated by FISH

22

K562 GM12878

Thursday, November 25, 2010

slide-28
SLIDE 28

23

Summary

“Chromatin Globule” model

Münkel et al. JMB (1999) Lieberman-Aiden et al. Science (2009) Phillips and Corces. Cell (2009)

  • PolII

HBB Eraf Factory

a b

Osborne et al. Nat Genet (2004)

Thursday, November 25, 2010

slide-29
SLIDE 29

Acknowledgments

  • D. Baù, A. Sanyal, B. Lajoie, E. Capriotti, M. Byron, J. Lawrence, J. Dekker, and M.A. Marti-Renom.

Nature Structural & Molecular Biology (2010) in press (5th of December).

http://sgu.bioinfo.cipf.es http://integrativemodeling.org

Emidio Capriotti

Postdoctoral fellow

Davide Baù

Postdoctoral fellow

Bryan Lajoie

Bioinformatician

Amartya Sanyal

Postdoctoral Fellow

Meg Byron

Research Associate

Marc A. Marti-Renom

Structural Genomics Unit Bioinformatics and Genomics Department Centro de Investigación Príncipe Felipe Valencia, Spain

Job Dekker

Program in Gene Function and Expression Department of Biochemistry and Molecular Pharmacology University of Massachusetts Medical School Worcester, MA, USA

Jeanne Lawrence

Department of Cell Biology University of Massachusetts Medical School Worcester, MA, USA

Thursday, November 25, 2010