Using chromosome conformation capture to assemble genomes to - - PowerPoint PPT Presentation

using chromosome conformation capture to assemble genomes
SMART_READER_LITE
LIVE PREVIEW

Using chromosome conformation capture to assemble genomes to - - PowerPoint PPT Presentation

Using chromosome conformation capture to assemble genomes to perfection Nadge Guiglielmoni, Antoine Limasset, Romain Koszul, Jean-Franois Flot SeqBio, 19-20 Novembre 2018 1 Our goal Using chromosome conformation capture = information on


slide-1
SLIDE 1

Using chromosome conformation capture to assemble genomes to perfection

Nadège Guiglielmoni, Antoine Limasset, Romain Koszul, Jean-François Flot SeqBio, 19-20 Novembre 2018

1

slide-2
SLIDE 2

Using chromosome conformation capture = information on DNA 3D structure To assemble genomes = obtain less fragmented genomes = phase haplotypes

Our goal

2

slide-3
SLIDE 3

What is chromosome conformation capture ?

3

Chromosome Conformation Capture (3C) Technique to study DNA 3D structure Published in 2002 by Job Dekker → Hi-C : whole genome contacts (2009)

Capturing chromosome conformation. Dekker et al., 2002 Comprehensive mapping of long-range interactions reveals folding principles of the human genome.Lieberman-Aiden et al., 2009 Exploring the three-dimensional organization of genomes: Interpreting chromatin interaction data. Dekker et al., 2013

slide-4
SLIDE 4

Hi–C: A comprehensive technique to capture the conformation of genomes. Belton et al., 2012

Partial map of Lachancea cidri

Chromosome conformation capture technique : Hi-C

4

slide-5
SLIDE 5

Steps : ➔ Map reads on the genome ➔ Attribute fragments ➔ Eliminate non-chimeric reads ➔ Bin reads and build a map ➔ Normalize maps

Data processing

The Hitchhiker’s guide to Hi-C analysis: Practical guidelines. Lajoie et al., 2015 5

slide-6
SLIDE 6

Data processing

The Hitchhiker’s guide to Hi-C analysis: Practical guidelines. Lajoie et al., 2015 6

Raw data Mapping Filtering

slide-7
SLIDE 7

7

Data processing : contact map

Yeast : Lachancea kluyveri

slide-8
SLIDE 8

8

Data processing : contact map

Yeast : Lachancea kluyveri

Centromere

slide-9
SLIDE 9

9

Data processing : contact map

Yeast : Lachancea kluyveri

Loops

slide-10
SLIDE 10

10

How can chromosome conformation capture data assist in genome assembly ?

➔ DNA 3D structure is linked to 1D structure

slide-11
SLIDE 11

11

Genomic distance law

contact frequency = f(genomic distance)

Contact genomics: scaffolding and phasing (meta)genomes using chromosome 3D physical signatures. Flot et al., 2015

slide-12
SLIDE 12

12

Chromosome conformation capture in assembly

Tools to scaffold assemblies or reassemble with Hi-C data :

  • GRAAL - instaGRAAL : reassemble to obtain a more likely genome

according to contact data

  • SALSA2 : scaffold by introducing Hi-C links in assembly graphs
  • 3D-DNA, Lachesis...
slide-13
SLIDE 13

13

GRAAL principle

Paired-end sequencing

High-quality genome (re)assembly using chromosomal contact data. Marie-Nelly et al., 2014

slide-14
SLIDE 14

14

SALSA2 principle

Hi-C data GFA graph

Integrating Hi-C links with assembly graphs for chromosome-scale assembly. Ghurye et al., 2018

slide-15
SLIDE 15

15

GRAAL reassembly of Adineta vaga

1 2 3 4 5 6 7 8 9 10 11

slide-16
SLIDE 16

16

SALSA2 scaffolding of Adineta vaga

1 3 2 4 6 5 7 8 9 10 11 121314

slide-17
SLIDE 17

17

GRAAL vs SALSA2

1 2 4 3 5 6 8 7 9 10 11

1 4 5 7 8 9 10 11 12 13 14 6 2 3

GRAAL SALSA2

slide-18
SLIDE 18

18

Our project

  • Make a tool for genome scaffolding using 3C/Hi-C data
  • Test our strategy by generating Hi-C data and assembling the

genome of an organism → Chaetognaths

slide-19
SLIDE 19

19

Focus on an organism : Chaetognaths

Also known as “arrow worms” Currently classified as protostomians Phylum : Chaetognatha Class : Sagittida Orders : Aphragmophora Phragmophora Class: Spadella

slide-20
SLIDE 20

Goal : study invertebrate non-model organisms Motivation : invertebrate organisms represent the majority of animal diversity but remain understudied Gathers biologists, bioinformaticians, computer scientists… http://www.itn-ignite.eu @ignite_eu

20

IGNITE Innovative Training Network

slide-21
SLIDE 21

21

slide-22
SLIDE 22

Acknowledgements

EBE, Université libre de Bruxelles Jean-François Flot CRIStAL, Université de Lille Antoine Limasset Equipe RSG, Institut Pasteur Romain Koszul Axel Cournac Lyam Baudry Martial Marbouty Rémi Montagne Agnès Thierry

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 764840

22