- JSMC Practical Course - Inferring Phylogeny Based on Sequence - - PowerPoint PPT Presentation

jsmc practical course inferring phylogeny based on
SMART_READER_LITE
LIVE PREVIEW

- JSMC Practical Course - Inferring Phylogeny Based on Sequence - - PowerPoint PPT Presentation

- JSMC Practical Course - Inferring Phylogeny Based on Sequence Information Thursday Friday, March 21 22 Room 316, Philosophenweg 12 Wireless LAN: eduroam Username: tagung09@uni-jena.de Password: Gver58ges Phylogenetics -


slide-1
SLIDE 1
  • JSMC Practical Course -

Inferring Phylogeny Based on Sequence Information

Thursday – Friday, March 21 – 22 Room 316, Philosophenweg 12 Wireless LAN: eduroam Username: tagung09@uni-jena.de Password: Gver58ges

slide-2
SLIDE 2

Phylogenetics

  • Phylogeny = evolutionary history of a specific group of
  • rganisms
  • Discipline of phylogenetics aims to find a classification for

a specific group of organisms or genes that represents their true evolutionary relationship

  • Distinction between ancestral (plesiomorphic) and derived

(apomorphic) features

  • Kinds of features:
  • morphological data
  • biochemical data
  • molecular data
  • > evolve relatively continuously
  • > homologies may be detected more easily
  • > very high quantity
slide-3
SLIDE 3

Molecular phylogenetics

Types of molecular features: nucleotide sequences of ribosomal RNA- or tRNA-genes presence or absence of a certain gene within the genome genomic rearrangements presence or absence of introns amino acid sequences of proteins nucleotide sequences of protein coding genes nucleotide sequences of introns nucleotide sequences of intergenic regions single nucleotide polymorphisms (SNPs)

Phylogenetic distance

slide-4
SLIDE 4

Molecular phylogenetics

Types of molecular features: nucleotide sequences of ribosomal RNA- or tRNA-genes presence or absence of a certain gene within the genome genomic rearrangements presence or absence of introns amino acid sequences of proteins nucleotide sequences of protein coding genes nucleotide sequences of introns nucleotide sequences of intergenic regions single nucleotide polymorphisms (SNPs)

Phylogenetic distance Distinction of single individuals e.g. paternity tests or criminal biology Distinction of distantly related species broad range

  • f applications
slide-5
SLIDE 5

Interpretation of phylogenetic trees

The order of the taxa (terminal branches) is not of importance. Each sub-tree can be arbitrarily rotated at each node (so that the

  • rder of the taxa changes).

Only the topology of the tree (i.e. the node structure) specifies the phylogenetic relationship! Root Branch Node

slide-6
SLIDE 6

Types of phylogenetic trees

Cladogram Phylogram Dendrogram No meaning Feature change Time

slide-7
SLIDE 7

Display formats

slide-8
SLIDE 8

Gene trees and species trees

Speciation Gene duplication Gene loss Horizontal gene transfer

slide-9
SLIDE 9

Process of phylogeny inference

  • Data collection
  • Homologous sequences are searched based on sequence similarity
  • > BLAST
  • Multiple sequence alignment
  • Homologous sites are detected and aligned along each other
  • > MAFFT
  • Selection of an appropriate model to infer phylogeny
  • Based on the level of sequence identity/similarity among the

alignment members their phylogenetic relation is reconstructed

  • > Neighbour joining
  • > Bayesian phylogeny inference

Holder and Lewis, 2003

slide-10
SLIDE 10

Process of phylogeny inference

  • Data collection
  • Homologous sequences are searched based on sequence similarity
  • > BLAST
  • Multiple sequence alignment
  • Homologous sites are detected and aligned along each other
  • > MAFFT
  • Selection of an appropriate model to infer phylogeny
  • Based on the level of sequence identity/similarity among the

alignment members their phylogenetic relation is reconstructed

  • > Neighbour joining
  • > Bayesian phylogeny inference

Holder and Lewis, 2003

slide-11
SLIDE 11

Process of phylogeny inference

Tree construction and searching methods

  • Stepwise addition
  • Star decomposition
  • Heuristic search
  • Exact search

Tree evaluation methods (optimality criteria)

  • Minimum evolution
  • Parsimony
  • Maximum likelihood
slide-12
SLIDE 12

Tree construction and searching methods

  • Stepwise addition

Attaches linage by linage according to their relative similarity

  • Star decomposition

Joins linage by linage according to their relative similarity

slide-13
SLIDE 13

Tree construction and searching methods

  • Heuristic search

Performs branch swapping to generate alternative trees in attempt to find a better tree

  • Exact search

Searches the complete ‘space’ of possible trees

Space of all possible trees Tree quality Local

  • ptima

Global optimum

slide-14
SLIDE 14

Tree evaluation methods (optimality criteria)

Minimum Evolution

  • Uses a distance matrix to evaluate tree quality
  • For every tree the branch length are estimated that

best explain the observed distances

  • > fast
  • > can correct for unseen changes
  • > weaknesses for long branches (i.e. high

evolutionary distances)

Position 1 Position 2 Position 3 Sequence 1 A A A Sequence 2 A T G Sequence 3 A T C S 1 S 2 S 3 S 1 2 2 S 2 2 1 S 3 2 1 S 2 S 3 S 1 1 0.5 0.5 0.5

slide-15
SLIDE 15

Tree evaluation methods (optimality criteria)

Parsimony

  • Maps sequence history onto tree
  • Evaluates tree quality by finding the minimum

number of mutations that could explain the data

  • > fast enough for hundreds of sequences
  • > does not correct for multiple mutational

pathways of the same tree

  • > performs poorly if branch length differ

Position 1 Position 2 Position 3 Sequence 1 A A A Sequence 2 A T G Sequence 3 A T C A T G A T C A A A T -> A G -> C A -> G A T G A T A

slide-16
SLIDE 16

Tree evaluation methods (optimality criteria)

Maximum likelihood

  • Maps sequence history onto tree
  • Finds the tree that is most likely to explain the data
  • > captures all possible mutational pathways
  • > corrects for multiple mutational events at the

same site

  • > slow
slide-17
SLIDE 17

Process of phylogeny inference

Tree construction and searching methods

  • Stepwise addition
  • Star decomposition
  • Heuristic search
  • Exact search

Tree evaluation methods (optimality criteria)

  • Minimum evolution
  • Parsimony
  • Maximum likelihood

Produce only

  • ne tree

No information about the reliability of single branches

  • > Bootstrapping
slide-18
SLIDE 18

Bootstrapping

  • Creates pseudo-replicates of original data
  • Performs the same tree search for all pseudo-

replicates and stores the trees

  • The reliability of a certain grouping is determined

based on the number of trees that show this grouping

  • > very time consuming

Holder and Lewis, 2003

slide-19
SLIDE 19

Bayesian phylogenetics

  • Performs tree search and measure of support

simultaneously

  • Uses Markov chain Monte Carlo (MCMC)

simulations to produce alternative trees

  • Not a strict ‘hill-climber’ (does not only accept

better trees)

  • Higher probability to reach global optimum

Holder and Lewis, 2003

slide-20
SLIDE 20

Bayesian phylogenetics

slide-21
SLIDE 21
  • Monophyletic group
  • Analogy
  • Paraphyletic group
  • Homology
  • Polyphyletic group
  • Homoplasy
  • Apomorphic feature
  • Convergence
  • Plesiomorphic feature
  • MRCA
  • Autapomorphy
  • Extant species
  • Synapomomorphy
  • Extinct species
  • Symplesiomorphy
  • Dichotomy
  • Polytomy

Basic terms of phylogenetics

slide-22
SLIDE 22

The Tree Thinking Challenge

Is the frog more closely related to the human or to the fish?

slide-23
SLIDE 23

The Tree Thinking Challenge

Is the frog more closely related to the human or to the fish?

slide-24
SLIDE 24

The Tree Thinking Challenge

slide-25
SLIDE 25

The Tree Thinking Challenge

slide-26
SLIDE 26

The Tree Thinking Challenge

slide-27
SLIDE 27

The Tree Thinking Challenge

slide-28
SLIDE 28

The Tree Thinking Challenge

slide-29
SLIDE 29

The Tree Thinking Challenge

slide-30
SLIDE 30

The Tree Thinking Challenge

slide-31
SLIDE 31

The Tree Thinking Challenge

slide-32
SLIDE 32

The Tree Thinking Challenge

slide-33
SLIDE 33

The Tree Thinking Challenge

slide-34
SLIDE 34

The flower development of angiosperms

Sepals Petals Stamens Carpels Ovules

slide-35
SLIDE 35

The ABCDE model

Sepals Petals Stamens Carpels Ovules

slide-36
SLIDE 36

APETALA1 APETALA3 PISTILLATA AGAMOUS SHATTERPROOF1 SEEDSTICK SEPALLATA3 SEPALLATA2 SEPALLATA1 SEPALLATA4 SHATTERPROOF2

The ABCDE model

slide-37
SLIDE 37

The floral quartet model

slide-38
SLIDE 38

Phylogeny of seed plants

Gymnosperms Magnoliids Monocots Core eudicots Basal angiosperms Basal eudicots Arabidopsis thaliana

slide-39
SLIDE 39

APETALA1 APETALA3 PISTILLATA AGAMOUS SEPALLATA3 SEPALLATA2 SEPALLATA1 SEPALLATA4 Gymnosperms Magnoliids Monocots Core eudicots Basal angiosperms Basal eudicots Arabidopsis thaliana

Phylogeny of seed plants

?

slide-40
SLIDE 40
  • Search for orthologs of floral

homeotic genes in distantly related angiosperm and gymnosperm species

  • Examine the phylogenetic

relationship of the gene families Gymnosperms Magnoliids Monocots Core eudicots Basal angiosperms Basal eudicots Arabidopsis thaliana

Phylogeny of seed plants

APETALA1 APETALA3 PISTILLATA AGAMOUS SEPALLATA3 SEPALLATA2 SEPALLATA1 SEPALLATA4