1
Phylogenetics 2: Phylogenetic and genealogical homology Phylogenies distinguish homology from similarity
- Analogy
- Homology
- Polarity
- Ancestral character
- Derived character
1 Homology: similarity among two or more individuals or lineages in - - PDF document
Phylogenetics 2: Phylogenetic and genealogical homology Phylogenies distinguish homology from similarity Analogy Homology Polarity Ancestral character Derived character 1 Homology: similarity among two or more individuals or
Phylogenetic perspective on homologous characters and homologous character states
Ancestral character states
Ancestral character states
C ⇒ T
SYNAPOMORPHY: a shared derived character state in two or more lineages. These must be homologous in state. This represents a true phylogenetic similarity
APOMORPHY: a derived
character state unique to a single lineages
Implicit in the above alignment is the assumption of positional homology for the “red” position above. At this position C and T are non-homologous character states. Note that the pair of T’s in the first example, and the pair of C’s in the second example represent homologous character states.
G ⇒ A
T ⇒ C ⇒ A Convergent (non-phylogenetic) similarity of nucleotide character states; i.e., homoplasy
Adapted from: Mindell and Meyer (2001) Trends Ecol Evol. (16) 434-440.
Dissociation events
GENE A is homologous as a molecular character in all five lineages GENE A’s functional role in development also is homologous in all five lineages The “character state” of GENE A is NOT homologous in lineage 5 as compared with lineages 1 and 2 even though they have the same role! This is another example of homologous characters with non-homologous character states.
The phylogenetic concept of homology allows distinction between homologous characters and non-homologous characters states in the case of evolutionary dissociation.
Phylogeny represents evolution over macro- evolutionary time-scales One small slice of the branch represents a long period of micro-evolution Within a phylogeny there are individuals connected according to their own genealogies Species 1 Species 2 Species 3 Species 4 Trees within trees: For any one instance along a branch of an organism phylogeny there is an underlying population of individuals, each with its own genealogical history. Within each individual is a genome containing thousand of genes. Depending on the amount of linkage equilibrium, each genome will contain many genetic elements with their own evolutionary histories, and many of these gene trees can be quite different from the organism tree.
A B C D E F I Conventional representation A B C D E F I Conventional representation A C D E F I C D E F I B Path 1 Path 2 A C D E F I A C D E F I C D E F I B C D E F I B Path 1 Path 2
Species 1 Species 2 Species 3 Species 4
Species 1 Species 2 Species 3 Species 4
Species 1 Species 2 Species 3 Species 4
Species 1 Species 2 Species 3 Species 4
Species 1 Species 2 Species 3 Species 4
Drift Selection Population history
Polymorphism and substitution (highly simplified) along a branch of a phylogeny
Time
Residence time: the time that a particular neutral polymorphism is present in a population. Mean residence time is determined by effective population size (Ne) Population substitution 1 Population substitution 2 Population substitution 3 Coalescent
GAC GAT GAT
A T A
GAG
A ⇒ C ⇒ A ⇒ G A ⇒ C C ⇒ A A ⇒ G
Orthologous genes are derived from the divergence of an organismal lineage; i.e., a speciation event. Thus if we look at orthologs on a phylogeny we see that their most recent common ancestor represents the coalescence of two organismal lineages.
Paralogous genes are derived from the divergence event within a genomes; i.e., a gene duplication event. In this case if we look at paralogs on a phylogeny we see that they coalesce at a gene duplication event.
A gene is pro-orthologous to another gene if they coalesce at a speciation event that predates a gene duplication event. Thus a single-copy gene in organism A is pro-
duplication events that followed the divergence of organisms A and B.
This is a term that simply takes the reverse perspective of pro-orthology. Any one of the multi-copy genes in the genome of organism B is said to be semi-orthologous to a single copy gene in the genome of organism A, if the most recent common ancestors of those genes coalesce at a point in time that predates the gene duplication event.
segments within the same gene coalesce at different ancestors. This can arise from evolutionary processes such as homologous recombination or exon shuffling.
Gametologs coalesce at an event that isolated those genes on opposite sex chromosomes; i.e., they coalesce at the point when they became isolated from the process of recombination.
Genes that coalesce at either a speciation or duplication event, but whose evolutionary histories do not fit with that of the organismal lineages which carry such genes due to one
Homologous genes found within the same organism’s genome have different evolutionary histories due to the fusion of formerly evolutionarily independent genomes, such as in endosymbiosis.
The mammalian Ldh-A and Ldh-C gene family is used as an example to illustrate the various forms of homology (ORTHOLOGY, PAROLOGY, PRO-ORTHOLOGY, and SEMI-ORTHOLOGY) that are important when dealing with gene families. Pro-orthologs pre-date the involved duplication event Examples of different types of homology in gene families: Homo and Rattus (rat) Ldh-C are orthologous. Homo Ldh-C and Homo Ldh-A are paralogous. Homo Ldh-C and Rattus Ldh-A also are paralogous. Gallus (chicken) Ldh-A is pro-orthologous to both Homo Ldh-C and Homo Ldh-A. Homo Ldh-C is semi-orthologous to the Gallus Ldh-A. All mammalian Ldh-A genes are semi-orthologous to the non-mammalian Ldh-A. Note that the gene duplication that gave rise to the Ldh-C gene is specific to an ancestor of all present-day mammals. Mammalian Ldh-C and Ldh-A genes are paralogous.
Mus Cr icetinae Homo Gallus Sc e lopor us R attus Sus Homo Sus R abbit Mus R attus L dh-A L dh-C Gene duplication event
Tandem array of 2 genes in the ancestral species
(−) (−) (+) (+) (+) (−) (−)
1 2 A1 A2 B1 B2 C1 C2 Gene “Birth”: (+) Gene “Death”: (-) ti m e Tandem array of 2 genes in the ancestral species
(−) (−) (+) (+) (+) (−) (−)
1 2 1 2 A1 A2 A1 A2 B1 B2 B1 B2 C1 C2 C1 C2 Gene “Birth”: (+) Gene “Death”: (-) ti m e
A B C A1 B1 C1
A B C A1 B1 C1
Species A Species B Species C Gene 1 Gene 2
A B C D a b c d A B C D a B C d
Gene conversion between the same gene
Parental contribution Resultant offspring Gene conversion between different genes Gene A Gene A Gene A Gene B
Among alleles within an individual: think about patterns of population coalescence Among alleles within an individual: this is a nonreciprocal exchange Among genes (paralogs) within an individual: this is also a nonreciprocal
that different genes can have different phylogenetic histories
A B C a b c
Mus Cr icetinae Homo Gallus Sc e lopor us R attus Sus Homo Sus R abbit Mus R attus L dh-A L dh-C Gene duplication event