Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a - PowerPoint PPT Presentation

1 Accepted Point Mutation (Dayhoff et al. 68,72,78) • “An APM in a protein is a replacement of one AA by another accepted by evolution” • We want to estimate the • probability that given a site with AA A has udergone an APM, the new AA is B • the rates each AA undergoes an APM • Dayhoff et al. estimated those from hypothetically constructed phylogenetic trees • originally phylogenetic trees were used to represent evolutionary relationship between species • they can be used to represent relationship between sequences • trees relating the sequences in 71 families were constructed using the parsimony method

2 The parsimony method for phylogenetic trees • Look for a tree that can relate the observed sequences with a minimal number of substitutions • typically it is not unique • An example of the most parsimonuous phylogenetic trees for the family of sequences AA , AB , BB :

3 Estimating transition probabilities from trees • The transition frequencies were estimated from the neighboring sequences on the phylogenetic trees: • If A and B are aligned in two nodes of the tree connected by and edge then the A → B and the B → A counts are incremented • Within each of the 71 families the counts are averaged over all possible most parsimonuous trees • The 71 families considered had the property that any pair of sequences in them agreed in ≥ 85% of the sequences • This restriction hopefully reduced to negligible the number of edges along which two APMs occurred in the same site • Dividing those counts by the total number of times A mutated yields an estimate of the conditional probability that A mutated to B given that it mutated

4 Counted transitions ( × 10)

5 Estimating the “mutability” from trees • Dayhoff et al. estimated the rates at which an AA undergoes mutation by dividing the number of times it mutated by the number of times at appears in the phylogenetic trees • They define the Markov chain transition matrix: T AB p AB = m A � C � = A T AC

6 Transition matrix for PAM1 ( × 10 4 )

7 PAM X vs. % identity

Biochemistry: Henikoff and Henikoff Proc. Natl. Acad. Sci. USA 89 (1992) 10917 Performance of substitution matrices in aligning three mation of mutation rates. Nevertheless, the BLOSUM series Table 1. based on percent clustering of aligned segments in blocks can serine proteases be compared to the Dayhoff matrices based on PAM using a Residue positions missed* Matrix measure of average information per residue pair in bit units Program All positions Side chains aligned called relative entropy (9). Relative entropy is 0 when the target (or observed) distribution of pair frequencies is the 12 6 MSA same as the background (or expected) distribution and in- PAM 120 31 22 MULTALIN creases as these two distributions become more distinguish- PAM 160 30 22 MULTALIN able. Relative entropy was used by Altschul (9) to charac- PAM 250 30 22 MULTALIN terize the Dayhoff matrices, which show a decrease with +6/-i MULTALIN 34 26 increasing PAM. For the BLOSUM series, relative entropy BLOSUM 45 9 5 MULTALIN BLOSUM 62 6 4 increases nearly linearly with increasing clustering percent- MULTALIN BLOSUM 80 9 6 MULTALIN age (Fig. 1). Based on relative entropy, the PAM 250 matrix is comparable to BLOSUM 45 with relative entropy of =0.4 bit, *From data of Greer (22), where residues were considered to be while PAM 120 is comparable to BLOSUM 80 with relative aligned whenever a-carbons occupied comparable positions in space (All positions column). For a subset (Side chains column), entropy of =1 bit. BLOSUM 62 (Fig. 2 Lower) is intermediate residues were excluded where there were differences in the posi- in both clustering percentage and relative entropy (0.7 bit) tions of side chains. and is comparable to PAM 160. Matrices with comparable relative entropies also have similar expected scores. parable numbers were obtained when residues that show Some consistent differences are seen when PAM 160 is differences in the positions of side chains were excluded. subtracted from BLOSUM 62 for every matrix entry (Fig. 2 Therefore, BLOSUM matrices produced accurate global align- Upper). Compared to PAM 160, BLOSUM 62 is less tolerant to ments of these sequences. substitutions involving hydrophilic amino acids, while it is Performance in Searching for Homology in Sequence Data more tolerant to substitutions involving hydrophobic amino Banks. To determine how BLOSUM matrices perform in data acids. For rare amino acids, especially cysteine and tryp- bank searches, we first tested them on the guanine nucleo- tophan, BLOSUM 62 is typically more tolerant to mismatches tide-binding protein-coupled receptors, a particularly chal- than is PAM 160. lenging group that has been used previously to test searching Performance in Multiple Alignment of Known Structures. and alignment programs (10, 18, 23, 24). Three diverse One test of sequence alignment accuracy is to compare the queries, LSHR$RAT, RTA$RAT, and UL33$HCMVA, results obtained to alignments seen in three-dimensional were chosen from among the 114 full-length family members structures. Lipman et al. (21) applied a simultaneous multiple catalogued in Prosite based on the observation that none alignment program, MSA, to 3 similarly diverged serine pro- detected either of the others in searches. The number of teases of known three-dimensional structures. They found misses was averaged in order to assess the overall searching that for 161 closely aligned residue positions, 12 residues performance of different matrices for this group. Three were involved in misalignments. We asked how well a different programs were used-BLAST (11), FASTA (19), and hierarchical multiple alignment program, MULTALIN (17), Smith-Waterman (20). BLAST rapidly determines the best performs on the same proteins using different substitution ungapped alignments in a data bank. FASTA is a heuristic and matrices. Table 1 shows that MULTALIN performs much Smith-Waterman is a rigorous local alignment program; both worse than MSA using the PAM 120, 160, or 250 matrices, can optimize an alignment by the introduction of gaps. misaligning residues at 30-31 positions. In comparison, MUL- 8 TALIN with a simple +6/-i matrix (that assigns +6 to Several BLOSUM and PAM matrices in the entropy range of PAM 160 vs. BLOSUM 62 0.15-1.2 were tested. matches and -1 to mismatches) misaligns residues at 34 positions. In the same test using BLOSUM 45, 62 and 80, Results with each of the 3 programs show that all BLOSUM MULTALIN misaligned residues at only 6-9 positions. Com- matrices in the 0.3-0.8 range performed better than the best V F Y W G D E Q H R K M I L C S T P A N 1 2 -2 0 0 2 4 5 5 C 0 0 2 1 1 2 1 2 1 -1 1 -1 0 0 0 0 0 1 0 1 1 2 0 -2 0 -1 0 1 1 -1 S 0 1 3 T 0 0 0 0 0 0-1 0-1 1 1 C 9 2-1 -1 -1 0 0-1 -1-1 1 0-1 0 2 1 P S-1 2-2 -1 -1 0 1 4 0 2 A 0 0 1 0 0 1 1 1 T-1 1 5 2-1 -2 -2 -1 1 0 -1 2 4 G 2 0 -1 -2 0 1 1 0 0-1 1 P -3 -1-1 7 0 -1 0 0 0 N 0 1 3 -1 -1 0 0 1-1 0-1 A 0-1 4 3 D 2 0-1 0 0 0 0 2 1 G -3 0-2 -2 0 6 -1 -1 -1 0 2 2 4 E 0 6 1 0 0 2 2 1-1 0 N -3 1 0-2 -2 0 0 3 3 Q D -3 -2 6 0-2 0 1 1-1 1 0-1 -1 -1 1 0 0 0-1 1 2 2 H 0-1 -1 -1 -2 0 2 5 2-1 1 E -4 0 3 -4 R 0 2 5 -1-1 0-1 1 1 Q -3 0-1 -1 -1 -2 0 1 2 3 1 K -2 -2 -2 -2 1-1 0 0 8 1-2 -1 1 H -3 -1 0 2 0-2 0 0 5 -2 -1 -1 1 4 M R -3 -1 -1 -2 -1 -2 1 0 0 0-1 -1 -1 -20-11 1-1 2 5 -1 1 1 3 I K -3 0 -1 -1 1 2 M -1 -1 -1 -2 -1 -3 -2 -3 -2 0-2 -1 -1 5 L 0 -4 -3 -3 -3 -3 -3 -3 -3 4 1 2 4 V -2 -3 -1 1 I -1 -1 -2 1 F -3 -3 -4 -3 -2 -3 -2 -2 2 2 4 -1 L -1 -2 -1 -1 -4 4 2 Y 0-3 -3 -3 -2 -2 -3 -3 -2 1 3 1 -1 V -1 -2 0-2 -1 -1W 0 0 6 -2 -2 -2 -3 -3 -3 -3 -3 -1 -3 -3 0 F -2 -4 -1 -2 -3 -2 -3 -2 2 -2 -2 -1 -1 -1 3 7 Y -2 -2 -2 -3 -1 -3 2 11 -3 -2 -3 -2 -2 -3 -3 -1 -3 -2 1 W -2 -3 -2 -4 -4 -4 V F Y W Q R K M I L C S T P A G N D E H BLOSUM 62 substitution matrix (Lower) and difference matrix (Upper) obtained by subtracting the PAM 160 matrix position by position. FIG. 2. These matrices have identical relative entropies (0.70); the expected value of BLOSUM 62 is -0.52; that for PAM 160 is -0.57.

Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a - PowerPoint PPT Presentation

1 Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a protein is a replacement of one AA by another accepted by evolution We want to estimate the probability that given a site with AA A has udergone an APM, the new AA

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

Drift, mutation, selection, and the evolutionary dispersion of mean phenotypes recombination

Gate 3 Project Status Update Offer Acceptance Area DSO offers TSO Offers Accepted Accepted

Craig Menear Chairman, CEO & President Diane Dayhoff Vice President, Investor Relations

Diane Dayhoff Vice President, Investor Relations Ben Finger Senior Manager, Investor Relations

Heuristic Approaches Mark Voorhies 5/5/2017 Mark Voorhies Heuristic Approaches PAM (Dayhoff)

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 & Gregory

BRCA Founder Mutations Ashkenazi Jewish (Hungarian and Russian): BRCA1 - 185delAG and

Equivalence Modulo States Bo Wang, Yingfei Xiong , Yangqingwei Shi, Lu Zhang, Dan Hao Peking

Mutation Testing in Python Austin Bingham @austin_bingham @sixty_north 1 Sunday, October 4, 15

Mutation models: probabilistic study and parameter estimation Adrien Mazoyer, supervised by

CSE 331 Mutation and immutability slides created by Marty Stepp based on materials by M. Ernst,

2: Old English Sound Laws and Verbs Proto-Germanic fts , dative fti , plural *ftiz Front

Milu: A Higher Order Mutation Testing Tool Yue Jia University College London Joint work with

Advances in Mutation Testing Research for C++ Pedro Delgado-Prez TAROT: Intro Talk June 2015

Maximal green sequences of minimal mutation-infinite quivers John Lawson joint with Matthew

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog 1 , Marinela Parovic 1 , us

Simultaneous Identification of Multiple Driver Pathways in Cancer Mark D. M. Leiserson, et.al

Mutation Testing How good your tests are 2017 whoami iOS Developer by day compiler

Semantic Mutation Testing John A. Clark, Haitao Dan, Robert M Hierons The 8th CREST Open

Testing Query Execution Engines with Mutations Xinyue Chen 1 , Chenglong Wang 1 , Alvin Cheung 2 1

Comparing Global and Local Mutations on Bit Strings This work benefited from being presented and

Time and chance happeneth to them all: Mutation, selection and recombination Steven N. Evans

Sambuz

Useful Links

Newsletter

Mail Us

Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a - PowerPoint PPT Presentation

1 Accepted Point Mutation (Dayhoff et al. 68,72,78) An APM in a protein is a replacement of one AA by another accepted by evolution We want to estimate the probability that given a site with AA A has udergone an APM, the new AA

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

Drift, mutation, selection, and the evolutionary dispersion of mean phenotypes recombination

Gate 3 Project Status Update Offer Acceptance Area DSO offers TSO Offers Accepted Accepted

Craig Menear Chairman, CEO &amp; President Diane Dayhoff Vice President, Investor Relations

Diane Dayhoff Vice President, Investor Relations Ben Finger Senior Manager, Investor Relations

Heuristic Approaches Mark Voorhies 5/5/2017 Mark Voorhies Heuristic Approaches PAM (Dayhoff)

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 &amp; Gregory

BRCA Founder Mutations Ashkenazi Jewish (Hungarian and Russian): BRCA1 - 185delAG and

Equivalence Modulo States Bo Wang, Yingfei Xiong , Yangqingwei Shi, Lu Zhang, Dan Hao Peking

Mutation Testing in Python Austin Bingham @austin_bingham @sixty_north 1 Sunday, October 4, 15

Mutation models: probabilistic study and parameter estimation Adrien Mazoyer, supervised by

CSE 331 Mutation and immutability slides created by Marty Stepp based on materials by M. Ernst,

2: Old English Sound Laws and Verbs Proto-Germanic *fts , dative *fti , plural *ftiz Front

Milu: A Higher Order Mutation Testing Tool Yue Jia University College London Joint work with

Advances in Mutation Testing Research for C++ Pedro Delgado-Prez TAROT: Intro Talk June 2015

Maximal green sequences of minimal mutation-infinite quivers John Lawson joint with Matthew

Mutation Analysis for Coq Pengyu Nie 1 Ahmet Celik 1 , Karl Palmskog 1 , Marinela Parovic 1 , us

Simultaneous Identification of Multiple Driver Pathways in Cancer Mark D. M. Leiserson, et.al

Mutation Testing How good your tests are 2017 whoami iOS Developer by day compiler

Semantic Mutation Testing John A. Clark, Haitao Dan, Robert M Hierons The 8th CREST Open

Testing Query Execution Engines with Mutations Xinyue Chen 1 , Chenglong Wang 1 , Alvin Cheung 2 1

Comparing Global and Local Mutations on Bit Strings This work benefited from being presented and

Time and chance happeneth to them all: Mutation, selection and recombination Steven N. Evans

Sambuz

Useful Links

Newsletter

Mail Us

Craig Menear Chairman, CEO & President Diane Dayhoff Vice President, Investor Relations

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 & Gregory

2: Old English Sound Laws and Verbs Proto-Germanic fts , dative fti , plural *ftiz Front