Analyzing the Operational RNA Code for Amino Acids - Using R Read / - - PowerPoint PPT Presentation

analyzing the operational rna code for amino acids using r
SMART_READER_LITE
LIVE PREVIEW

Analyzing the Operational RNA Code for Amino Acids - Using R Read / - - PowerPoint PPT Presentation

Analyzing the Operational RNA Code for Amino Acids - Using R Read / Contact me: Read / Contact me: R-statistics.com Tal Galili [1] , Shaul Shaul [2] , Dror Berel [3] , Yoav Benjamini [4] [1] Department of Statistics and Operations Research, Tel


slide-1
SLIDE 1

Analyzing the Operational RNA Code for Amino Acids - Using R

Read / Contact me:

Tal Galili[1], Shaul Shaul[2], Dror Berel [3], Yoav Benjamini [4]

[1] Department of Statistics and Operations Research, Tel Aviv University, Israel. [2] Department of Zoology, Tel Aviv University, Israel. [3] Cedars-Sinai Medical Center. [4] Department of Statistics and Operations Research, Tel Aviv University, Israel.

Read / Contact me: R-statistics.com

slide-2
SLIDE 2

Story

slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5

ATP

slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8
slide-9
SLIDE 9
slide-10
SLIDE 10

Data

slide-11
SLIDE 11

Data source

slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14

55 Archaea tRNA sequences

slide-15
SLIDE 15

55 Archaea tRNA sequences

slide-16
SLIDE 16

Visualize

slide-17
SLIDE 17

2342 2342 Data rows

slide-18
SLIDE 18

Mosaic plot: nucleotide distribution in tRNA acceptor stem

slide-19
SLIDE 19

Sequence logo plot

(Schneider and Stephens - 1990)

slide-20
SLIDE 20

Title ? Colors? X labels? Colors?

slide-21
SLIDE 21
  • > “grid“ graphics….
slide-22
SLIDE 22

Title ? Colors? X labels? Colors?

slide-23
SLIDE 23
slide-24
SLIDE 24
slide-25
SLIDE 25

Codon distribution, per location, per Amino Acid

slide-26
SLIDE 26

Analyze

slide-27
SLIDE 27
slide-28
SLIDE 28

CART

(Classification and regression trees)

  • Goal predict target variable
  • Method: recursively partition

explanatory variables

slide-29
SLIDE 29

CART

(Classification and regression trees)

  • !"
  • - “The Blind Men and the Elephant” by John Godfrey Saxe (1816-1887)
slide-30
SLIDE 30

CART

with default values

Library(rpart)

slide-31
SLIDE 31

CART

Predictive success ? (use.n = T)

slide-32
SLIDE 32

CART

Percent of success in prediction

Red = bellow 60% fit

slide-33
SLIDE 33

Cross validation – Relative error

slide-34
SLIDE 34

Cross validation – Misclassification rate

slide-35
SLIDE 35

Bigger tree?

slide-36
SLIDE 36

Bigger tree?

slide-37
SLIDE 37

Cross validation – Misclassification rate

slide-38
SLIDE 38

Relative importance for each letter in the model

(Sum of information gain in the use of each letter in the model)

slide-39
SLIDE 39

Relative importance for each letter in the model

slide-40
SLIDE 40

Results

  • 20% misclassification in prediction
  • Same acceptor stem RNA can code for for different Amino Acids
  • In different Archaeas
  • In the same Archaea

Conclusions Conclusions

  • There is code in the acceptor stem
  • It is not enough
slide-41
SLIDE 41

Read / Contact me:

Thank you !

Read / Contact me: R-statistics.com

Tal Galili[1], Shaul Shaul[2], Yoav Benjamini [3], Dror Berel [4]

[1] Department of Statistics and Operations Research, Tel Aviv University, Israel. [2] Department of Zoology, Tel Aviv University, Israel. [3] Department of Statistics and Operations Research, Tel Aviv University, Israel. [4] Cedars-Sinai Medical Center.