Contents PRO-Decoder Function Methods Results Abstract - - PowerPoint PPT Presentation
Contents PRO-Decoder Function Methods Results Abstract - - PowerPoint PPT Presentation
Contents PRO-Decoder Function Methods Results Abstract Experiment Computer RBS-Decoder TER-Decoder Transcriptional level Translational level Transcriptional level PRO-Decoder Synoproteinerc Transcriptional level Translational level
Contents
Experiment Computer
PRO-Decoder
Abstract Function Methods Results
Translational level Transcriptional level Transcriptional level
PRO-Decoder RBS-Decoder TER-Decoder Synoproteinerc
Translational level Transcriptional level Transcriptional level
PRO-Decoder
Transcription: the binding of RNA polymeraze to promoter
RNA polymerase
PRO-Decoder
Abstract Function Methods Results
Transcription: the binding of RNA polymeraze to promoter
RNA polymerase
Sigma factor
PRO-Decoder
Abstract Function Methods Results
Transcription: the binding of RNA polymerase to promoter
PRO-Decoder
Abstract Function Methods Results
Transcription: the binding of RNA polymeraze to promoter
Consensus sequence Sigma factor binding site
PRO-Decoder
Abstract Function Methods Results
Transcription: various sigma factors Promoter strength Similarity
PRO-Decoder
Abstract Function Methods Results
Type Consensus Spacer Consensus Sigma 70 TTGACA 15-20 TATAAT Sigma 54 TGGCAC 5 TTGCW Sigma S / / CTATACT Sigma 32 CTTGAAA 11-16 CCCATNT Sigma 28 TAAA 15 GCCGATAA Sigma 24 GAACTT 16-17 TCTRA
Promoter type
PRO-Decoder
Abstract Function Methods Results
Other Transcription Factors (TF)
Trancription Factor Binding Site Consensus
similarity
TF
PRO-Decoder
Abstract Function Methods Results
Consensus Consensus
imilarity S Matrix Score
PRO-Decoder
Abstract Function Methods Results CTGACG TTGACA N17 TATAAT
TFBS Sigma factor biding sites TSS
Matrix Similarity Score
Position Weight Matrix ( PWM )
Kel, A. E.; Gößling, E.; Reuter, I.; Cheremushkin, E.; Kel-Margoulis, O. V.; Wingender, E., MATCHTM: a tool for searching transcription factor binding sites in DNA sequences. Nucleic acids research 2003, 31 (13), 3576-3579
Min Max Min Current mss
L i bi i
f i I Current
1 ,
) (
L i f f i I
C G T A B B i B i
,..., 2 , 1 , 4 ln ) (
, , , , ,
bi if , Derived from RegulonDB Posi- tion
1 2 3 4 5 … 13 A 0.53 0.63 0.31 0.56 0.31 … 0.63 T 0.09 0.07 0.40 0.09 0.19 … 0.21 G 0.07 0.08 0.11 0.12 0.07 … 0.02 C 0.31 0.22 0.18 0.23 0.43 … 0.14 I(i) 0.28 0.37 0.11 0.25 0.16 … 0.41 Position Frequency Matrix of Ada
A , 3
f
PRO-Decoder
Abstract Function Methods Results
L i if i I Max
1 max) ( :
L i i
f i I Min
1 min
) ( :
Recognition and location of sigma factor binding sites
Similarity score=MSS(1)+MSS(2)
MSS(1) MSS(2)
- 35 region
- 10 region
spacer
Promoter sequence
PRO-Decoder
Abstract Function Methods Results
spacer
MSS(1) MSS(2)
- 35 region
- 10 region
Promoter sequence
Relative Strength =
Prediction of promoter strength
Similarity score Spacer score
PRO-Decoder
Abstract Function Methods Results
+ Similarity score Spacer score
Other possible TF
44 12 BaeR CpxR 23
CGGATCCTAC CTGACGCTT
AraC
TTCTCCATA ATTGGCGC GTAAAGAT GGGTAAA
0.96 0.92 0.85
PRO-Decoder
Abstract Function Methods Results
95%
Sigma factor type:
TFBS Location (sigma 70) Accuracy 64%
Type Sigma 70 Sigma 28 Sigma 24 Sigma 54 Sigma 38 Sigma 32 Sample size 50 10 10 10 10 10 Average accuracy
92%
56%
PRO-Decoder
Abstract Function Methods Results
Data verification
Promoter strength prediction Promoter BBa _K206000 GATAGT
- 10 Region
BBa _K1070003 4437.2510 1.5214
PRO-Decoder
Abstract Function Methods Results
Strength Prediction
Experimental Results Prediction strength
Promoter strength prediction
PRO-Decoder
Abstract Function Methods Results
93% Sigma70 Ada AgaR AraC 33 25 56 7 15 65% ATCATCCCGC
GCGCAAGATTG TTGGTTTTTGCGT TTTCGTTTT ATTTTTATCTC TAGCGGATCC TACCTGA0.987 0.956 0.924
PRO-Decoder
Abstract Function Methods Results
Position weight matrix Similarity score+ spacer score
Translational level Transcriptional level Transcriptional level
PRO-Decoder TER-Decoder
Translational level Transcriptional level Transcriptional level
PRO-Decoder RBS-Decoder TER-Decoder
Spacer Start codon SD sequence 3' 5' 5' 3'
Ribosome
RBS-Decoder
Abstract Methods
PWM method
RBS strength = MSS(SD)+ spacer score
GeneMark.hmm: new solutions for gene finding, Alexander V. Lukashin and Mark Borodovsky
Table 1. Nucleotide frequencies for the RBS model
Nucleotide Position
1 2 3 4 5 T 0.161 0.050 0.012 0.071 0.115 C 0.077 0.037 0.012 0.025 0.046 A 0.681 0.105 0.105 0.861 0.164 G 0.077 0.808 0.960 0.043 0.659
RBS-Decoder
Abstract Methods
Correlation between the experimental data and prediction
http://parts.igem.org/Ribosome_Binding_Sites/Prokaryotic/Constitutive/Community_Collection.
RBS-Decoder
Abstract Methods
R² = 0.8039 0.8 0.82 0.84 0.86 0.88 0.9 0.92 0.03 0.05 0.07 0.09 0.11 0.13 Predicted Strength 线性 (Predicted Strength)Predicted Strength Experimental Strength
Actual and prediction strength correlationAGGAG 12 ATG 100%
agataagatagcgataga
Position weight matrix Similarity score + spacer score
Translational level Transcriptional level Transcriptional level
PRO-Decoder RBS-Decoder TER-Decoder Synoproteiner
- Analysis
- Prediction
Protein Synonymous
- Analysis
- Prediction
SynoProteiner
UUU UUU UUC UUC UU UUA UUG UUG CU CUU CU CUC CU CUA CU CUG
Ph Phe Leu Leu
Codon usage bias
3 1
Single codon
SynoProteiner
Theory Operation Extra Future
UUU UUU UU UUA UUC UUC UUG UUG UUU UUU UUG UUG
Phe Phe Leu Leu
Codon Pair
GC GCG CCU CU GC GCU ACG Best point
Codon pair Single codon Ideal point
SynoProteiner
Theory Operation Extra Future
NSGA-II Algorithm
g k g sc et t sc sc
k c r k c r g g fit
1 arg
)) ( ( )) ( ( 1 ) (
1 1
)) 1 ( ), ( ( 1 1 ) (
g k cp
k c k c w g g fit
Chung, B.; Lee, D.-Y., Computational codon
- ptimization of synthetic gene for protein expression.
BMC systems biology 2012, 6 (1), 134. China patent, 200780024670.5[P]. 2009-07-22
SynoProteiner
Theory Operation Extra Future
SynoProteiner
Theory Operation Extra Future
SynoProteiner
Theory Operation Extra Future
SynoProteiner
Theory Operation Extra Future
SynoProteiner
Theory Operation Extra Future
Analysis
SynoProteiner
Theory Operation Extra Future
Prediction
1 1 1 1 2 2 2 1 3 3 3 1 11 ( , ) 1 1 ( , ) 2 1 ( , ),( ) 3 1 ( , )
L i i i L i i i L i i i L i i iR R L R R L R R L L R R L
Chou, K. C., Prediction of protein cellular attributes using pseudo‐amino acid composition. Proteins: Structure, Function, and Bioinformatics 2001, 43 (3), 246-255.
SynoProteiner
Theory Operation Extra Future
Prediction
1 1 1 1 2 2 2 1 3 3 3 1 11 ( , ) 1 1 ( , ) 2 1 ( , ),( ) 3 1 ( , )
L i i i L i i i L i i i L i i iR R L R R L R R L L R R L
Chou, K. C., Prediction of protein cellular attributes using pseudo‐amino acid composition. Proteins: Structure, Function, and Bioinformatics 2001, 43 (3), 246-255.
SynoProteiner
Theory Operation Extra Future
1 1 1 1 2 2 2 1 3 3 3 1 11 ( , ) 1 1 ( , ) 2 1 ( , ),( ) 3 1 ( , )
L i i i L i i i L i i i L i i iR R L R R L R R L L R R L
Chou, K. C., Prediction of protein cellular attributes using pseudo‐amino acid composition. Proteins: Structure, Function, and Bioinformatics 2001, 43 (3), 246-255.
- 15
- 10
- 5
- 15
- 10
- 5
ln[kf(Predicted)/s] ln[kf(Experimental)/s]
PREDICTION RESULTS THEORETICAL RESULTSPrediction
Time Accuracy Database
SynoProteiner
Theory Operation Extra Future
Analysis Prediction Optimization
SynoProteiner
Theory Operation Extra Future
Contents
The sequence The process of recording experiment
Optimize
E ' NOTE
Abstract Templates Future Tools
Hard Waste Boring
E ' NOTE
Abstract Templates Future Tools
Web app
E ' NOTE
Abstract Templates Future Tools
Web app Multi-users
E ' NOTE
Abstract Templates Future Tools
Web app Multi-users Templates
E ' NOTE
Abstract Templates Future Tools
Web app Multi-users Templates
E ' NOTE
Abstract Templates Future Tools
Auto-calculate?
User E' NOTE
Yes, I can!
E' NOTE
E ' NOTE
Abstract Templates Future Tools
Auto-filling?
User E' NOTE
Yes, I can!
E' NOTE
E ' NOTE
Abstract Templates Future Tools
Can you connect a plasmid database to it?
User E' NOTE
Yes, I can!
E' NOTE
Abstract Templets Future Tools
E ' NOTE
Abstract Templates Future Tools
Can you organize the plasmid database?
User E' NOTE
Yes, I can!
E' NOTE
E ' NOTE
Abstract Templates Future Tools
Templates may not be enough…
User E' NOTE
It doesn’t matter.
E' NOTE
E ' NOTE
Abstract Templates Future Tools
Templates may not be enough…
User E' NOTE
It doesn’t matter.
E' NOTE
E ' NOTE
Abstract Templates Future Tools
Templates may not be enough…
User E' NOTE
It doesn’t matter.
E' NOTE
E ' NOTE
Abstract Templates Future Tools
EASIER!
Data-output & wiki-build
I am sick of writing experiment to the wiki.
User
E ' NOTE
Abstract Templates Future Tools
E' NOTE
Don’t worry
XMU-China has used it.
I am sick of writing experiment to the wiki.
User
E ' NOTE
Abstract Templates Future Tools
E' NOTE
Don’t worry
XMU-China has used it.
I am sick of writing experiment to the wiki.
User
E ' NOTE
Abstract Templates Future Tools
E' NOTE
Don’t worry
E ' NOTE
Abstract Templates Future Tools
E ' NOTE
Abstract Templates Future Tools
From Internet:
E ' NOTE
Abstract Templates Future Tools
From Wellesley:
E ' NOTE
Abstract Templates Future Tools
E ' NOTE
Abstract Templates Future Tools
Image-upload Web-app Multi-user E-mail sender Templates Plasmid database Record Picture Text Table Wiki Experiment record XML Data output Tools PHP HTML5 Javascript CSS3
E ' NOTE, designed for iGEMers.
E ' NOTE
Abstract Templates Future Tools
Contents
Human Practice
Lecture Party Communication
Human Practice
Lecture Party Communication
Safety prombles on information The warm scene of lecture
Human Practice
Lecture Party Communication
He Help lp th them em st stud udy Co Conv nvenient enient Br Brea eak k th the e bar arric ricad ades es No No id idea ea Yes es No No
Interest In E'NOTE Reasons
Human Practice
Lecture Party Communication
Lantern riddles Finding differences in DNA Foldit Protein Game Synthetic-biology-kill
Human Practice
Lecture Party Communication
Xiamen and Peiking iGEMers iGEMers from Xiamen University and Nanjing University
Human Practice
Lecture Party Communication
Review
PRO-Decoder
Predict the strength of promoter and RBS
Best
Review
Synoproteiner
Analysis and opitimize the protein
Best
Review
E‘ NOTE
Best
Record and simplify experiments
Analysis and opitimize the protein
Synoproteiner
Review
Predict the strength of promoter and RBS
PRO- Decoder
E‘ NOTE
Record and simplify experiments
- Prof. Zhiliang Ji
- Prof. Shoufa Han
Hongchun Li Tina Zhang Team XMU China Qiang Kou Wenjun Rao
Acknowledge
China patent, 200780024670.5[P]. 2009-07-22
Appendix
Fitness:
))) , (( )), , (( max( )) , (( )) , (( )) , ((
exp exp j i combi j i high
- bs
j i high
- bs
j i combi j i
c c n c c n c c n c c n c c w
) ( ) ( exp
)) , (( ) ( ) ( )) , ((
j l i k
c syn c c syn c j i high
- bs
j all sc i all sc j i combi
c c n c r c r c c n
) ( ) ( 2 ) (
arg k all sc k high sc k et t sc
c r c r c r
Prediction:
1 1 1 1 2 2 2 1 3 3 3 1 11 ( , ) 1 1 ( , ) 2 1 ( , ),( ) 3 1 ( , )
L i i i L i i i L i i i L i i iR R L R R L R R L L R R L
- 1. Chou, K. C., Prediction of protein cellular attributes using pseudo‐amino acid composition. Proteins: Structure, Function, and Bioinformatics 2001, 43 (3), 246-255.
- 2. Galzitskaya, O. V.; Garbuzynskiy, S. O.; Ivankov, D. N.; Finkelstein, A. V., Chain length is the main determinant of the folding rate for proteins with three‐state folding
- kinetics. Proteins: Structure, Function, and Bioinformatics 2003, 51 (2), 162-166.
i i
R , ) ( ) ( )
j j
R H R H R (
Appendix