SLIDE 8 16
Homework Assignment:
Protein Expression on Silico 1) Reading a Fasta file and reformat it 2) Transcription on Silico 3) Translation
1 ggcgcacata gcgacttggt gggcgcgtcc agtgatgact gggggatccc ggcaagtaac 61 atgactaaaa agaagcggga gaatctgggc gtcgctctag agatcgatgg gttagaggag 121 aagctgtccc agtgtcggag agacctggag gccgtgaact ccagactcca cagccgggag 181 ctgagcccag aggccaggag gtccctggag aaggagaaaa acagcctaat gaacaaagcc 241 tccaactacg agaaggaact gaagtttctt cggcaagaga accggaagaa catgctgctc 301 tctgtggcca tctttatcct cctgacgctc gtctatgcct actggaccat gtgagcctgg 361 cacttcccca caaccagcac aggcttccac ttggcccctt ggtcaggatc aagcaggcac 421 ttcaagcctc aataggacca aggtgctggg gtgttcccct cccaacctag tgttcaagca 481 tggcttcctg gcgcccagcc ttgcctccct ggcctgctgg ggggttccgg gtctccagaa 541 ggacatggtg ctggtccctc ccttagccca agggagaggc aataaagaac acaaagctgt 601 tcccgtaaaa aaaaaaaaaa aaaaaaaaaa aaa
17 Can you answer a few questions with your code: 1) Sequence length; 2) Base content; 3) Print the reverse complementary strand; 4) Think about how to translate this gene;
Code Hints:
1) Copy sequence to a file called seq.dat; 2) seq = open("seq.dat") 3) Set up your control flow line = seq.readline() while line: #1)substitute all the digits with null line = re.sub(“\d”,””,line) #2) substitue white space with null line = re.sub(“\w”,””,line) #3) Reverse Complement your DNA line = revcomp(line) DNA = DNA+line line = seq.readline() #Base content print "Adenine: " , newDNA.count("a") print "Thymine: " , newDNA.count("t") print "Guanine: ", newDNA.count("g") print "Cytosine: ", newDNA.count("c")
18
Code Hints:
#Translation standard = { 'ttt': 'F', 'tct': 'S', 'tat': 'Y', 'tgt': 'C', 'ttc': 'F', 'tcc': 'S', 'tac': 'Y', 'tgc': 'C', 'tta': 'L', 'tca': 'S', 'taa': '*' , 'tca': '*', 'ttg': 'L', 'tcg': 'S', 'tag': '*', 'tcg': 'W', 'ctt': 'L', 'cct': 'P', 'cat': 'H', 'cgt': 'R', 'ctc': 'L', 'ccc': 'P', 'cac': 'H', 'cgc': 'R', 'cta': 'L', 'cca': 'P', 'caa': 'Q', 'cga': 'R', 'ctg': 'L', 'ccg': 'P', 'cag': 'Q', 'cgg': 'R', 'att': 'I', 'act': 'T', 'aat': 'N', 'agt': 'S', 'atc': 'I', 'acc': 'T', 'aac': 'N', 'agc': 'S', 'ata': 'I', 'aca': 'T', 'aaa': 'K', 'aga': 'R', 'atg': 'M', 'acg': 'T', 'aag': 'K', 'agg': 'R', 'gtt': 'V', 'gct': 'A', 'gat': 'D', 'ggt': 'G', 'gtc': 'V', 'gcc': 'A', 'gac': 'D', 'ggc': 'G', 'gta': 'V', 'gca': 'A', 'gaa': 'E', 'gga': 'G', 'gtg': 'V', 'gcg': 'A', 'gag': 'E', 'ggg': 'G' } #a dictionary #function definition def dnatoprotein (dna, code): """ translate a DNA sequence to a protein """