Protein Structures Sequences of amino acid residues 20 - - PowerPoint PPT Presentation

protein structures
SMART_READER_LITE
LIVE PREVIEW

Protein Structures Sequences of amino acid residues 20 - - PowerPoint PPT Presentation

Protein Structures Sequences of amino acid residues 20 different amino acids Primary Secondary Tertiary Quaternary Primary Secondary Tertiary Quaternary 1/25/05 CAP5510/CGS5166 (Lec 5) 1 Amino Acid Types Hydrophobic


slide-1
SLIDE 1

1/25/05 CAP5510/CGS5166 (Lec 5) 1

Protein Structures

  • Sequences of amino acid residues
  • 20 different amino acids

Primary Primary Quaternary Quaternary Tertiary Tertiary Secondary Secondary

slide-2
SLIDE 2

1/25/05 CAP5510/CGS5166 (Lec 5) 2

Amino Acid Types

  • Hydrophobic I,L,M,V,A,F,P
  • Charged

– Basic K,H,R – Acidic E,D

  • Polar

S,T,Y,H,C,N,Q,W

  • Small A,S,T
  • Very Small

A,G

  • Aromatic

F,Y,W

slide-3
SLIDE 3

1/25/05 CAP5510/CGS5166 (Lec 5) 3

All 3 figures are cartoons of an amino acid residue.

slide-4
SLIDE 4

1/25/05 CAP5510/CGS5166 (Lec 5) 4

Angles φ and ψ in the polypeptide chain

slide-5
SLIDE 5

1/25/05 CAP5510/CGS5166 (Lec 5) 5

Peptide bonds in chains of residues

slide-6
SLIDE 6

1/25/05 CAP5510/CGS5166 (Lec 5) 6

slide-7
SLIDE 7

1/25/05 CAP5510/CGS5166 (Lec 5) 7

Proteins

  • Primary structure is the sequence of

amino acid residues of the protein, e.g., Flavodoxin:

AKIGLFYGTQTGVTQTIAESIQQEFGGESIVDLNDIANADA…

  • Different regions of the sequence form

local regular secondary structures, such as

– Alpha helix, beta strands, etc.

AKIGLFYGTQTGVTQTIAESIQQEFGGESIVDLNDIANADA… Secondary Secondary

slide-8
SLIDE 8

1/25/05 CAP5510/CGS5166 (Lec 5) 8

slide-9
SLIDE 9

1/25/05 CAP5510/CGS5166 (Lec 5) 9

slide-10
SLIDE 10

1/25/05 CAP5510/CGS5166 (Lec 5) 10

Alpha Helix

slide-11
SLIDE 11

1/25/05 CAP5510/CGS5166 (Lec 5) 11

slide-12
SLIDE 12

1/25/05 CAP5510/CGS5166 (Lec 5) 12

Beta Strand

slide-13
SLIDE 13

1/25/05 CAP5510/CGS5166 (Lec 5) 13

Proteins

  • Tertiary structures are formed by packing

secondary structural elements into a globular structure.

Lambda Cro Myoglobin

slide-14
SLIDE 14

1/25/05 CAP5510/CGS5166 (Lec 5) 14

Quaternary Structures in Proteins

Quaternary Quaternary

  • The final structure may contain more

than one “chain” arranged in a quaternary structure.

Insulin Hexamer

slide-15
SLIDE 15

1/25/05 CAP5510/CGS5166 (Lec 5) 15

More on Secondary Structures

  • α-helix

– Main chain with peptide bonds – Side chains project outward from helix – Stability provided by H-bonds between CO and NH groups of residues 4 locations away.

  • β-strand

– Stability provided by H-bonds with one or more β-strands, forming β-sheets. Needs a β-turn.

slide-16
SLIDE 16

1/25/05 CAP5510/CGS5166 (Lec 5) 16

Secondary Structure Prediction Software

slide-17
SLIDE 17

1/25/05 CAP5510/CGS5166 (Lec 5) 17

PDB: Protein Data Bank

  • Database of protein tertiary and quaternary structures and

protein complexes. http://www.rcsb.org/pdb/

  • Over 29,000 structures as of Feb 1, 2005.
  • Structures determined by

– NMR Spectroscopy – X-ray crystallography – Computational prediction methods

  • Sample PDB file: Click here [▪]
slide-18
SLIDE 18

1/25/05 CAP5510/CGS5166 (Lec 5) 18

Active Sites

Active sites in proteins are usually hydrophobic pockets/crevices/troughs that involve sidechain atoms.

slide-19
SLIDE 19

1/25/05 CAP5510/CGS5166 (Lec 5) 19

Active Sites

Left PDB 3RTD (streptavidin) and the first site located by the MOE Site Finder. Middle 3RTD with complexed ligand (biotin). Right Biotin ligand overlaid with calculated alpha spheres of the first site.

slide-20
SLIDE 20

1/25/05 CAP5510/CGS5166 (Lec 5) 20

Simple Models

  • Helps to model simple sequence features.
  • sets of sequences e.g. [AT] C [GC] TC [AGC]
  • single sequences e.g. TTGACA or TATATT [??]
  • sets of sequences with inserts e.g. GCA [AT] [AT]* AG
  • & deletes too, e.g. TATA [G –] T

STATE 1 END START STATE 2 STATE 3 STATE 4 STATE 5 STATE 6

  • long sequences with a sequence of domains H-B-T-B-H
slide-21
SLIDE 21

1/25/05 CAP5510/CGS5166 (Lec 5) 21

Profile Method

slide-22
SLIDE 22

1/25/05 CAP5510/CGS5166 (Lec 5) 22

Profile Method

slide-23
SLIDE 23

1/25/05 CAP5510/CGS5166 (Lec 5) 23

Profile HMMs

STATE 1 END START STATE 2 STATE 3 STATE 4 STATE 5 STATE 6

slide-24
SLIDE 24

1/25/05 CAP5510/CGS5166 (Lec 5) 24

Hidden Markov Model (HMM)

  • States
  • Transitions
  • Transition Probabilities
  • Emissions
  • Emission Probabilities
  • What is hidden about HMMs?

Answer: The path through the model is hidden since there are many valid paths.

slide-25
SLIDE 25

CpG Island + in an ocean of – First order Markov Model

MM=16, HMM= 64 transition probabilities (adjacent bp)

1/25/05 CAP5510/CGS5166 (Lec 5) 25

C+ A+ G+ T+

P(G+|C+) P(A+|A+)

C- A- G- T-

P(C-|A+)

slide-26
SLIDE 26

1/25/05 CAP5510/CGS5166 (Lec 5) 26

How to Solve Problem 2?

  • Solve the following problem:

Input: Hidden Markov Model M, parameters Θ, emitted sequence S Output: Most Probable Path Π How: Viterbi’s Algorithm (Dynamic Programming)

Define Π[i,j] = MPP for first j characters of S ending in state i Define P[i,j] = Probability of Π[i,j] – Compute state i with largest P[i,j].

slide-27
SLIDE 27

1/25/05 CAP5510/CGS5166 (Lec 5) 27

Profile HMMs with InDels

  • Insertions
  • Deletions
  • Insertions & Deletions

STATE 1 END START STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 INSERT 4 DELETE 2 DELETE 3 DELETE 1 INSERT 3 INSERT 4

slide-28
SLIDE 28

1/25/05 CAP5510/CGS5166 (Lec 5) 28

Profile HMMs with InDels

STATE 1 END START STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 INSERT 4 DELETE 2 DELETE 3 DELETE 1 INSERT 3 DELETE 4 DELETE 5 DELETE 6 INSERT 4 INSERT 4 INSERT 4 INSERT 4

Missing transitions from DELETE j to INSERT j and from INSERT j to DELETE j+1.

slide-29
SLIDE 29

1/25/05 CAP5510/CGS5166 (Lec 5) 29

How to model Pairwise Sequence Alignment

LEAPVE LAPVIE MATCH END START DELETE INSERT

Pair HMMs

  • Emit pairs of synbols
  • Emission probs?
  • Related to Sub. Matrices
  • How to deal with InDels?
  • Global Alignment? Local?
  • Related to Sub. Matrices
slide-30
SLIDE 30

1/25/05 CAP5510/CGS5166 (Lec 5) 30

How to model Pairwise Local Alignments?

Skip Module Align Module Skip Module

START END

How to model Pairwise Local Alignments with gaps?

Skip Module Align Module Skip Module

START END