SLIDE 1
MHC-Peptide Interaction Studies using Bioinformatics tools
Presented by
Kunde Ramamoorthy Govindarajan
SLIDE 2 Introduction to Major histocompatibility complex
What is MHC MHC gene organization MHC classification Structure, Function and antigen processing. Development of MHC-Peptide interaction Database(MPID)
Objectives Database Design Database Access
Conclusion and Future work
Overview
SLIDE 3
Major Histocompatibility complex
MHC is a genetic complex with multiple loci encoding two major MHC proper (proteins with the potential of presenting peptides to the TCRs.) class I, class II and a non-MHC genes with immune functions (class III). Highly polymorphic cell surface molecules that present peptide ligands to cell of the T-cell compartment of the immune system.
SLIDE 4
MHC genes
Human MHC is a cluster of genes on short arm of chromosome 6. MHC is highly polymorphic. Class I: HLA-A, B and C α- chains. Class II: DP, DQ and DR arranged in pairs encoding α and β chains.
SLIDE 5
MHC gene Organization
Mouse Human
SLIDE 6
MHC Structure
The 3-D structure of MHC class I and class II have been established using X-ray crystallography. Class I and class II MHC molecules have less similarity in protein sequences, but are very similar in function. The physical difference between the two are small and are essential for peptide binding.
SLIDE 7
Class I MHC Structure and Function
Ternary complexes- α- chain, β-2 microglobulin and antigenic peptide. Consists three extra cellular domains (α1, α2 & α3) a transmembrane segment and a cytoplasmic domain. α1 and α2 forms peptide binding groove. Binding groove contains long α-helix and four β-strands.
SLIDE 8
α-chain is highly polymorphic. Cleft is closed at the ends,limiting the size of the peptide. α chain folds to form a cleft with α- helical sides and β- pleated sheet floor to hold 8-10 a.a peptides.
Class I MHC Structure and Function
SLIDE 9
Class I Antigen Processing
Source: http://www.med.sc.edu:85/bowers/mhc.htm
SLIDE 10
Class II MHC Structure and Function
Heterodimer of α and β chains. Consists of four extra cellular domains (α1, α2,β1 & β2 ). α1 and β1 forms peptide binding groove. Binding groove Contains long α-helix and four β-strands. Cleft is open at the ends, allowing longer peptides to bind (13-18 a.a).
SLIDE 11
MHC class II Structure and Function
SLIDE 12
Class II Antigen processing
Source: http://www.med.sc.edu:85/bowers/mhc.htm
SLIDE 13
Importance of MHC
Recognition of a peptide derived from a disease associated protein (e.g) viral or a bacteria, in the presence of a co-stimulatory signal leads to T-cell activation and triggers a T-cell mediated Immune response. Therefore, which peptide fragments binds to MHC molecules for recognition by T-cell is crucial for the development of peptide based vaccines.
SLIDE 14 Current scenario in MHC-Peptide binding prediction
So far MHC-Peptide binding prediction has focused on sequence-based methods. However, the methods are not sensitive. A recent review says “Poor correspondence between predicted and experimental binding
- f peptides to class I MHC molecules”.
(Anderson et al, 2000 Tissue antigens. 55, 519- 31).
SLIDE 15
Current Scenario (Contd…)
Another approach is based on structural information obtained from X-ray crystallography. (Altuvia et al., 1997; Schueler-Furman et al., 2000). Recent review says “structure based methods have not been extensively used for the prediction of CTL- epitopes”(Markus Schirle et al., 2001 Journal of Immunological
Methods 257, 1-16).
Advantages of structural approach Structure is conserved longer time in evolution when compared to sequences. Structure determinants influence the binding of specific amino acid sequences for particular MHC- allele.
SLIDE 16
Existing Databases for MHC
IMGT/HLA - Sequences for all HLA-alleles.
(http://www.ebi.ac.uk/imgt/hla/).
SYFPEITHI - Sequence database for MHC binding Peptides.
(http://syfpeithi.bmiheidelberg.com/)
MHCPEP - Peptide sequences with experimental binding data.
(http://wehih.wehi.edu.au/mhcpep/)
FIMM - Referenced data-peptides,MHC and relevant disease association.
(http://sdmc.krdl.org.sg:8080/fimm/)
SLIDE 17
Project Objectives
To collect and curate the available MHC-Peptide complex structure data from PDB. Calculate properties defining MHC-Peptide complex interaction. Develop a comprehensive database spanning sequence-structure-function realms. Analyze data and quantify MHC-Peptide complex interaction. Develop algorithm for MHC-Peptide binding prediction.
SLIDE 18
Project Objectives
To collect and curate the available MHC-Peptide complex structure data from Protein Data Bank (PDB). Calculate properties defining MHC-Peptide complex interaction. Develop a comprehensive database spanning sequence-structure-function realms Analyze data and quantify MHC-Peptide complex interaction. Develop algorithm for MHC-Peptide binding prediction.
SLIDE 19
Data clustering and Redundancy
Structural Data derived from Protein DataBank(PDB). Data clustered based on MHC class, By Allele and Peptide length.
SLIDE 20
Redundancy (contd…)
Non-Redundancy: Best structure from each group on the basis of highest resolution and completeness of structural information. If the PDB entries contains multiple protein chains, the first complex is stored in MPID. Non-classical MHC-Peptide structure and complexes of non-standard amino acids are not included.
SLIDE 21
Project Objectives
To collect and curate the available MHC-Peptide complex structure data from PDB. Calculate properties defining MHC-Peptide complex interaction. Develop a comprehensive database spanning sequence-structure-function realms Analyze data and quantify MHC-Peptide complex interaction. Develop algorithm for MHC-Peptide binding prediction.
SLIDE 22 Interaction parameters
Interface area between MHC and Peptide
Defined as the change in their solvent accessible surface area (∆ASA) when going from a monomeric to a dimeric MHC-Peptide complexes state. The ∆ASA of the complexes and the individual polypeptides were calculated using the program NACCESS based on Lee & Richard(1971) algorithm.
∆ A S A = A S A o f M H C + A S A o f P e p tid e -A S A o f M H C p c o m p le x 2
SLIDE 23 Parameters(contd…)
Gap Volume
The gap volume between the MHC & peptide in each complex was calculated using the Program SURFNET (Laskowski 1995).
Gap Index
Gap index (Å ) is defined as the ratio of gap volume between the MHC and the peptide(Å3 ) to the interface area(Å2 ) per MHC peptide.
Gap Index = Gap Volume between MHC & Peptide(Å
3)
Interface area (Å
2)
SLIDE 24
Parameters(contd…)
Hydrogen bonds
The number of intermolecular hydrogen bonds between the peptide and the MHC was calculated using HBPLUS(McDonald and Thornton 1994) in which hydrogen bonds are defined according to standard geometric criteria.
SLIDE 25 Ligplot
Schematic diagram MHC-Peptide interactions based
diagram, LIGPLOT are also available in MPID in pdf format.
Schematic diagram of MHC Peptide Interactions
SLIDE 26
Schematic diagram (contd…)
Conserved residues in 12-mer peptides.
Sequence Conservation (by peptide length) Sequence logo is a graphical display of a multiple alignment consisting of color coded stacks of letters representing amino acids at successive positions. The height represents the frequency of the amino acids.
SLIDE 27
Project Objectives
To collect and curate the available MHC-Peptide complex structure data from PDB. Calculate properties defining MHC-Peptide complex interaction. Develop a comprehensive database spanning sequence-structure-function realms. Analyze data and quantify MHC-Peptide complex interaction. Develop algorithm for MHC-Peptide binding prediction.
SLIDE 28
- MPID is a relational database(MySQL v.3.22.21)
developed and hosted on the UNIX platform (IRIX 6.5) running Apache 1.3.12.
- MPID is a semi-automatically derived.
- The overall dimensional model of MPID data depicts
the flow between the different dimensions along with internal links and links to relevant external data sources.
- Currently, there are five dimensions: MHC-Peptide
complexes, MHC, Peptides, Interactions and References.
Database design
SLIDE 29
SLIDE 30
The sequence data derived from PDB is hyperlinked to IMGT/HLA (for MHC) and to the SYFPEITHI (for the peptide) databases. The publication reference for each MPID structure is provided by a link to the PubMed database. The related sequences and structures for the relevant protein chains of each MPID entry can be accessed via the NCBI structure database. Experimental binding strengths along with the corresponding references have been provided, wherever available.
MPID Links
SLIDE 31 Database Access and Features
MPID is accessible via a www interface (http://surya.bic.nus.edu.sg/mpid) Users can customize the fields shown in the
Visualization of the structural information (either the MHC, the peptide or the complex) is possible using freely available graphics applications such as RASMOL or CHIME. The entire dataset including the PDB coordinates for single MHC-Peptide complexes are available from a separate download directory at the MPID
- website. (http://surya.bic.nus.edu.sg/mpid/download).
SLIDE 32
SLIDE 33
SLIDE 34
Query Output for class I (A*0201)
SLIDE 35 Conclusion and future work
The MPID is a comprehensive database for sequence-structure-function information of MHC-peptide complexes. It provides interface to different sources of information on MHC like-SYFPEITHI etc. Quantitative and qualitative information on MHC- Peptide interactions are available in MPID. Users can customize the fields shown in the
SLIDE 36
Future work
The database will be updated quarterly. Electrostatic potential surfaces of the interaction zones on the MHC and the peptides will be determined. Analysis is currently underway.
SLIDE 37
Supervisors
A/P. Shoba Ranganathan. A/P. Tan Tin Wee.
SLIDE 38 Acknowledgements
BIC staffs and my colleagues.