In silico grouping of peptide/HLA class I complexes using structural interaction characteristics (original) (raw)
Journal Article
,
1 1
Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore
8 Medical Drive, Singapore 117597
2 2
Institute for Infocomm Research
21 Heng Mui Keng Terrace, Singapore 119613
Search for other works by this author on:
,
1 1
Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore
8 Medical Drive, Singapore 117597
Search for other works by this author on:
1 1
Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore
8 Medical Drive, Singapore 117597
3 3
Department of Chemistry and Biomolecular Sciences & Biotechnology Research Institute, Macquarie University
NSW 2109, Australia
*To whom correspondence should be addressed.
Search for other works by this author on:
Revision received:
03 November 2006
Accepted:
03 November 2006
Published:
07 November 2006
Cite
Joo Chuan Tong, Tin Wee Tan, Shoba Ranganathan, In silico grouping of peptide/HLA class I complexes using structural interaction characteristics, Bioinformatics, Volume 23, Issue 2, January 2007, Pages 177–183, https://doi.org/10.1093/bioinformatics/btl563
Close
Navbar Search Filter Mobile Enter search term Search
Abstract
Motivation: Classification of human leukocyte antigen (HLA) proteins into supertypes underpins the development of epitope-based vaccines with wide population coverage. Current methods for HLA supertype definition, based on common structural features of HLA proteins and/or their functional binding specificities, leave structural interaction characteristics among different HLA supertypes with antigenic peptides unexplored.
Methods: We describe the use of structural interaction descriptors for the analysis of 68 peptide/HLA class I crystallographic structures. Interaction parameters computed include the number of intermolecular hydrogen bonds between each HLA protein and its corresponding bound peptide, solvent accessibility, gap volume and gap index.
Results: The structural interactions patterns of peptide/HLA class I complexes investigated herein vary among individual alleles and may be grouped in a supertype dependent manner. Using the proposed methodology, eight HLA class I supertypes were defined based on existing experimental crystallographic structures which largely overlaps (77% consensus) with the definitions by binding motifs. This mode of classification, which considers conformational information of both peptide and HLA proteins, provides an alternative to the characterization of supertypes using either peptide or HLA protein information alone.
Contact: shoba@els.mq.edu
1 INTRODUCTION
Major histocompatibility complex (MHC) are cell surface glycoproteins that play a vital role in adaptive immune response (Rammensee et al., 1993). In order to generate maximal immunological protection against a large repertoire of possible pathogens, MHC bind peptides of diverse sequences and present them on the surface of antigen-presenting cells for recognition by T cell receptors (Lefranc and Lefranc, 2001). T cell recognition of the peptide–MHC complex initiates a cascade of immunological events necessary for initiation and regulation of immune responses. Two classes of MHC are responsible for antigen presentation: (1) MHC class I presents endogenous peptides; and MHC class II presents exogenous peptides. The MHC binding clefts contain polymorphic cavities (or ‘pockets’) that fit the side-chains of complementary residues on the binding peptide (Falk et al., 1991a; Hunt et al., 1992). These corresponding peptide residues are termed anchor residues because they ‘anchor’ the peptides firmly at various positions in the MHC binding cleft and contribute to most of the binding interactions. Specific MHC alleles can bind peptides with similar anchor residues, leading to the definition of ‘peptide motif’ for an array of class I and class II alleles (Falk et al., 1991b; Rotzschke et al., 1991). The subsequent discovery that certain MHC alleles can recognize very similar motifs led to the definition of MHC ‘supermotifs’ or ‘supertypes’ (Del Guercio et al., 1995).
The characterization and classification of MHC alleles into supertypes is important for the development of epitope-based vaccine (Sette et al., 2001, 2002; Zhu et al., 2006). By clustering MHC alleles based on their structural features and/or peptide binding specificities, promiscuous T cell epitopes that bind multiple MHC alleles can be identified. Such peptides are key targets for the design of vaccines and immunotherapies because they are applicable to higher proportions of human population. However, experimental determination of binding specificities for even a single MHC allele is an expensive, laborious and time consuming process; and not practical for the study of MHC supertypes which involve large numbers of alleles (Kobayashi et al., 2001; Panigada et al., 2002; Doytchinova and Flower, 2005). In silico, bioinformatics is emerging as an alternative and viable approach for MHC supertype classification. A number of clustering methods for MHC supertype definitions are available, including those based on local sequence similarities in binding pockets (Chelvanayagam, 1996; Zhang et al., 1998; Zhao et al., 2003), global sequence similarities (Cano et al., 1998; McKenzie et al., 1999) and peptide binding motifs (Lund et al., 2004). For the latter, the success of the approach depends on the availability of sufficient binding data. Where data is limited or there is bias in the experimental binding motifs, mixed results have been reported (Tong et al., 2006a). Recently, Doytchinova et al. (2004) and Doytchinova and Flower (2005) employed the use of hierarchical clustering and principal component analysis to classify human MHC or human leukocyte antigen (HLA) alleles according to MHC sequences and structures. The approach successfully identified MHC class I and class II supertype fingerprints and illustrated that only 1–3 amino acid are sufficient for an allele to be classified within a particular supertype. Kangueane et al. (2005) defined critical polymorphic functional residue positions within the binding grooves of HLA-A, -B and -C alleles and grouped 47% of 295 HLA-A alleles, 44% of 540 HLA-B alleles and 35% of 156 HLA-C alleles to 36, 71 and 18 groups, respectively.
In the present study, we applied the use of structural interaction parameters previously reported as significant for peptide/MHC interactions (Kangueane et al., 2001) for in-depth analysis of 68 peptide/HLA crystallographic structures from the MPID-T database (Tong et al., 2006b). Our analysis revealed the striking observation that peptide/HLA structural interaction patterns vary among different alleles and may be grouped in a supertype dependent manner. The results obtained in this study shed new light into HLA supertype definition, suggesting that HLA supertype definitions may not be limited to peptide binding motifs or receptor information, and can be characterized at the intermolecular level, based on the interactions between HLA proteins and their associated peptides, consistent with solutions from X-ray crystallography. Through the use of structural interaction parameters described herein, a novel HLA class I supertype classification schema has been developed for alleles with available crystallographic structures.
2 METHODS
2.1 Data
A total of 68 peptide–HLA complexes spanning 13 classes I alleles from the MPID-T database (Tong et al., 2006b, Author Webpage) were used in the current analysis (Table 1).
Table 1
Computed HLA structural interaction parameters investigated in this study
Class | Allele | PDB ID | Interface area (Å2) | Gap volume (Å3) | Gap index (Å) | No. of H-bonds | Peptide length | Supertype assigned | ||
---|---|---|---|---|---|---|---|---|---|---|
This work | Sette and Sidney (1999) | Doytchinova and Flower (2004) | ||||||||
I | A*0101 | 1DUY | 717.7 | 1116.1 | 1.6 | 7 | 8 | A-I | A1 | A1 |
I | A*0201 | 1AKJ | 857.2 | 746.5 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1AO7 | 883.3 | 1038.3 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0G | 860.2 | 441.3 | 0.5 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0R | 642.4 | 724.0 | 1.1 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1BD2 | 874.2 | 813.6 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1DUZ | 863.0 | 1069.6 | 1.2 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEY | 787.1 | 900.3 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEZ | 829.8 | 704.2 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHG | 765.8 | 1039.9 | 1.4 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHH | 918.4 | 530.4 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1HHI | 842.0 | 455.7 | 0.5 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHJ | 847.9 | 827.4 | 1.0 | 14 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHK | 865.5 | 1083.4 | 1.3 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1F | 850.9 | 800.8 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1Y | 877.9 | 745.0 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I4F | 820.1 | 877.1 | 1.1 | 15 | 10 | A-II | A2 | A2 |
I | A*0201 | 1I7R | 902.5 | 805.4 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7T | 847.9 | 591.4 | 0.7 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7U | 845.2 | 545.8 | 0.7 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1IM3 | 855.0 | 954.9 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1JF1 | 870.4 | 492.5 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1JHT | 781.0 | 588.8 | 0.8 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1LP9 | 855.7 | 549.7 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1OGA | 856.0 | 539.5 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1P7Q | 810.8 | 648.1 | 0.8 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QR1 | 827.1 | 722.4 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QRN | 871.5 | 921.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSE | 873.0 | 1097.7 | 1.3 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSF | 828.6 | 959.5 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9W | 904.9 | 933.9 | 1.0 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9X | 872.3 | 933.1 | 1.1 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9Y | 883.7 | 979.0 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVB | 829.2 | 866.8 | 1.0 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVH | 865.1 | 950.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 2CLR | 896.5 | 911.4 | 1.0 | 10 | 10 | A-II | A2 | A2 |
I | A*6801 | 1TMC | 918.1 | 926.2 | 1.0 | 14 | 10 | A-III | A3 | A3 |
I | B*0801 | 1AGB | 820.8 | 881.7 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGC | 812.5 | 688.1 | 0.9 | 18 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGD | 819.7 | 816.1 | 1.0 | 16 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGE | 812.3 | 920.6 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGF | 860.0 | 765.4 | 0.9 | 14 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1M05 | 1000.6 | 897.4 | 0.9 | 18 | 9 | B-II | Outlier | B7 |
I | B*0801 | 1MI5 | 1028.0 | 850.0 | 1.8 | 15 | 9 | B-II | Outlier | B7 |
I | B*3501 | 1QEW | 843.2 | 855.9 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1XH3 | 927.0 | 1198.5 | 1.3 | 13 | 14 | B-II | B7 | B7 |
I | B*3501 | 1A1N | 857.9 | 670.2 | 0.8 | 11 | 8 | B-II | B7 | B7 |
I | B*3501 | 1A9B | 855.4 | 847.4 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1A9E | 882.6 | 779.3 | 0.9 | 12 | 9 | B-II | B7 | B7 |
I | B*1501 | 1XR8 | 860.7 | 968.1 | 1.1 | 16 | 9 | B-V | B62 | B62 |
I | B*1501 | 1XR9 | 883.0 | 414.1 | 0.5 | 18 | 9 | B-V | B62 | B62 |
I | B*2705 | 1HSA | 691.8 | 1148.4 | 1.7 | 14 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OF2 | 1087.7 | 1015.5 | 0.9 | 17 | 9 | B-I | B27 | B27 |
I | B*2705 | 1W0V | 1007.2 | 898.9 | 0.9 | 16 | 9 | B-I | B27 | B27 |
I | B*2705 | 1JGE | 815.4 | 994.4 | 0.9 | 15 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OGT | 1096.0 | 849.3 | 0.8 | 21 | 9 | B-I | B27 | B27 |
I | B*2709 | 1UXW | 1071.9 | 1116.8 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1W0W | 999.9 | 968.1 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1JGD | 979.7 | 994.4 | 1.0 | 23 | 10 | B-I | B27 | B27 |
I | B*2709 | 1K5N | 780.7 | 879.9 | 1.1 | 19 | 9 | B-I | B27 | B27 |
I | B*4402 | 1M6O | 937.3 | 903.2 | 1.0 | 16 | 9 | B-III | B44 | B44 |
I | B*4403 | 1SYS | 941.9 | 1076.0 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4403 | 1N2R | 901.0 | 949.9 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4405 | 1SYV | 901.8 | 915.6 | 1.0 | 15 | 9 | B-III | B44 | B44 |
I | B*5101 | 1E27 | 865.2 | 809.0 | 0.9 | 8 | 9 | B-IV | B7 | B44 |
I | B*5101 | 1E28 | 804.9 | 724.0 | 0.9 | 7 | 8 | B-IV | B7 | B44 |
I | B*5301 | 1A1M | 823.9 | 971.3 | 1.2 | 12 | 9 | B-II | B7 | B44 |
I | B*5301 | 1A1O | 965.2 | 778.8 | 0.8 | 10 | 9 | B-II | B7 | B44 |
Class | Allele | PDB ID | Interface area (Å2) | Gap volume (Å3) | Gap index (Å) | No. of H-bonds | Peptide length | Supertype assigned | ||
---|---|---|---|---|---|---|---|---|---|---|
This work | Sette and Sidney (1999) | Doytchinova and Flower (2004) | ||||||||
I | A*0101 | 1DUY | 717.7 | 1116.1 | 1.6 | 7 | 8 | A-I | A1 | A1 |
I | A*0201 | 1AKJ | 857.2 | 746.5 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1AO7 | 883.3 | 1038.3 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0G | 860.2 | 441.3 | 0.5 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0R | 642.4 | 724.0 | 1.1 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1BD2 | 874.2 | 813.6 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1DUZ | 863.0 | 1069.6 | 1.2 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEY | 787.1 | 900.3 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEZ | 829.8 | 704.2 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHG | 765.8 | 1039.9 | 1.4 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHH | 918.4 | 530.4 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1HHI | 842.0 | 455.7 | 0.5 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHJ | 847.9 | 827.4 | 1.0 | 14 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHK | 865.5 | 1083.4 | 1.3 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1F | 850.9 | 800.8 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1Y | 877.9 | 745.0 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I4F | 820.1 | 877.1 | 1.1 | 15 | 10 | A-II | A2 | A2 |
I | A*0201 | 1I7R | 902.5 | 805.4 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7T | 847.9 | 591.4 | 0.7 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7U | 845.2 | 545.8 | 0.7 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1IM3 | 855.0 | 954.9 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1JF1 | 870.4 | 492.5 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1JHT | 781.0 | 588.8 | 0.8 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1LP9 | 855.7 | 549.7 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1OGA | 856.0 | 539.5 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1P7Q | 810.8 | 648.1 | 0.8 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QR1 | 827.1 | 722.4 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QRN | 871.5 | 921.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSE | 873.0 | 1097.7 | 1.3 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSF | 828.6 | 959.5 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9W | 904.9 | 933.9 | 1.0 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9X | 872.3 | 933.1 | 1.1 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9Y | 883.7 | 979.0 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVB | 829.2 | 866.8 | 1.0 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVH | 865.1 | 950.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 2CLR | 896.5 | 911.4 | 1.0 | 10 | 10 | A-II | A2 | A2 |
I | A*6801 | 1TMC | 918.1 | 926.2 | 1.0 | 14 | 10 | A-III | A3 | A3 |
I | B*0801 | 1AGB | 820.8 | 881.7 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGC | 812.5 | 688.1 | 0.9 | 18 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGD | 819.7 | 816.1 | 1.0 | 16 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGE | 812.3 | 920.6 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGF | 860.0 | 765.4 | 0.9 | 14 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1M05 | 1000.6 | 897.4 | 0.9 | 18 | 9 | B-II | Outlier | B7 |
I | B*0801 | 1MI5 | 1028.0 | 850.0 | 1.8 | 15 | 9 | B-II | Outlier | B7 |
I | B*3501 | 1QEW | 843.2 | 855.9 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1XH3 | 927.0 | 1198.5 | 1.3 | 13 | 14 | B-II | B7 | B7 |
I | B*3501 | 1A1N | 857.9 | 670.2 | 0.8 | 11 | 8 | B-II | B7 | B7 |
I | B*3501 | 1A9B | 855.4 | 847.4 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1A9E | 882.6 | 779.3 | 0.9 | 12 | 9 | B-II | B7 | B7 |
I | B*1501 | 1XR8 | 860.7 | 968.1 | 1.1 | 16 | 9 | B-V | B62 | B62 |
I | B*1501 | 1XR9 | 883.0 | 414.1 | 0.5 | 18 | 9 | B-V | B62 | B62 |
I | B*2705 | 1HSA | 691.8 | 1148.4 | 1.7 | 14 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OF2 | 1087.7 | 1015.5 | 0.9 | 17 | 9 | B-I | B27 | B27 |
I | B*2705 | 1W0V | 1007.2 | 898.9 | 0.9 | 16 | 9 | B-I | B27 | B27 |
I | B*2705 | 1JGE | 815.4 | 994.4 | 0.9 | 15 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OGT | 1096.0 | 849.3 | 0.8 | 21 | 9 | B-I | B27 | B27 |
I | B*2709 | 1UXW | 1071.9 | 1116.8 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1W0W | 999.9 | 968.1 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1JGD | 979.7 | 994.4 | 1.0 | 23 | 10 | B-I | B27 | B27 |
I | B*2709 | 1K5N | 780.7 | 879.9 | 1.1 | 19 | 9 | B-I | B27 | B27 |
I | B*4402 | 1M6O | 937.3 | 903.2 | 1.0 | 16 | 9 | B-III | B44 | B44 |
I | B*4403 | 1SYS | 941.9 | 1076.0 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4403 | 1N2R | 901.0 | 949.9 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4405 | 1SYV | 901.8 | 915.6 | 1.0 | 15 | 9 | B-III | B44 | B44 |
I | B*5101 | 1E27 | 865.2 | 809.0 | 0.9 | 8 | 9 | B-IV | B7 | B44 |
I | B*5101 | 1E28 | 804.9 | 724.0 | 0.9 | 7 | 8 | B-IV | B7 | B44 |
I | B*5301 | 1A1M | 823.9 | 971.3 | 1.2 | 12 | 9 | B-II | B7 | B44 |
I | B*5301 | 1A1O | 965.2 | 778.8 | 0.8 | 10 | 9 | B-II | B7 | B44 |
Table 1
Computed HLA structural interaction parameters investigated in this study
Class | Allele | PDB ID | Interface area (Å2) | Gap volume (Å3) | Gap index (Å) | No. of H-bonds | Peptide length | Supertype assigned | ||
---|---|---|---|---|---|---|---|---|---|---|
This work | Sette and Sidney (1999) | Doytchinova and Flower (2004) | ||||||||
I | A*0101 | 1DUY | 717.7 | 1116.1 | 1.6 | 7 | 8 | A-I | A1 | A1 |
I | A*0201 | 1AKJ | 857.2 | 746.5 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1AO7 | 883.3 | 1038.3 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0G | 860.2 | 441.3 | 0.5 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0R | 642.4 | 724.0 | 1.1 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1BD2 | 874.2 | 813.6 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1DUZ | 863.0 | 1069.6 | 1.2 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEY | 787.1 | 900.3 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEZ | 829.8 | 704.2 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHG | 765.8 | 1039.9 | 1.4 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHH | 918.4 | 530.4 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1HHI | 842.0 | 455.7 | 0.5 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHJ | 847.9 | 827.4 | 1.0 | 14 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHK | 865.5 | 1083.4 | 1.3 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1F | 850.9 | 800.8 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1Y | 877.9 | 745.0 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I4F | 820.1 | 877.1 | 1.1 | 15 | 10 | A-II | A2 | A2 |
I | A*0201 | 1I7R | 902.5 | 805.4 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7T | 847.9 | 591.4 | 0.7 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7U | 845.2 | 545.8 | 0.7 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1IM3 | 855.0 | 954.9 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1JF1 | 870.4 | 492.5 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1JHT | 781.0 | 588.8 | 0.8 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1LP9 | 855.7 | 549.7 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1OGA | 856.0 | 539.5 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1P7Q | 810.8 | 648.1 | 0.8 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QR1 | 827.1 | 722.4 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QRN | 871.5 | 921.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSE | 873.0 | 1097.7 | 1.3 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSF | 828.6 | 959.5 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9W | 904.9 | 933.9 | 1.0 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9X | 872.3 | 933.1 | 1.1 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9Y | 883.7 | 979.0 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVB | 829.2 | 866.8 | 1.0 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVH | 865.1 | 950.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 2CLR | 896.5 | 911.4 | 1.0 | 10 | 10 | A-II | A2 | A2 |
I | A*6801 | 1TMC | 918.1 | 926.2 | 1.0 | 14 | 10 | A-III | A3 | A3 |
I | B*0801 | 1AGB | 820.8 | 881.7 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGC | 812.5 | 688.1 | 0.9 | 18 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGD | 819.7 | 816.1 | 1.0 | 16 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGE | 812.3 | 920.6 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGF | 860.0 | 765.4 | 0.9 | 14 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1M05 | 1000.6 | 897.4 | 0.9 | 18 | 9 | B-II | Outlier | B7 |
I | B*0801 | 1MI5 | 1028.0 | 850.0 | 1.8 | 15 | 9 | B-II | Outlier | B7 |
I | B*3501 | 1QEW | 843.2 | 855.9 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1XH3 | 927.0 | 1198.5 | 1.3 | 13 | 14 | B-II | B7 | B7 |
I | B*3501 | 1A1N | 857.9 | 670.2 | 0.8 | 11 | 8 | B-II | B7 | B7 |
I | B*3501 | 1A9B | 855.4 | 847.4 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1A9E | 882.6 | 779.3 | 0.9 | 12 | 9 | B-II | B7 | B7 |
I | B*1501 | 1XR8 | 860.7 | 968.1 | 1.1 | 16 | 9 | B-V | B62 | B62 |
I | B*1501 | 1XR9 | 883.0 | 414.1 | 0.5 | 18 | 9 | B-V | B62 | B62 |
I | B*2705 | 1HSA | 691.8 | 1148.4 | 1.7 | 14 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OF2 | 1087.7 | 1015.5 | 0.9 | 17 | 9 | B-I | B27 | B27 |
I | B*2705 | 1W0V | 1007.2 | 898.9 | 0.9 | 16 | 9 | B-I | B27 | B27 |
I | B*2705 | 1JGE | 815.4 | 994.4 | 0.9 | 15 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OGT | 1096.0 | 849.3 | 0.8 | 21 | 9 | B-I | B27 | B27 |
I | B*2709 | 1UXW | 1071.9 | 1116.8 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1W0W | 999.9 | 968.1 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1JGD | 979.7 | 994.4 | 1.0 | 23 | 10 | B-I | B27 | B27 |
I | B*2709 | 1K5N | 780.7 | 879.9 | 1.1 | 19 | 9 | B-I | B27 | B27 |
I | B*4402 | 1M6O | 937.3 | 903.2 | 1.0 | 16 | 9 | B-III | B44 | B44 |
I | B*4403 | 1SYS | 941.9 | 1076.0 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4403 | 1N2R | 901.0 | 949.9 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4405 | 1SYV | 901.8 | 915.6 | 1.0 | 15 | 9 | B-III | B44 | B44 |
I | B*5101 | 1E27 | 865.2 | 809.0 | 0.9 | 8 | 9 | B-IV | B7 | B44 |
I | B*5101 | 1E28 | 804.9 | 724.0 | 0.9 | 7 | 8 | B-IV | B7 | B44 |
I | B*5301 | 1A1M | 823.9 | 971.3 | 1.2 | 12 | 9 | B-II | B7 | B44 |
I | B*5301 | 1A1O | 965.2 | 778.8 | 0.8 | 10 | 9 | B-II | B7 | B44 |
Class | Allele | PDB ID | Interface area (Å2) | Gap volume (Å3) | Gap index (Å) | No. of H-bonds | Peptide length | Supertype assigned | ||
---|---|---|---|---|---|---|---|---|---|---|
This work | Sette and Sidney (1999) | Doytchinova and Flower (2004) | ||||||||
I | A*0101 | 1DUY | 717.7 | 1116.1 | 1.6 | 7 | 8 | A-I | A1 | A1 |
I | A*0201 | 1AKJ | 857.2 | 746.5 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1AO7 | 883.3 | 1038.3 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0G | 860.2 | 441.3 | 0.5 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1B0R | 642.4 | 724.0 | 1.1 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1BD2 | 874.2 | 813.6 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1DUZ | 863.0 | 1069.6 | 1.2 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEY | 787.1 | 900.3 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1EEZ | 829.8 | 704.2 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHG | 765.8 | 1039.9 | 1.4 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHH | 918.4 | 530.4 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1HHI | 842.0 | 455.7 | 0.5 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHJ | 847.9 | 827.4 | 1.0 | 14 | 9 | A-II | A2 | A2 |
I | A*0201 | 1HHK | 865.5 | 1083.4 | 1.3 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1F | 850.9 | 800.8 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I1Y | 877.9 | 745.0 | 0.9 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I4F | 820.1 | 877.1 | 1.1 | 15 | 10 | A-II | A2 | A2 |
I | A*0201 | 1I7R | 902.5 | 805.4 | 0.9 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7T | 847.9 | 591.4 | 0.7 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1I7U | 845.2 | 545.8 | 0.7 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1IM3 | 855.0 | 954.9 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1JF1 | 870.4 | 492.5 | 0.6 | 11 | 10 | A-II | A2 | A2 |
I | A*0201 | 1JHT | 781.0 | 588.8 | 0.8 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1LP9 | 855.7 | 549.7 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1OGA | 856.0 | 539.5 | 0.6 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1P7Q | 810.8 | 648.1 | 0.8 | 7 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QR1 | 827.1 | 722.4 | 0.9 | 9 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QRN | 871.5 | 921.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSE | 873.0 | 1097.7 | 1.3 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1QSF | 828.6 | 959.5 | 1.2 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9W | 904.9 | 933.9 | 1.0 | 13 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9X | 872.3 | 933.1 | 1.1 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1S9Y | 883.7 | 979.0 | 1.1 | 11 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVB | 829.2 | 866.8 | 1.0 | 12 | 9 | A-II | A2 | A2 |
I | A*0201 | 1TVH | 865.1 | 950.6 | 1.1 | 10 | 9 | A-II | A2 | A2 |
I | A*0201 | 2CLR | 896.5 | 911.4 | 1.0 | 10 | 10 | A-II | A2 | A2 |
I | A*6801 | 1TMC | 918.1 | 926.2 | 1.0 | 14 | 10 | A-III | A3 | A3 |
I | B*0801 | 1AGB | 820.8 | 881.7 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGC | 812.5 | 688.1 | 0.9 | 18 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGD | 819.7 | 816.1 | 1.0 | 16 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGE | 812.3 | 920.6 | 1.1 | 15 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1AGF | 860.0 | 765.4 | 0.9 | 14 | 8 | B-II | Outlier | B7 |
I | B*0801 | 1M05 | 1000.6 | 897.4 | 0.9 | 18 | 9 | B-II | Outlier | B7 |
I | B*0801 | 1MI5 | 1028.0 | 850.0 | 1.8 | 15 | 9 | B-II | Outlier | B7 |
I | B*3501 | 1QEW | 843.2 | 855.9 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1XH3 | 927.0 | 1198.5 | 1.3 | 13 | 14 | B-II | B7 | B7 |
I | B*3501 | 1A1N | 857.9 | 670.2 | 0.8 | 11 | 8 | B-II | B7 | B7 |
I | B*3501 | 1A9B | 855.4 | 847.4 | 1.0 | 12 | 9 | B-II | B7 | B7 |
I | B*3501 | 1A9E | 882.6 | 779.3 | 0.9 | 12 | 9 | B-II | B7 | B7 |
I | B*1501 | 1XR8 | 860.7 | 968.1 | 1.1 | 16 | 9 | B-V | B62 | B62 |
I | B*1501 | 1XR9 | 883.0 | 414.1 | 0.5 | 18 | 9 | B-V | B62 | B62 |
I | B*2705 | 1HSA | 691.8 | 1148.4 | 1.7 | 14 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OF2 | 1087.7 | 1015.5 | 0.9 | 17 | 9 | B-I | B27 | B27 |
I | B*2705 | 1W0V | 1007.2 | 898.9 | 0.9 | 16 | 9 | B-I | B27 | B27 |
I | B*2705 | 1JGE | 815.4 | 994.4 | 0.9 | 15 | 9 | B-I | B27 | B27 |
I | B*2705 | 1OGT | 1096.0 | 849.3 | 0.8 | 21 | 9 | B-I | B27 | B27 |
I | B*2709 | 1UXW | 1071.9 | 1116.8 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1W0W | 999.9 | 968.1 | 1.0 | 18 | 9 | B-I | B27 | B27 |
I | B*2709 | 1JGD | 979.7 | 994.4 | 1.0 | 23 | 10 | B-I | B27 | B27 |
I | B*2709 | 1K5N | 780.7 | 879.9 | 1.1 | 19 | 9 | B-I | B27 | B27 |
I | B*4402 | 1M6O | 937.3 | 903.2 | 1.0 | 16 | 9 | B-III | B44 | B44 |
I | B*4403 | 1SYS | 941.9 | 1076.0 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4403 | 1N2R | 901.0 | 949.9 | 1.1 | 15 | 9 | B-I | B44 | B44 |
I | B*4405 | 1SYV | 901.8 | 915.6 | 1.0 | 15 | 9 | B-III | B44 | B44 |
I | B*5101 | 1E27 | 865.2 | 809.0 | 0.9 | 8 | 9 | B-IV | B7 | B44 |
I | B*5101 | 1E28 | 804.9 | 724.0 | 0.9 | 7 | 8 | B-IV | B7 | B44 |
I | B*5301 | 1A1M | 823.9 | 971.3 | 1.2 | 12 | 9 | B-II | B7 | B44 |
I | B*5301 | 1A1O | 965.2 | 778.8 | 0.8 | 10 | 9 | B-II | B7 | B44 |
2.2 Hierarchical clustering
A hierarchical clustering technique using the agglomerative algorithm (Barnard and Downs, 1992; Doytchinova et al., 2004; Doytchinova and Flower, 2005) was applied. The distance between the structures was computed by the single-linkage method as implemented in MATLAB version 7.0 based on the separation between the each pair of data points (Barnard and Downs, 1992). The nearest neighbours were merged into clusters. Smaller clusters were then merged into larger clusters based on inter-cluster distances, until all structures are combined. We have considered the last three levels for defining HLA class I supertypes.
2.3 Interaction parameters
Some interaction parameters have been identified as being significant for the characterization of peptide/MHC interface (Kangueane et al., 2001; Govindarajan et al., 2003) and can be computed from the 3D coordinates of a peptide–MHC complex (Berman et al., 2000; Kaas et al., 2004). These parameters were applied in this study for analyzing the binding characteristics of HLA supertypes:
2.4 Intermolecular hydrogen bonds
The number of intermolecular hydrogen bonds between the bound peptide and MHC protein was calculated using HBPLUS (McDonald and Thornton, 1994) in which hydrogen bonds are defined in accordance to standard geometric parameters. Hydrogen bonding patterns of all complexes presented in this study are available in MPID-T (Govindarajan et al., 2003; Tong et al., 2006b; Author Webpage).
2.5 Interface area between peptide and MHC
The accessible surface area (ASA; Å2) between the bound peptide and MHC is measured by tracing out the maximum permitted van der Waals' contact that is covered by the center of a water molecule as it rolls over the surface of the protein. Interface area for MHC class I complexes is defined as the mean ΔASA on complexation when going from a monomeric MHC protein to a dimeric peptide/MHC complex state and calculated as half the sum of the total ΔASA for both molecules for each type of complex. Interface area for MHC class II complexes is similarly defined (Kangueane et al., 2001).
2.6 Gap volume
The gap volume or volume (Å3) enclosed by the bound peptide and MHC protein is computed using the SURFNET program (Laskowski, 1995). The algorithm places a series of spheres (maximum radius 5.00 Å) midway between the surfaces of each pair of subunit atoms, such that its surface is in contact with the surfaces of the atoms of the pair. The size of each sphere is reduced accordingly whenever it is intercepted by other atoms and subsequently discarded if it falls below a minimum allowed radius (1.00 Å). The sizes of all the remaining allowable gap-spheres are subsequently used to compute the gap volume between the two subunits.
2.7 Gap index
One essential feature in receptor-ligand binding is the electrostatic and geometric complementarity observed between associating molecules. In this study, we adopted the use of gap index (reviewed in Jones and Thornton, 1996) as means to evaluate complementarity of interacting interfaces between the bound peptide and HLA protein:
Gap index(Å)=Gap volume between peptide/HLA(Å2)Interface ASA(Å2)(per complex).
3 RESULTS
3.1 HLA-A supertypes
Three main clusters are observed: A-I (A*0101), A-II (A*0201) and A-III (A*6801), which are consistent with the assignments of from the groups of Sette (Sette and Sidney, 1999) and Flower (Doytchinova et al., 2004). Since A-I and A-III are confined to single structures; the analysis of A-II structures is described. The mean interface area for A-II is 846.3 ± 48.9 Å2. On average, the number of intermolecular hydrogen bonds and gap index is 11.1 ± 1.9 and 0.9 ± 0.2, respectively. Extensive hydrogen bonding networks are found in binding pockets A, B and F, which corresponds to IMGT peptide/MHC contact sites (Kaas and Lefranc, 2005). No clear difference is observed in the number of intermolecular hydrogen bonds for 9mer (11.0 ± 1.8) and 10mer (11.8 ± 2.2) complexes. The gap indices, however, for the 9mer and 10mer complexes are 1.0 ± 0.2 and 0.8 ± 0.3, respectively, indicating that the interacting surfaces of 10mer complexes are generally more complementary than 9mer complexes. The interface area inversely correlates with gap index, indicating that A-II complexes with larger interface area have better geometric and electrostatic complementarities, stabilized by the formation of several intermolecular hydrogen bonds.
3.2 HLA-B supertypes
The hierarchical clustering for the structural interaction characteristics of 10 HLA-B alleles is given in Figure 1. Five main clusters can be identified in this study: B-I (B*2705, B*2709, B*4403), B-II (B*0801, B*3501, B*5301), B-III (B*4402, B*4405), B-IV (B*5101) and B-V (B*1501). Our HLA-B supertype definition largely overlaps (70% consensus) with the definition by binding motifs (Sette and Sidney, 1999; Rammensee et al., 1999). However, our data indicates that B*5101 does not share similar interaction patterns with either B*3501 and B*5301 from the SS-B7 supertype (Sette and Sidney, 1999; Rammensee et al., 1999; Lund et al., 2004); or B*4402, B*4403 and B*4405 from the DF-B44 supertype (Doytchinova et al., 2004) and may form a separate supertype instead. On average, B*5101 complexes have a smaller gap volume (766.5 Å3) and fewer intermolecular hydrogen bonds (7.5) in comparison with B*3501 (gap volume = 870.2 Å3, intermolecular hydrogen bonds = 12.0) and B*5301 (gap volume = 875.0 Å3, intermolecular hydrogen bonds = 11.0). The B*0801 allele clustered within B-II is in agreement with the B7 supertype definition based on hierarchical clustering and consensus principle component analysis on the functional residues of HLA proteins (Doytchinova et al., 2004). The deciding factors for this cluster are interface area (876.2 ± 3.0 Å2) and gap volume (850.8 ± 19.5 Å3). This allele has, however, been classified as an outlier by Sette (Sette and Sidney, 1999).
Fig. 1
Hierarchical clustering of structural interaction characteristics for 10 HLA-B alleles. Five main clusters can be identified in this study. Coloring below indicate the supertype classifications by Sette and Sidney (1999), Lund et al. (2004) and Doytchinova et al. (2004): B7, black; B27, green; B44, orange; B62, blue; Outlier/B8, red.
B-I has the largest average interface area (947.8 ± 148.0 2) among all class I supertypes investigated in this study. On average, the number of intermolecular hydrogen bonds, interface area, gap volume, and gap index for B-I is 15.5 ± 7.1, 919.5 ± 25.1, 909.4 ± 8.8 and 1.0 ± 0.0, respectively. The number of intermolecular hydrogen bonds inversely correlates with gap volume (Figure 2G; r =−0.42) and gap index (Figure 2H; r = −0.46). Hydrogen bonds are localized in pockets A, B and F. More hydrogen bonds are formed at smaller gap index compared to other HLA class I complexes B-I has the largest average interface area (947.8 ± 148.0 2) among all class I supertypes investigated in this study. On average, the number of intermolecular hydrogen bonds, interface area, gap volume, and gap index for B-I is 15.5 ± 7.1, 919.5 ± 25.1, 909.4 ± 8.8 and 1.0 ± 0.0, respectively. The number of intermolecular hydrogen bonds inversely correlates with gap volume (Figure 2G; r = −0.42) and gap index (Figure 2H; r = −0.46). Hydrogen bonds are localized in pockets A, B and F. More hydrogen bonds are formed at smaller gap index compared to other HLA class I complexes investigated herein. The average number of intermolecular hydrogen bonds, gap index, and interface area for B-II is 13.8 ± 2.5, 1.0 ± 0.2 and 879.2 ± 72.4 2, respectively. In general, intermolecular hydrogen bonds for this supertype are well distributed and do not correlate with the interface area (Figure 2I; r = 0.01), gap volume (Figure 2K, r = −0.01) and gap index (Figure 2L, r = −0.01) of complex. B*4403 clusters with B*2705 and B*2709 although the three alleles have different specificities based on existing binding motifs (Sette and Sidney, 1999). The gap volume of B*4403 (1013.0 Å3) is clearly similar to that of B27 alleles (985.6 ± 4.2 Å3) and not the other B44 alleles. Based on existing binding data (Rammensee et al., 1999; de Castro et al., 2004), B*2705 exhibits strong predominance for Arg at position 2, while B*4403 prefers Glu at this position. Nonetheless, the existence of Glu at position 2 has also been reported in B*2705-binding peptides derived from fresh blood lymphocytes (Stodůlková et al., 2004), and this dual Arg/Glu substitution mechanism also exist in the auxiliary anchor residues of other alleles, including A*3101, and B*07 (Rammensee et al., 1999). Given the caveat of existing binding data, peptide motifs for many of these alleles are primarily derived from limited sources (Rammensee et al., 1999) and only sequences that have been studied are reported. It is possible that the current motifs for many less-studied alleles are under-represented and biased towards selected anchor residues investigated in the relevant studies. This could be the norm rather than the exception as more binding assays are carried out in the laboratory.
Fig. 2
Structural interaction parameters for selected HLA supertypes, with their correlation coefficients r: (A–D) A-II, (E–H) B-I and (I–L) B-II.
4 DISCUSSION
The extremely high polymorphism of HLA alleles (Williams, 2001) has been a confounding factor in the study of HLA peptide binding specificities. For a HLA protein to recognize specific peptide, geometric and electrostatic complementarity between the receptor and its corresponding ligand is essential for the formation of chemical bonds between their functional groups, which in turn determines the net stability of the complex. In this context, we introduced the use of structural interaction information to analyze high-level relationships hidden within peptide/HLA crystallographic structures at supertype level. Such descriptors should better reflect peptide/HLA interactions than just sequence data alone.
In general, three types of hydrogen bonding patterns for peptide–HLA class I complexes investigated herein can be identified: (1) the gap index directly correlates with the number of intermolecular hydrogen bonds (Figure 2D); (2) the gap index does not correlate with the number of intermolecular hydrogen bonds (Figure 2L) and (3) the gap index inversely correlates with the number of intermolecular hydrogen bonds (Figure 2H). For the first group (A2 supertype), the majority of intermolecular hydrogen bonds are concentrated at both ends of the binding groove (in pockets A, B and F). More hydrogen bonds are observed with decreasing geometric and electrostatic complementarity (i.e. increasing gap index; Figure 2D) as well as increasing gap volume (Figure 2C). However, the correlations are weak. For the second group (B-II supertype), the gap index is independent of the number of intermolecular hydrogen bonds formed. A loss of four hydrogen bonds is noticed for B*0801, when the peptide GGKKKYQL (PDB ID: 1AGC) is modified to GGKKRYKL (PDB ID: 1AGF), while the gap index is unchanged. A possible explanation for these observations is that the interaction mechanism employed by this group may be degenerate and a combination of non-covalent interactions (hydrogen bonds, hydrophobic and ionic interactions) may be involved in peptide selection. However, it is not clear to what extent the different interactive forces contribute to the net stability of complex. For the third group (B-I supertype), the number of intermolecular hydrogen bonds increases with higher geometric and electrostatic complementarity (smaller gap index). For instance, the number of intermolecular hydrogen bonds between GRFAAAIAK and B*2705 (PDB ID: 1JGE) increases from 15 to 19 when the same peptide binds to B*2709 (PDB ID: 1K5N). Thus although the structures of B*2705 and B*2709 bind identical or similar peptides, the four parameters used in this study result in a more comprehensive characterization of the peptide–MHC interaction. This group also consists of the highest mean number of intermolecular hydrogen bonds. The results strongly indicate that the complexes formed by this group may be more stable, with higher overall geometric and electrostatic complementarity.
Computational techniques with different degrees of accuracy have been developed to model the structure of peptide–MHC complexes, and predict the binding free energy of peptides to MHC proteins (reviewed in Tong et al., 2006c). While some studies showed excellent results when applied to specific sets of alleles, the results presented here suggest that the use of a standardized set of structural interaction rules or free energy scoring functions to discriminate binding peptides may not be applicable for all MHC alleles as interaction characteristics vary across MHC supertypes. The modeling of MHC bound peptide ligands is non-trivial. Although the conformational space of MHC binding peptides is restricted by the highly conserved binding groove, peptide side-chain prediction still requires the screening of a large conformational space (Schueler-Furman et al., 1998). The use of ab initio techniques (Tong et al., 2004, 2006d; Bordner and Abagyan, 2006) and rotamer libraries (Schueler-Furman et al., 1998) are excellent approaches. For the latter, the results presented herein suggest that existing rotamer libraries within this context may be further refined by selecting rotamers from representative sets of known structures within the same MHC superfamily. Likewise, where binding free energy prediction is of concern, the design of generalized free energy scoring functions applicable for all MHC alleles may not be appropriate. Accordingly, it is important to identify key parameters for optimal predictive results for the alleles of interest (Tong et al., 2006c; Davies et al., 2006). The classification scheme presented herein is applicable to facilitate the identification of binding peptides for alleles with scarce experimental peptide information, as well as the development of predictive algorithms based on machine learning and artificial intelligence approaches.
The present analysis is difficult due to the limited number of peptide/HLA crystallographic structures in the current PDB (Berman et al., 2000). Nonetheless, we have demonstrated that different HLA proteins employ the use of different binding mechanism for selectivity of antigenic peptides in a supertype dependent manner. By focusing solely on the use of experimental 3D structures, our analysis is supported and verified by existing data. The proposed classification scheme provides an alternative to HLA supertype analysis using either sequence (Chelvanayagam, 1996; Zhang et al., 1998; Zhao et al., 2003; Cano et al., 1998; McKenzie et al., 1999; Lund et al., 2004) or receptor structure information (Doytchinova et al., 2004; Doytchinova and Flower, 2005; Kangueane et al., 2005) alone. In silico analysis of peptide–HLA interaction characteristics opens the way for more in-depth understanding of the binding mechanism involved in peptide selection and better characterization of HLA supertypes. Future work will focus on the use of molecular modeling techniques for large-scale classification of HLA class I and class II supertypes for which experimental crystallographic structures are unavailable as well as in-depth analysis of their structural interaction patterns.
Conflicts of Interest: none declared.
REFERENCES
Clustering of chemical structures on the basis of two-dimensional similarity measures
,
J. Chem. Inf. Comput. Sci.
,
1992
, vol.
32
(pg.
644
-
649
)
et al.
The Protein Data Bank
,
Nucleic Acids Res.
,
2000
, vol.
28
(pg.
235
-
242
)
Ab initio prediction of peptide-MHC binding geometry for diverse class I MHC allotypes
,
Proteins
,
2006
, vol.
63
(pg.
512
-
526
)
A roadmap for HLA-A, HLA-B, and HLA-C peptide binding specificities
,
Immunogenetics
,
1996
, vol.
45
(pg.
15
-
26
)
et al.
A geometric study of the amino acid sequence of class I HLA molecules
,
Immunogenetics
,
1998
, vol.
48
(pg.
324
-
334
)
et al.
Statistical convolution of enthalpic energetic contributions to MHC-peptide binding affinity
,
BMC Struct. Biol.
,
2006
, vol.
6
pg.
5
et al.
HLA-27: a registry of constitutive peptide ligands
,
Tissue Antigens
,
2004
, vol.
63
(pg.
424
-
445
)
et al.
Binding of a peptide antigen to multiple HLA alleles allows definition of an A2-like supertype
,
J. Immunol.
,
1995
, vol.
154
(pg.
685
-
693
)
et al.
Identifying human MHC supertypes using bioinformatic methods
,
J. Immunol.
,
2004
, vol.
172
(pg.
4314
-
4323
)
In silico identification of supertypes for class II MHCs
,
J. Immunol.
,
2005
, vol.
174
(pg.
7085
-
7095
)
et al.
Identification of naturally processed viral nonapeptides allows their quantification in infected cells and suggests an allele-specific T cell epitope forecast
,
J. Exp. Med.
,
1991
, vol.
174
(pg.
425
-
434
)
et al.
Allele-specific motifs revealed by sequencing of self-peptides eluted from MHC molecules
,
Nature
,
1991
, vol.
351
(pg.
290
-
296
)
et al.
MPID: MHC-Peptide Interaction Database for sequence-structure-function information on peptides binding to MHC molecules
,
Bioinformatics
,
2003
, vol.
19
(pg.
309
-
310
)
et al.
Characterization of peptides bound to the class I MHC molecule HLA-A2.1 by mass spectrometry
,
Science
,
1992
, vol.
255
(pg.
1261
-
1263
)
Principles of protein–protein interactions
,
Proc. Natl Acad. Sci. USA
,
1996
, vol.
93
(pg.
13
-
20
)
et al.
IMGT/3Dstructure-DB and IMGT/StructuralQuery, a database and a tool for immunoglobulin, T cell receptor and MHC structural data
,
Nucleic Acids Res.
,
2004
, vol.
32
(pg.
D208
-
D210
)
T cell receptor/peptide/MHC molecular characterization and standardized pMHC contact sites in IMGT/3Dstructure-DB
,
In Silico Biol.
,
2005
, vol.
5
(pg.
505
-
528
)
et al.
Towards the MHC-peptide combinatorics
,
Hum Immunol.
,
2001
, vol.
62
(pg.
539
-
556
)
et al.
A framework to sub-type HLA supertypes
,
Front. Biosci.
,
2005
, vol.
10
(pg.
879
-
886
)
et al.
Identification of helper T cell epitopes that encompass or lie proximal to cytotoxic T cell epitopes in the gp100 melanoma tumor antigen
,
Cancer Res.
,
2001
, vol.
61
(pg.
7577
-
7584
)
SURFNET: a program for visualizing molecular surfaces, cavities and intermolecular interactions
,
J. Mol. Graph.
,
1995
, vol.
13
(pg.
323
-
330
)
,
The T Cell Receptor FactsBook
,
2001
London
Academic Press
et al.
Definition of supertypes for HLA molecules using clustering of specificity matrices
,
Immunogenetics
,
2004
, vol.
12
(pg.
797
-
810
)
Satisfying hydrogen bonding potential in proteins
,
J. Mol. Biol.
,
1994
, vol.
238
(pg.
777
-
793
)
et al.
Taxonomic hierarchy of HLA class I allele sequences
,
Genes Immun.
,
1999
, vol.
1
(pg.
120
-
129
)
et al.
Identification of promiscuous T cell epitope in Mycobacterium tuberculosis Mce proteins
,
Infect. Immun.
,
2002
, vol.
70
(pg.
79
-
85
)
et al.
Peptides naturally presented by MHC class I molecules
,
Annu. Rev. Immunol.
,
1993
, vol.
11
(pg.
213
-
244
)
et al.
SYFPEITHI: database for MHC ligands and peptide motifs
,
Immunogenetics
,
1999
, vol.
50
(pg.
213
-
219
)
et al.
Exact prediction of a natural T cell epitope
,
Eur. J. Immunol.
,
1991
, vol.
21
(pg.
2891
-
2894
)
et al.
Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes
,
Fold Des.
,
1998
, vol.
3
(pg.
549
-
564
)
Nine major HLA class I supertypes account for vast preponderance of HLA-A and -B polymorphism
,
Immunogenetics
,
1999
, vol.
50
(pg.
201
-
212
)
et al.
The development of multi-epitope vaccines: epitope identification, vaccine design and clinical evaluation
,
Biologicals
,
2001
, vol.
29
(pg.
271
-
276
)
et al.
Optimizing vaccine design for cellular processing, MHC binding and TCR recognition
,
Tissue Antigens
,
2002
, vol.
59
(pg.
443
-
451
)
et al.
Peptides eluted from HLA-B27 of human splenocytes and blood cells reveal a similar but partially different profile compared to in vitro grown cell lines
,
Immunology Lett.
,
2004
, vol.
94
(pg.
261
-
265
)
et al.
Modeling the structure of bound peptide ligands to major histocompatibility complex
,
Protein Sci.
,
2004
, vol.
13
(pg.
2523
-
2532
)
et al.
Modeling the bound conformation of _Pemphigus vulgaris_-associated peptides to MHC class II DR and DQ alleles
,
Immunome Res.
,
2006
, vol.
2
pg.
1
et al.
MPID-T: database for sequence-structure-function information on TCR/peptide/MHC interactions
,
Appl. Bioinformatics
,
2006
, vol.
5
(pg.
111
-
114
)
et al.
Methods and protocols for predicting immunogenic epitopes
,
Brief. Bioinformatics
,
2006
in press, Advance Access published on October 31, 2006. doi:10.1093/bib/bbl038
et al.
Prediction of HLA-DQ3.2β ligands: evidence of multiple registers in class II binding peptides
,
Bioinformatics
,
2006
, vol.
22
(pg.
1232
-
1238
)
‘Human leukocyte antigen gene polymorphism and the histocompatibility laboratory’
,
J. Mol. Diagn.
,
2001
, vol.
3
(pg.
98
-
104
)
et al.
Structural principles that govern the peptide-binding motifs of class I MHC molecules
,
J. Mol. Biol.
,
1998
, vol.
281
(pg.
929
-
947
)
et al.
Compression of functional space in HLA-A sequence diversity
,
Hum. Immunol.
,
2003
, vol.
64
(pg.
718
-
728
)
et al.
Improving MHC binding peptide prediction by incorporating binding data of auxiliary MHC molecules
,
Bioinformatics
,
2006
, vol.
22
(pg.
1648
-
1655
)
Author notes
Associate Editor: Dmitrij Frishman
© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
Citations
Views
Altmetric
Metrics
Total Views 807
572 Pageviews
235 PDF Downloads
Since 2/1/2017
Month: | Total Views: |
---|---|
February 2017 | 13 |
March 2017 | 7 |
April 2017 | 4 |
May 2017 | 7 |
June 2017 | 1 |
July 2017 | 7 |
August 2017 | 2 |
October 2017 | 1 |
December 2017 | 11 |
January 2018 | 10 |
February 2018 | 7 |
March 2018 | 7 |
April 2018 | 11 |
May 2018 | 15 |
June 2018 | 8 |
July 2018 | 6 |
August 2018 | 14 |
September 2018 | 8 |
October 2018 | 5 |
November 2018 | 8 |
December 2018 | 16 |
January 2019 | 7 |
February 2019 | 31 |
March 2019 | 13 |
April 2019 | 12 |
May 2019 | 11 |
June 2019 | 3 |
July 2019 | 6 |
August 2019 | 10 |
September 2019 | 14 |
October 2019 | 12 |
November 2019 | 6 |
December 2019 | 7 |
January 2020 | 6 |
February 2020 | 9 |
March 2020 | 8 |
April 2020 | 13 |
May 2020 | 2 |
June 2020 | 8 |
July 2020 | 7 |
August 2020 | 5 |
September 2020 | 5 |
October 2020 | 11 |
November 2020 | 8 |
December 2020 | 7 |
January 2021 | 4 |
February 2021 | 10 |
March 2021 | 7 |
April 2021 | 6 |
May 2021 | 21 |
June 2021 | 12 |
July 2021 | 12 |
August 2021 | 11 |
September 2021 | 6 |
October 2021 | 15 |
November 2021 | 7 |
December 2021 | 1 |
January 2022 | 11 |
February 2022 | 9 |
March 2022 | 8 |
April 2022 | 5 |
May 2022 | 17 |
June 2022 | 10 |
July 2022 | 15 |
August 2022 | 18 |
September 2022 | 9 |
October 2022 | 10 |
November 2022 | 12 |
December 2022 | 10 |
January 2023 | 9 |
February 2023 | 6 |
March 2023 | 10 |
April 2023 | 14 |
May 2023 | 2 |
June 2023 | 3 |
July 2023 | 7 |
August 2023 | 8 |
September 2023 | 6 |
October 2023 | 5 |
November 2023 | 4 |
January 2024 | 5 |
February 2024 | 10 |
March 2024 | 8 |
April 2024 | 15 |
May 2024 | 9 |
June 2024 | 11 |
July 2024 | 11 |
August 2024 | 11 |
September 2024 | 5 |
October 2024 | 13 |
Citations
11 Web of Science
×
Email alerts
Citing articles via
More from Oxford Academic