PDZBase: a protein–protein interaction database for PDZ-domains (original) (raw)
Journal Article
,
*To whom correspondence should be addressed.
Search for other works by this author on:
,
Search for other works by this author on:
,
Search for other works by this author on:
,
Search for other works by this author on:
Search for other works by this author on:
Revision received:
05 October 2004
Accepted:
08 October 2004
Published:
28 October 2004
Cite
Thijs Beuming, Lucy Skrabanek, Masha Y. Niv, Piali Mukherjee, Harel Weinstein, PDZBase: a protein–protein interaction database for PDZ-domains, Bioinformatics, Volume 21, Issue 6, March 2005, Pages 827–828, https://doi.org/10.1093/bioinformatics/bti098
Close
Navbar Search Filter Mobile Enter search term Search
Abstract
Summary: PDZBase is a database that aims to contain all known PDZ-domain-mediated protein–protein interactions. Currently, PDZBase contains approximately 300 such interactions, which have been manually extracted from >200 articles. The database can be queried through both sequence motif and keyword-based searches, and the sequences of interacting proteins can be visually inspected through alignments (for the comparison of several interactions), or as residue-based diagrams including schematic secondary structure information (for individual complexes).
Availability: http://icb.med.cornell.edu/services/pdz/start.
Contact: pdzbase@med.cornell.edu.
INTRODUCTION
PDZ (PSD-95, Discs-large, ZO-1) domains are ubiquitous protein–protein interaction domains comprising about 70–90 residues (Nourry et al., 2003; Hung and Sheng, 2002; Fan and Zhang, 2002; Sheng and Sala, 2001; van Ham and Hendriks, 2003). They are involved in numerous interactions with various proteins, in a variety of biological processes. Some proteins contain multiple copies of PDZ-domains, often in combination with other protein–protein interaction domains. This architecture enables simultaneous interactions among several proteins, thus turning the PDZ-domain-containing proteins into putative ‘molecular switchboards’ (Dueber et al., 2003). The most prominent role of PDZ-containing proteins appears to be the assembly of protein complexes at the plasma membrane, where they bind to the C-termini of membrane proteins.
The specificity of PDZ-domain-based interactions is determined primarily by the sequence of the C-terminus of the proteins they bind. Thus, the specificity of PDZ-interactions has been traditionally attributed to the last three residues of the ligand (i.e. positions P-0, P-1, P-2, counting backwards from the terminal residue in the ligand). A classification of PDZ-binding motifs in the C-termini has been proposed, in which the consensus sequence for class I is S/T–X–Φ, and for class II is Φ–X–Φ (where Φ is any hydrophobic residue), with the corresponding PDZ-domain classified into class I or class II binding (Songyang et al., 1997). More recently, this classification system has been challenged based on the discovery that some PDZ-binding sequences do not belong to either of the two classes and the observation that certain PDZ-domains promiscuously bind to both class I and class II ligands. Moreover, it has become apparent that residues further N-terminal are important for specificity as well. Indeed, the human genome alone contains hundreds of PDZ-domains that are able to bind specific targets, a feat which would seem difficult to achieve with as few as three or four (relatively similar) recognition sites. A comprehensive comparative analysis of a large number of PDZ-peptide complexes is expected to yield further insights into determinants of specificity. Such a task, and other types of comparative analyses of the important PDZ-interactions will require an easily accessible source of specialized data, given the very large number of complexes between PDZ-domains and ligands that have been reported. To construct such a source of data, we have systematically analyzed all articles in the PubMed database containing the keyword PDZ (>1000 articles) and have constructed a comprehensive database to store the known interactions. A web-based server, named PDZBase, that allows for querying and analyzing of this database of PDZ-domain-mediated protein–protein interactions, is presented below.
DATABASE CONTENT
PDZBase currently contains ∼300 interactions, all of which have been manually extracted from the literature, and have been independently verified by two curators. The extracted information comes from in vivo (co-immunoprecipitation) or in vitro experiments (GST-fusion or related pull-down experiments). Interactions identified solely from high throughput methods (e.g. yeast two-hybrid or mass spectrometry) were not included in PDZBase. Other prerequisites for inclusion in the database are: (1) that knowledge of the binding sites on both interacting proteins must be available (for instance through a truncation or mutagenesis experiment); (2) that interactions must be mediated directly by the PDZ-domain, and not by any other possible domain within the protein. The database is continuously maintained and will be updated regularly with new interactions reported in the literature.
IMPLEMENTATION AND DATABASE SCHEMA
For PDZBase, we have used the Java Data Object API (JDO) to connect between the web application logic layer and the database backend. This technology presents the strong advantage that persistent objects are modeled as object-oriented classes (in the Java language), but can be stored either in a relational DBMS or an object-oriented DBMS. The Kodo implementation of JDO has been used to connect to an Oracle 8.1.7 backend. The JDOQL is used to query the database.
The database structure schema consists of the classes PossibleInteraction, PDZProtein, PDZDomain and Ligand. The PossibleInteraction class contains the PDZProtein, the interacting PDZDomain, the Ligand, the literature reference and information about the location of the interface. To permit storage of non-existing interactions (negative controls) as well, a Phenotype field in PossibleInteraction can indicate whether the interaction exists or not. The PDZProtein and Ligand classes contain the Swiss-Prot accession code (Boeckmann et al., 2003) the amino acid sequence of the protein and the organism. The PDZDomain class contains the start and end points of the domain, and its location within the protein (i.e. which domain number).
DATABASE ACCESS
PDZBase currently provides a simple search interface (Fig. 1 left) that enables the database to be queried for interactions using the names and external identifiers (e.g. Swiss-Prot; Boeckmann et al., 2003) of the interacting proteins. Additionally, sets of proteins in the database can be retrieved if they have a specific sequence motif in common, or share a specific residue type at a certain position. For instance, querying with ‘S-X-V’ in the ‘Enter a motif to query ligands’ search field (Fig. 1) returns all ligands in PDZBase with a Ser at P-2 and a Val at P-0. A query with ‘aB1 H’ in the ‘Enter a generic position and a residue type to query PDZ domains’ search field returns all PDZ-domains with a His at position αB1 (the first residue of the second α-helix). All interactions involving the chosen subset of proteins can then be retrieved. The sequences of interacting proteins can be visualized as an HTML-formatted alignment, and exported as a FASTA format text-file. Finally, each interaction is linked to a details-page, which shows the residues of the interacting proteins on a 2D-diagram generated by the residue-based-diagram-generator (RbDg; Campagne et al., 2003) (Fig. 1, right), and provides interaction-specific links to other databases (Swiss-Prot; Boeckmann et al., 2003 PubMed).
Fig. 1
PDZBase interface. Proteins or interactions can be queried using names, identifiers, or motifs (left). Details of interactions are represented as 2D-diagrams (right).
We thank Dr Nathalie Basdevant for curating part of the interactions. The work is supported by NIH grants K05 DA00060, P01 DA124080 and P01 DA12923.
REFERENCES
Boeckmann, B., Bairoch, A., Apweiler, R., Blatter, M.C., Estreicher, A., Gasteiger, E., Martin, M.J., Michoud, K., O'Donovan, C., Phan, I., Pilbout, S., Schneider, M.
2003
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.
Nucleic Acids Res.
31
365
–370
Campagne, F., Bettler, E., Vriend, G., Weinstein, H.
2003
Batch mode generation of residue-based diagrams of proteins.
Bioinformatics
19
1854
–1855
Dueber, J.E., Yeh, B.J., Chak, K., Lim, W.A.
2003
Reprogramming control of an allosteric signaling switch through modular recombination.
Science
301
1904
–1908
Fan, J.S. and Zhang, M.
2002
Signaling complex organization by PDZ domain proteins.
Neurosignals
11
315
–321
Hung, A.Y. and Sheng, M.
2002
PDZ domains: structural modules for protein complex assembly.
J. Biol. Chem.
277
5699
–5702
Nourry, C., Grant, S.G., Borg, J.P.
2003
PDZ domain proteins: plug and play!.
Sci. STKE
2003
RE7
Sheng, M. and Sala, C.
2001
PDZ domains and the organization of supramolecular complexes.
Annu. Rev. Neurosci.
24
1
–29
Songyang, Z., Fanning, A.S., Fu, C., Xu, J., Marfatia, S.M., Chishti, A.H., Crompton, A., Chan, A.C., Anderson, J.M., Cantley, L.C.
1997
Recognition of unique carboxyl-terminal motifs by distinct PDZ domains.
Science
275
73
–77
van Ham, M. and Hendriks, W.
2003
PDZ domains–glue and guide.
Mol. Biol. Rep.
30
69
–82
© The Author 2004. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oupjournals.org
Citations
Views
Altmetric
Metrics
Total Views 6,541
5,748 Pageviews
793 PDF Downloads
Since 11/1/2016
Month: | Total Views: |
---|---|
November 2016 | 10 |
December 2016 | 3 |
January 2017 | 41 |
February 2017 | 68 |
March 2017 | 85 |
April 2017 | 56 |
May 2017 | 74 |
June 2017 | 62 |
July 2017 | 62 |
August 2017 | 52 |
September 2017 | 47 |
October 2017 | 66 |
November 2017 | 71 |
December 2017 | 110 |
January 2018 | 111 |
February 2018 | 115 |
March 2018 | 127 |
April 2018 | 97 |
May 2018 | 88 |
June 2018 | 76 |
July 2018 | 94 |
August 2018 | 69 |
September 2018 | 70 |
October 2018 | 108 |
November 2018 | 86 |
December 2018 | 61 |
January 2019 | 60 |
February 2019 | 55 |
March 2019 | 109 |
April 2019 | 94 |
May 2019 | 86 |
June 2019 | 49 |
July 2019 | 68 |
August 2019 | 42 |
September 2019 | 48 |
October 2019 | 57 |
November 2019 | 46 |
December 2019 | 37 |
January 2020 | 64 |
February 2020 | 63 |
March 2020 | 58 |
April 2020 | 112 |
May 2020 | 53 |
June 2020 | 47 |
July 2020 | 40 |
August 2020 | 32 |
September 2020 | 51 |
October 2020 | 65 |
November 2020 | 43 |
December 2020 | 39 |
January 2021 | 74 |
February 2021 | 75 |
March 2021 | 81 |
April 2021 | 85 |
May 2021 | 55 |
June 2021 | 69 |
July 2021 | 81 |
August 2021 | 47 |
September 2021 | 83 |
October 2021 | 81 |
November 2021 | 71 |
December 2021 | 94 |
January 2022 | 85 |
February 2022 | 83 |
March 2022 | 72 |
April 2022 | 77 |
May 2022 | 115 |
June 2022 | 76 |
July 2022 | 65 |
August 2022 | 59 |
September 2022 | 64 |
October 2022 | 72 |
November 2022 | 70 |
December 2022 | 71 |
January 2023 | 104 |
February 2023 | 59 |
March 2023 | 72 |
April 2023 | 68 |
May 2023 | 65 |
June 2023 | 76 |
July 2023 | 80 |
August 2023 | 61 |
September 2023 | 24 |
October 2023 | 72 |
November 2023 | 76 |
December 2023 | 55 |
January 2024 | 76 |
February 2024 | 60 |
March 2024 | 79 |
April 2024 | 76 |
May 2024 | 65 |
June 2024 | 51 |
July 2024 | 57 |
August 2024 | 34 |
September 2024 | 49 |
October 2024 | 50 |
Citations
158 Web of Science
×
Email alerts
Citing articles via
More from Oxford Academic