BitterDB: a database of bitter compounds (original) (raw)

Abstract

Basic taste qualities like sour, salty, sweet, bitter and umami serve specific functions in identifying food components found in the diet of humans and animals, and are recognized by proteins in the oral cavity. Recognition of bitter taste and aversion to it are thought to protect the organism against the ingestion of poisonous food compounds, which are often bitter. Interestingly, bitter taste receptors are expressed not only in the mouth but also in extraoral tissues, such as the gastrointestinal tract, indicating that they may play a role in digestive and metabolic processes. BitterDB database, available at http://bitterdb.agri.huji.ac.il/bitterdb/, includes over 550 compounds that were reported to taste bitter to humans. The compounds can be searched by name, chemical structure, similarity to other bitter compounds, association with a particular human bitter taste receptor, and so on. The database also contains information on mutations in bitter taste receptors that were shown to influence receptor activation by bitter compounds. The aim of BitterDB is to facilitate studying the chemical features associated with bitterness. These studies may contribute to predicting bitterness of unknown compounds, predicting ligands for bitter receptors from different species and rational design of bitterness modulators.

INTRODUCTION

The bitterness recognition puzzle

The mammalian sense of taste is mediated by receptor proteins located in the oral cavity. Each of the five basic taste qualities – sour, salty, sweet, bitter and umami, serves a specific function in identifying components found in an animal's diet (1,2). Since many toxic plant metabolites taste bitter to mammals, bitter taste receptors are thought to have evolved to protect the organism against the ingestion of poisonous food compounds. Bitter compounds, surveyed in (3–7), are very diverse in terms of their chemical structure and physicochemical properties (8,9).

How can hundreds or more of structurally diverse compounds be detected by a limited number of receptors is an intriguing question. In particular, in humans, bitter-taste perception is mediated by 25 G-protein coupled receptors (GPCRs) of the hTAS2R gene family (10). Characterization of the receptive range of many hTAS2Rs (11,12) suggests that, this receptor family consists of both broadly tuned and narrowly tuned receptors. Apparently, individual hTAS2Rs are activated by a battery of both structurally related and unrelated bitter compounds, while preserving selectivity for chemical groups and even stereo-selectivity (11,13,14). The structural basis for hTAS2Rs’ unique ability to recognize numerous chemically diverse, and low-affinity (micromolar range) agonists is not fully understood. Assignment of bitter tastants to individual receptors has been accomplished for some ligands (11,12,14–16) but remains unknown for many others.

Not only taste perception

Recent reports about expression of taste receptors in non gustatory tissues suggest that these proteins may have additional functions apart from detection of taste (8,17,18). Of the extraoral tissues expressing taste receptors, the gastrointestinal tract has received much attention since mounting evidence indicates that after being ingested, bitter tastants might play important regulatory roles in digestive and metabolic processes (18). hTAS2Rs were also found in human airway smooth muscle tissue and the effect of bitter tastants on function of human bronchi is currently being studied (19,20).

Identification and design of modulators

Studying bitter molecules and designing selective and potent agonists and antagonists of bitter taste receptors has broad applications, both in the field of food sciences and for therapeutic indications. To date, antagonist molecules have been found by massive screening (21) and unexpectedly, when studying bitter taste receptor signaling (22). Electronic databases of compounds, such as the publicly available ZINC database (23) (http://zinc.docking.org/), have been extensively used in recent years for computer-aided drug design. The need for focused and specialized resources of compounds for virtual screening is growing as computational methods are being applied to novel research fields beyond classical drug discovery. BitterDB is an important addition to the recent databases in the field of chemical senses, such as sweet tastant molecules database (24) and database of volatile compounds relevant to odors and scents (25).

Motivation

To summarize, taste recognition is important for food sensing and possibly plays additional physiological roles. Characterizing molecules as bitter and assigning them to a specific receptor represents a crucial step towards prediction and rational modulation of bitterness. Creating a database of known bitter compounds, which, in addition to associations between bitter molecules and their receptors, also contains the chemicophysical properties of the ligands and mutagenesis data relevant for ligand binding [e.g. (26,27)], is an essential step in this direction.

DATABASE OVERVIEW

BitterDB overview/description

BitterDB currently contains more than 550 compounds that were cited in the literature as bitter. For each compound, BitterDB offers information regarding its molecular properties, references for the compound's bitterness, including additional information about the bitterness category of the compound (e.g. a ‘bitter-sweet’ or ‘slightly bitter’ annotation), different compound identifiers (smiles, CAS registry number, IUPAC systematic name), an indication whether the compound is derived from a natural source or is synthetic, a link to the compound's PubChem entry (28) and different file formats for downloading (sdf, image, smiles).

Around 100 bitter compounds have been experimentally linked to their corresponding human bitter taste receptors (12,15,16,29,30). For those compounds, BitterDB provides additional information, including links to the publications indicating these ligand–receptor interactions, the effective concentration for receptor activation and/or the EC50 value and links to the associated bitter taste receptors entries in the BitterDB.

Importantly, the BitterDB database offers its users a rich interface for querying and browsing the bitter compounds dataset. BitterDB is designed to enable the user to easily find specific bitter compounds, as well as to perform more advanced searches for a range of bitter compounds with specific properties. The ‘simple search’ option allows one to retrieve bitter compounds by name, identifiers (smiles description, PubChem ID, CAS registry number) or a specific associated receptor (Figure 1A). The ‘advanced search’ allows one to retrieve compounds that fit different criteria, such as a combination of specific physical properties (see Example 1 and Figure 1A) or a combination of associated human bitter taste receptors. The 2D structure similarity search allows one to retrieve bitter compounds that are similar to a chemical structure query (Example 2 and Figure 1A). In addition to the different querying options, the user can browse through a table with all the BitterDB compounds and their properties. The compounds in the table can be sorted according to different criteria (Example 3 and Figure 1B).

Figure 1.

Figure 1.

Screenshots of the BitterDB interface. (A) Illustration of three available search options: (i) Simple search: search compounds using keywords. (ii) Structure-based similarity search: 2D similarity search using different structural formats (drawing, smiles and user uploaded structure files). (iii) Advanced search, including searching by chemical descriptors, molecule identifiers and by associated human bitter taste receptors. Orphan receptors (shown in gray) cannot be searched by ligands. (B) The BitterDB compound browse table (which can be sorted by different criteria). (C) An example of a query result page: the user can download an SDF file of selected compounds from the result page. (D) A representative compound entry page.

BitterDB also contains data about the 25 known human bitter taste receptors. Several properties are displayed for each receptor: molecular weight, protein sequence, genomic location [according to the UCSC genome browser (31)], as well as links to the corresponding UCSC Genome browser (31) and UniProt (32) entries. For each receptor, a list of its known bitter ligands, collected from the literature, is displayed with links to the individual ligand entries and an option to download ligand structures as a SDF file. A 2D protein diagram, which offers an intuitive way of looking at the sequence of the receptor and its trans-membrane (TM) regions, is displayed for each receptor. For several receptors, data regarding mutagenesis experiments performed on the receptor sequence were found in the literature. For these receptors we generated a mutation table that summarizes the mutagenesis experiments and their effects on receptor function.

The bitter taste receptors can be searched using different criteria: name, known ligands and UniProt accession number. Using a table that presents all the bitter taste receptors and some information about them, the user can browse and also sort by various options, such as the number of bitter ligands associated with the receptor.

In addition, BitterDB offers two extra features: a local BLAST service to determine the local similarity between a query sequence and the different human bitter taste receptors (Example 4), and a global alignment of the 25 human bitter taste receptors. The alignment was generated using ClustalW2 (33), and displays the secondary structure of each receptor, as predicted by the TOPCONS server (34). In addition, the most conserved residue in each TM helix is annotated by its Ballesteros–Weinstein numbering scheme (35); the most conserved residue in a given TM is assigned the index X.50, where X is the TM number, and the remaining residues are numbered relative to this position.

Importantly, BitterDB offers an ‘upload’ option, by which the users are encouraged to submit information about bitter compounds. These data will be reviewed by the authors and uploaded to the database accordingly. The users can join the BitterDB mailing list in order to receive database updates.

MATERIALS AND METHODS

Data acquisition

Data were collected from the literature and from various web resources. Chemical compounds were included in the database only if they were described as bitter in the literature, or if they were reported experimentally as capable of activating at least one human bitter taste receptor. The main resources were (numbers in brackets indicate the numbers of compounds retrieved): the Merck index (36) [_300_], Fenaroli's Handbook of Flavor Ingredients (37) [_94_] and a recent publication by Meyerhof et al. (12) [_98_]. Most of the 2D chemical structures of the compounds and the IUPAC systematic names were taken from PubChem (28). A set of physical descriptors such as molecular weight, logP, molecular solubility, water solubility, H-bond donors, H-bond acceptors and the number of rings, aromatic rings and rotatable bonds were computed for each compound using the ‘calculate molecular properties protocol’ in Discovery Studio 2.5.5 (DS2.5.5; Accelrys, Inc.).

Data on human bitter taste receptors were taken mainly from the UniprotKB database (32). Experimental mutagenesis data on the bitter taste receptors were collected from the literature. The main resource was Brockhoff et al. (26). The secondary structure/topology of the receptors was predicted by the TOPCONS server (34).

Structure

Similarity searches

MarvinSketch, 5.3.8, 2010 (ChemAxon, Inc.), an advanced chemical editor for drawing chemical structures by the user, was implemented as the built-in molecular editor. The structure similarity search was implemented with Open Babel Package, version 2.2.3 http://openbabel.sourceforge.net/ (accessed December 2010). The degree of similarity between the query molecule and database entries is assessed by the Tanimoto coefficient using the MACCS fingerprints. MACCS fingerprints are a set of bitstrings; each bit refers to the presence or absence of one of the 166 MACCS keys in a given molecule. The MACCS fingerprints were pre-calculated for all the BitterDB molecules and are calculated during the search for the query structure, in order to compare it to the database entries. The Tanimoto coefficient, which is a number between 0 and 1 (1 corresponding to ‘maximum similarity’), is used to measure the distance between two molecules (query and database entry). The Tanimoto coefficient calculates the number of bit positions set to 1 in both fingerprints, divided by the number of bit positions set to 1 in at least one of the fingerprints. The top 20 results are returned.

Blast

BitterDB contains a local BLAST service, powered by the NCBI BLAST package (38). We created a custom database of the 25 protein sequences of the human bitter taste receptors whose query sequence is searched against. The user is able to control the different search parameters, such as scoring matrix, gap cost and others.

Visualization tools/features

BitterDB contains a range of visualization tools. These include a 2D compound structure display and a 2D image for each compound, which are generated using Marvin, 5.3.8, 2010 (ChemAxon http://www.chemaxon.com/), a 2D transmembrane protein diagram with indication of the most conserved residue in each helix and residues for which mutation data is available (see Figure 2) for each human bitter receptor, which is based on the TOPCONS secondary structure prediction (34) and an annotated bitter receptor alignment, which is powered by JalView (39).

Figure 2.

Figure 2.

2D transmembrane protein diagram. A convenient representation of the receptor sequence and its TM regions, shown here for hTAS2R16. The seven TM helices are displayed as predicted by Topcons. The most conserved residue in each helix X (BW number X.50) has black background. Residues for which mutation data are available are marked with red, and information about the mutations can be found in the site ‘Known Mutations Table’ section below the diagram http://bitterdb.agri.huji.ac.il/bitterdb/Receptor.php?id=16#ReceptorKnownMutations.

Web Server

BitterDB was designed as a relational database on a MariaDB server http://mariadb.org/. The website was built with the PHP server side, and web access is enabled via an Apache HTTP Server. The site is best viewed in Firefox, Opera, Chrome and Safari browsers.

EXAMPLES

Examples of BitterDB usage are provided below. A movie illustrating each example was recorded from the BitterDB website; the movies can be accessed from the ‘Examples’ link of the database, http://bitterdb.agri.huji.ac.il/bitterdb/examples.php

Possible usage: filtering molecules using their properties

Specific Example #1: Searching for bitter molecules that satisfy Lipinski's rule of five criteria

It is often useful to obtain a subset of molecules having particular physicochemical properties. As an example, we demonstrate how one can obtain a subset of bitter molecules that satisfy Lipinski's rule of five (40). This rule, based on properties of known orally available drugs, states that in general, an orally active drug has no more than one violation of the following criteria: Not more than 5 H-bond donors or 10 H-bond acceptors, a molecular mass under 500Da, and an octanol-water partition coefficient (logP) of less than five (40). In movie #1 (http://bitterdb.agri.huji.ac.il/bitterdb/examples.php#ex1), we demonstrate how, using Lipinksi's criteria as a filter, we retrieve 83% of the molecules in our database. These molecules satisfy Lipniski's rule of five and therefore can be considered ‘drug-like’.

Specific Example #2: Searching for compounds with a 2D structure similarity to aristolochic acid

It is commonly assumed that chemically similar compounds may bind to the same receptor (41–43). However, the chemical features of ligands of the same bitter taste receptor are very diverse (see Figure 3). It is therefore worthwhile to extend our exploration of the chemical space around known ligands of bitter taste receptors, in order to gain more information about the physicochemical determinants for bitterness. Furthermore, since most of the bitter compounds are not yet assigned to a particular bitter taste receptor target, similarity of a query compound to a ligand with known receptor association(s) may provide a helpful clue.

Figure 3.

Figure 3.

A subset of four hTAS214 ligands demonstrating a high degree of structural variability. These four structurally diverse compounds, which are all hTAS214 ligands, demonstrate that hTAS214 ligands lie in a broad chemical spectrum that includes compounds from different classes such as aromatic polycyclic compounds (1), polyacetylenes (2), sulfoxides (3) and cycloparaffins (4).

In Example 2 (http://bitterdb.agri.huji.ac.il/bitterdb/examples.php#ex2) we searched for compounds with a 2D structure similarity to aristolochic acid, a known ligand of the bitter taste receptor hTAS2R14 (as well as hTAS2R43 and hTAS2R44) (12). Nine of the top ten results were not yet assigned to any bitter receptor target. However, result number six, Noscapine, is also a ligand of hTAS2R14. This interesting result suggests that additional aromatic compounds that bear resemblance to aristolochic acid could be potential hTAS2R14 ligands.

Possible usage: Browsing and sorting bitter compounds

Specific Example #3: Sorting compounds by their logP value

Although the structural and physicochemical determinants for bitterness remain largely unknown, there are some chemical properties within specific chemical families that are related to bitterness (44–46). For example, several studies proposed a correlation between hydrophobicity and bitterness (44,45). Additionally, a recent study showed that bitterness of the soy peptides is predominantly associated with molecular mass (47).

Hence, it is interesting to sort the dataset of BitterDB compounds by different molecular properties and to explore the associated ranges of bitterness. In this example (http://bitterdb.agri.huji.ac.il/bitterdb/examples.php#ex3), we sorted the dataset according to the logP-value, which is one of the means of measuring small molecules’ hydrophobicity.

Possible usage: relating bitter taste receptors from different species to human bitter receptors

Specific Example #4: finding human bitter taste receptors with sequence similarity to the zebra fish (Danio rerio) bitter taste receptor TAS2R5

Organisms’ sense of bitter taste helps them avoid toxic and harmful substances. Although research on bitter taste in humans has advanced rapidly, less is known about taste perception in other organisms. Thus, it is beneficial to relate novel taste receptors to human ones, and learn about potential common ligands, relevant known mutations, etc.

The Zebrafish has only four intact TAS2R genes as opposed to 25 genes in humans. In this example (http://bitterdb.agri.huji.ac.il/bitterdb/examples.php#ex4), the zebrafish TAS2R5 receptor protein sequence was searched against the human bitter taste receptors sequences in BitterDB. The goal was to find the most similar human receptors using the BitterDB local BLAST service. The human receptor with the highest score for local similarity is hTAS2R20 (also known as hTAS2R49). hTAS2R20 has two known ligands—diphenidol and cromolyn, which, to the best of our knowledge, have not yet been tested on zebrafish T2Rs. The second most similar receptor, hTAS2R42, is still an orphan receptor, and the next one, hTAS2R4, has 15 known ligands. Interestingly, one of the hTAS2R4 ligands is denatonium benzoate, which was also reported as a zebrafish TAS2R5 ligand (48). Detailed studies of ligand–receptor interactions from different species are needed to better understand the range of activity of the different receptors and its relation to the number of receptor types in a given species. Yet, as a first guess, the receptor similarity option can be used to provide a preliminary list of suspected ligands, based on the ligands of human receptors with the highest sequence similarity, which were already de-orphanized.

SUMMARY AND OUTLOOK

Our goals in the present study were (i) to compile a comprehensive dataset of all chemical structures that have been cited in the literature as having bitter taste, and (ii) to assign bitter ligands to their cognate receptors in humans, where possible. To the best of our knowledge, although a proprietary database of bitter compounds was established in the past (5), BitterDB is the first publicly available, electronically searchable database of bitter molecules.

Interestingly, not only does the receptor receptive range vary within bitter taste receptors (12)—also the number of functional bitter taste receptors varies among different species [e.g. (49)]. The number of receptors in a given species may relate to the receptors’ receptive range, the effects of variation within the population (4) and the environmental pressure. The ability to understand and predict these effects relies on the knowledge of what is ‘bitter’ (or aversive) to different species, and on the ability to cluster ligands based on their chemical similarity. BitterDB will be extended in the future to include additional species, and thus will contribute to elucidating evolutionary aspects of bitter taste perception.

Fundamental questions as well as potential therapeutic applications of the modulation of bitter taste receptors continue to motivate further development of the BitterDB database. Future extensions may include implementing 3D similarity searches of compounds, modeling 3D structures of the receptors and mapping single nucleotide polymorphisms (SNPs) onto these models.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Files.

FUNDING

Niedersachsen-Israel Research cooperation fund and BSF grant number 2007296. Funding for open access charge: Intramural grant from the Hebrew University.

Conflict of interest statement. None declared.

ACKNOWLEDGEMENTS

The authors are grateful to Dvorah Art-Weisman, Tomer Levy and Marylin Safran for technical help and Dr Merav Fichman for discussions. MYN holds the Women's League for Israel Senior Lectureship for Nutrition.

REFERENCES