CLC-Pred: A freely available web-service for in silico prediction of human cell line cytotoxicity for drug-like compounds (original) (raw)

CLC-Pred 2.0: A Freely Available Web Application for In Silico Prediction of Human Cell Line Cytotoxicity and Molecular Mechanisms of Action for Druglike Compounds

International Journal of Molecular Sciences

In vitro cell-line cytotoxicity is widely used in the experimental studies of potential antineoplastic agents and evaluation of safety in drug discovery. In silico estimation of cytotoxicity against hundreds of tumor cell lines and dozens of normal cell lines considerably reduces the time and costs of drug development and the assessment of new pharmaceutical agent perspectives. In 2018, we developed the first freely available web application (CLC-Pred) for the qualitative prediction of cytotoxicity against 278 tumor and 27 normal cell lines based on structural formulas of 59,882 compounds. Here, we present a new version of this web application: CLC-Pred 2.0. It also employs the PASS (Prediction of Activity Spectra for Substance) approach based on substructural atom centric MNA descriptors and a Bayesian algorithm. CLC-Pred 2.0 provides three types of qualitative prediction: (1) cytotoxicity against 391 tumor and 47 normal human cell lines based on ChEMBL and PubChem data (128,545 st...

In silico prediction of drug toxicity

Journal of computer-aided molecular design

It is essential, in order to minimise expensive drug failures due to toxicity being found in late development or even in clinical trials, to determine potential toxicity problems as early as possible. In view of the large libraries of compounds now being handled by combinatorial chemistry and high-throughput screening, identification of putative toxicity is advisable even before synthesis. Thus the use of predictive toxicology is called for. A number of in silico approaches to toxicity prediction are discussed. Quantitative structure-activity relationships (QSARs), relating mostly to specific chemical classes, have long been used for this purpose, and exist for a wide range of toxicity endpoints. However, QSARs also exist for the prediction of toxicity of very diverse libraries, although often such QSARs are of the classification type; that is, they predict simply whether or not a compound is toxic, and do not give an indication of the level of toxicity. Examples are given of all of...

First report on predictive chemometric modeling, 3D-toxicophore mapping and in silico screening of in vitro basal cytotoxicity of diverse organic chemicals

Toxicology in Vitro, 2013

Classification and regression based quantitative structure-toxicity relationship (QSTR) as well as toxicophore models were developed for the first time on basal cytotoxicity data (in vitro 3T3 neutral red uptake data) of a diverse series of chemicals (including drugs and environmental pollutants) collected from the ACuteTox database (http://www.acutetox.eu/). Statistically significant QSTR models were obtained using linear discriminant analysis (classification) and partial least squares (regression) methodologies. Generated toxicophore models showed four important features responsible for basal cytotoxicity: (i) two hydrophobic aliphatic groups (HYD Aliphatic), (ii) ring aromatic group (RA) and (iii) hydrogen bond donor (HBD). The most predictive hypothesis (Hypo 1) had a correlation coefficient of 0.932 for the training set, a low rms deviation of 1.105, and an acceptable cost difference of 62.8 bits, which represents a true correlation and a good predictivity. QSTR and toxicophore models were rigorously validated internally as well as externally along with the randomization test to nullify the possibilities of chance correlation. Our in silico models enable to identify the essential structural attributes and quantify the prime molecular prerequisites which were chiefly responsible for in vitro basal cytotoxicity. The developed models were also implemented to screen basal cytotoxicity for huge number DrugBank database (http://www.drugbank.ca/) compounds.

A structural analysis of the differential cytotoxicity of chemicals in the NCI60 cancer cell lines

Bioorganic & Medicinal Chemistry, 2008

The primary functions of cancer chemotherapeutic agents are not only to inhibit the growth or kill the cancer cells, but to do so without eliciting unreasonable cytotoxic effects on the healthy cells and to withstand the ability of the cancer cells to develop resistance against it. This has unfortunately been proven so far to be a very difficult objective. In this perspective, the ability of small molecules (anti-tumor agents) to 'see' different cell types differently can be a key attribute. Thus the term 'differential cytotoxicity' is normally used to describe the drug's specificity. In the present paper, we have quantified differential cytotoxicity from a study of the chemicals tested in the National Cancer Institute's Developmental Therapeutics Program. The MULTICASE (Multiple Computer Automated Structure Evaluation) methodology was used to discover statistically significant structural fragments (biophores) related to the differential cytotoxicity of the compounds. We found that even small structural features often become important in this regard which is evident from the biophores that were found in structurally diverse chemicals. By utilizing the difference between the raw and normalized differential cytotoxicity indices, we found that the a,b-unsaturated carbonyl group (O@CAC@CH 2 ) is the major biophore associated with compounds with essentially parallel concentration profiles in the cell lines in question. These compounds have high non-normalized differential cytotoxicity but considerably low normalized differential cytotoxocity. The models developed were cross validated for their predictive ability.

Prediction of anticancer molecules using hybrid model developed on molecules screened against NCI-60 cancer cell lines

BMC cancer, 2015

In past, numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. In contrast, limited attempt have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. In this study, we described a hybrid method developed on thousands of anticancer and non-anticancer molecules tested against National Cancer Institute (NCI) 60 cancer cell lines. Our analysis of anticancer molecules revealed that majority of anticancer molecules contains 18-24 carbon atoms and are dominated by functional groups like R2NH, R3N, ROH, RCOR, and ROR. It was also observed that certain substructures (e.g., 1-methoxy-4-methylbenzene, 1-methoxy benzene, Nitrobenzene, Indole, Propenyl benzene) are more abundant in anticancer molecules. Next, we developed anticancer molecule prediction models using various machine-l...

Activity and toxicity modelling of some NCI selected compounds against leukemia P388ADR cell line using genetic algorithm-multiple linear regressions

Journal of King Saud University - Science, 2018

Cancer-causing nature is one of the toxicological endpoints bringing about the most elevated concern. Likewise, the standard bioassays in rodents used to survey the cancer-mitigating capability of chemicals and medications are expensive and require the sacrifice of animals. Thus, we have endeavored the development of a worldwide QSAR model utilizing an information set of 85 compounds, including drugs for their anti-leukemia potential. Considering expansive number of information focuses with different structural elements utilized for model development (ntraining = 68) and model validation (ntest = 17), the model developed in this study has an encouraging statistical quality (leave-one-out Q2 = 0.833, R2pred = 0.716) for pLC50 and (leave-one-out Q2 = 0.744, R2pred = 0.614) for pGI50. Our developed model suggests that the absence of methanal fragments, low dipole moment and presence of some 2D autocorrelated molecular descriptors reduces the carcinogenicity. Branching, size and shape are found to be crucial factors for drug-mitigating carcinogenicity.

Computational toxicology in drug development

Drug Discovery Today, 2008

Computational tools for predicting toxicity have been envisaged for their potential to considerably impact the attrition rate of compounds in drug discovery and development. In silico techniques like knowledge-based expert systems (quantitative) structure activity relationship tools and modeling approaches may therefore help to significantly reduce drug development costs by succeeding in predicting adverse drug reactions in preclinical studies. It has been shown that commercial as well as proprietary systems can be successfully applied in the pharmaceutical industry. As the prediction has been exhaustively optimized for early safety-relevant endpoints like genotoxicity, future activities will now be directed to prevent the occurrence of undesired toxicity in patients by making these tools more relevant to human disease.

A Simple and Readily Integratable Approach to Toxicity Prediction

Journal of Chemical Information and Modeling, 2003

A simple, highly extensible computational strategy to assess compound toxicity has been developed with the premise that a compound's toxicity can be gauged from the toxicities of structurally similar compounds. Using a reference set of 13645 compounds with reported acute toxicity endpoint dose data (oral, rat-LD 50 data normalized in mg/kg), a generic utility which assigns a compound the average toxicity of structurally similar compounds is shown to correlate well with reported values. In a leave-one-out simulation using the requirement that at least one structurally similar member in a "voting consortium" is present within a reference set, the strategy demonstrates a predictive correlation (q∧2) of 0.82 with 57.3% of the compounds being predicted. Similar leave-one-out simulations on a set of 1781 drugs demonstrate a q∧2 of 0.74 with 51.8% of the compounds being predicted. Simulations to optimize similarity cutoff definitions, consortium member size, and reference set size illustrate that a significant improvement in the quality and quantity of predictions can be obtained by increasing the reference set size. Similar application of the strategy to subchronic and chronic toxicity data should be possible given reasonably sized reference sets.

Compound Cytotoxicity Profiling Using Quantitative High-Throughput Screening

Environmental Health Perspectives, 2007

BACKGROUND: The propensity of compounds to produce adverse health effects in humans is generally evaluated using animal-based test methods. Such methods can be relatively expensive, lowthroughput, and associated with pain suffered by the treated animals. In addition, differences in species biology may confound extrapolation to human health effects. OBJECTIVE: The National Toxicology Program and the National Institutes of Health Chemical Genomics Center are collaborating to identify a battery of cell-based screens to prioritize compounds for further toxicologic evaluation. METHODS: A collection of 1,408 compounds previously tested in one or more traditional toxicologic assays were profiled for cytotoxicity using quantitative high-throughput screening (qHTS) in 13 human and rodent cell types derived from six common targets of xenobiotic toxicity (liver, blood, kidney, nerve, lung, skin). Selected cytotoxicants were further tested to define response kinetics. RESULTS: qHTS of these compounds produced robust and reproducible results, which allowed cross-compound, cross-cell type, and cross-species comparisons. Some compounds were cytotoxic to all cell types at similar concentrations, whereas others exhibited species-or cell type-specific cytotoxicity. Closely related cell types and analogous cell types in human and rodent frequently showed different patterns of cytotoxicity. Some compounds inducing similar levels of cytotoxicity showed distinct time dependence in kinetic studies, consistent with known mechanisms of toxicity. CONCLUSIONS: The generation of high-quality cytotoxicity data on this large library of known compounds using qHTS demonstrates the potential of this methodology to profile a much broader array of assays and compounds, which, in aggregate, may be valuable for prioritizing compounds for further toxicologic evaluation, identifying compounds with particular mechanisms of action, and potentially predicting in vivo biological response. KEY WORDS: 1,536-well, cell viability, NTP 1,408 compound library, PubChem, qHTS, RT-CES.

Data Mining the NCI60 to Predict Generalized Cytotoxicity

Journal of Chemical Information and Modeling - J CHEM INF MODEL, 2008

Elimination of cytotoxic compounds in the early and later stages of drug discovery can help reduce the costs of research and development. Through the application of principal components analysis (PCA), we were able to data mine and prove that ∼89% of the total log GI 50 variance is due to the nonspecific cytotoxic nature of substances. Furthermore, PCA led to the identification of groups of structurally unrelated substances showing very specific toxicity profiles, such as a set of 45 substances toxic only to the Leukemia_SR cancer cell line. In an effort to predict nonspecific cytotoxicity on the basis of the mean log GI 50 , we created a decision tree using MACCS keys that can correctly classify over 83% of the substances as cytotoxic/ noncytotoxic in silico, on the basis of the cutoff of mean log GI 50 ) -5.0. Finally, we have established a linear model using least-squares in which nine of the 59 available NCI60 cancer cell lines can be used to predict the mean log GI 50 . The model has R 2 ) 0.99 and a root-mean-square deviation between the observed and calculated mean log GI 50 (RMSE) ) 0.09. Our predictive models can be applied to flag generally cytotoxic molecules in virtual and real chemical libraries, thus saving time and effort.