An automated workflow by using KNIME Analytical Platform: a case study for modelling and predicting HIV-1 protease inhibitors (original) (raw)

HIVprotI: an integrated web based platform for prediction and design of HIV proteins inhibitors

Journal of cheminformatics, 2018

A number of anti-retroviral drugs are being used for treating Human Immunodeficiency Virus (HIV) infection. Due to emergence of drug resistant strains, there is a constant quest to discover more effective anti-HIV compounds. In this endeavor, computational tools have proven useful in accelerating drug discovery. Although methods were published to design a class of compounds against a specific HIV protein, but an integrated web server for the same is lacking. Therefore, we have developed support vector machine based regression models using experimentally validated data from ChEMBL repository. Quantitative structure activity relationship based features were selected for predicting inhibition activity of a compound against HIV proteins namely protease (PR), reverse transcriptase (RT) and integrase (IN). The models presented a maximum Pearson correlation coefficient of 0.78, 0.76, 0.74 and 0.76, 0.68, 0.72 during tenfold cross-validation on ICand percent inhibition datasets of PR, RT, I...

QSAR models for prediction study of HIV protease inhibitors using support vector machines, neural networks and multiple linear regression

Arabian Journal of Chemistry, 2017

Support vector machines (SVM) represent one of the most promising Machine Learning (ML) tools that can be applied to develop a predictive quantitative structure-activity relationship (QSAR) models using molecular descriptors. Multiple linear regression (MLR) and artificial neural networks (ANNs) were also utilized to construct quantitative linear and non linear models to compare with the results obtained by SVM. The prediction results are in good agreement with the experimental value of HIV activity; also, the results reveal the superiority of the SVM over MLR and ANN model. The contribution of each descriptor to the structure-activity relationships was evaluated.

Hybrid-genetic algorithm based descriptor optimization and QSAR models for predicting the biological activity of Tipranavir analogs for HIV protease inhibition

Journal of Molecular Graphics and Modelling, 2010

The prediction of biological activity of a chemical compound from its structural features plays an important role in drug design. In this paper, we discuss the quantitative structure activity relationship (QSAR) prediction models developed on a dataset of 170 HIV protease enzyme inhibitors. Various chemical descriptors that encode hydrophobic, topological, geometrical and electronic properties are calculated to represent the structures of the molecules in the dataset. We use the hybrid-GA (genetic algorithm) optimization technique for descriptor space reduction. The linear multiple regression analysis (MLR), correlation-based feature selection (CFS), non-linear decision tree (DT), and artificial neural network (ANN) approaches are used as fitness functions. The selected descriptors represent the overall descriptor space and account well for the binding nature of the considered dataset. These selected features are also human interpretable and can be used to explain the interactions between a drug molecule and its receptor protein (HIV protease). The selected descriptors are then used for developing the QSAR prediction models by using the MLR, DT and ANN approaches. These models are discussed, analyzed and compared to validate and test their performance for this dataset. All three approaches yield the QSAR models with good prediction performance. The models developed by DT and ANN are comparable and have better prediction than the MLR model. For ANN model, weight analysis is carried out to analyze the role of various descriptors in activity prediction. All the prediction models point towards the involvement of hydrophobic interactions. These models can be useful for predicting the biological activity of new untested HIV protease inhibitors and virtual screening for identifying new lead compounds.

Pattern recognition: Application of Support Vector Machines, Artificial Neural Networks and Decision Trees for anti-HIV activity prediction of organic compounds

2011 International Conference on Multimedia Computing and Systems, 2011

Atstaa-Predicting the biological activity of molecules from their chemical structures ir a principal problem in drug discovery. Pattern recognition has gained attention as metbods covering this need. ln this study three classification models for anti-HIV activity, based on pattern recognition methods such as' Support Vector Macbines, Artificial Neural Networks and Decision . Trees, are developed. All models give good results in tearaing and prediction phases. These results indicate that these models ctn be used a3 an allernative tool for classilicotion problems in structure antËIIW activity relationship.

Support vector machines: Development of QSAR models for predicting anti-HIV-1 activity of TIBO derivatives

European Journal of Medicinal Chemistry, 2010

The tetrahydroimidazo [4,5,1-jk][1,4]benzodiazepinone (TIBO) derivatives, as non-nucleoside reverse transcriptase inhibitors, acquire a significant place in the treatment of the infections by the HIV. In the present paper, the support vector machines (SVM) are used to develop quantitative relationships between the anti-HIV activity and four molecular descriptors of 82 TIBO derivatives. The results obtained by SVM give good statistical results compared to those given by multiple linear regressions and artificial neural networks. The contribution of each descriptor to structure-activity relationships was evaluated. It indicates the importance of the hydrophobic parameter. The proposed method can be successfully used to predict the anti-HIV of TIBO derivatives with only four molecular descriptors which can be calculated directly from molecular structure alone.

Regression QSAR Models for Predicting HIV-1 Integrase Inhibitors

Pre-print, 2021

The Human Immunodeficiency Virus (HIV) infection is a global pandemic that has claimed 33 million lives to date. One of the most efficacious treatment for naïve or pre-treated HIV patients is with the HIV integrase strand transfer inhibitors (INSTIs). However, given that HIV treatment is lifelong , the emergence of HIV-1 strains resistant to INSTIs is an imminent challenge. In this work, we showed two best regression QSAR models that were constructed using a boosted Random Forest algorithm (r 2 = 0.998, q 10CV 2 = 0.721, q external_test 2 = 0.754) and a boosted K* algorithm (r 2 = 0.987, q 10CV 2 = 0.721, q external_test 2 = 0.758) to predict the pIC50 values of INSTIs. Subsequently, the regression QSAR models were deployed against the Drugbank database for drug repositioning. The top ranked compounds were further evaluated for their target engagement activity using molecular docking studies and their potential as INSTIs evaluated from our literature search. Our study offers the first example of a large-scale regression QSAR modelling effort for discovering highly active INSTIs to combat HIV infection.

MOESM1 of HIVprotI: an integrated web based platform for prediction and design of HIV proteins inhibitors

2018

Additional file 1. Supporting information including Table S1. Performance of QSAR predictive models on three times randomly picked ~ 10% independent/validation data. These models were developed using remaining ~ 90% data during training/testing respectively for each of the six datasets; Table S2. Details of statistical parameters used for the development of IC50 based QSAR models; Table S3. Details of statistical parameters used for the development of percent inhibition based QSAR models; Table S4. Details of chemical descriptors used in the development of IC50 based QSAR models; Table S5. Details of chemical descriptors used in the development of percent inhibition based QSAR models; Table S6. Details of slopes k (predicted vs. observed inhibition) and k' (observed vs. predicted inhibition) of the regression lines for the QSAR models; Table S7. Details of Y-randomization test performed on the QSAR models; Figure S1. Chemical space analysis of QSAR studies (Table 5) for Protease...

Using hybrid GA-ANN to predict biological activity of HIV protease inhibitors

2008

Abstract The prediction of biological activity of a chemical compound from its structural features, representing its physico-chemical properties, plays an important role in drug discovery, design and development. Since the biological data is highly non-linear, the machine-learning techniques have been widely used for modeling it. In the present work, the clustering, genetic algorithm (GA) and artificial neural networks (ANN) are used to develop computational prediction models on a dataset of HIV protease inhibitors.

Machine Learning Model and Molecular Docking for Screening Medicinal Plants as HIV-1 Reverse Transcriptase Inhibitors

Karbala international journal of modern science, 2024

The human immunodeficiency virus type 1 reverse transcriptase (HIV-1 RT) plays a significant role in viral replication and is one of the targets for anti-HIV. However, a mutation in viral strains rapidly developed the resistance of the com-pounds to the protein, reducing the effectiveness of the inhibitors. This work seeks to utilize machine learning-based quantitative structure-activity relationship (QSAR) analysis in combination with molecular docking simulations to forecast the presence of active compounds derived from medicinal plants. Specifically, the objective is to identify com-pounds that have the potential to operate as inhibitors of HIV-1 reverse transcriptase (RT), encompassing both wild-type and mutant variants. It is demonstrated that some substances are no longer suitable as inhibitors due to changes in the HIV-1 RT enzyme. Based on the screening results, four medicinal plants, Melissa officinalis, Punica granatum, Psidium guajava, and Curcuma longa, are worth further investigation. Nevertheless, the findings from the in vitro study suggest that extracts derived from pomegranate rind and guava leaves exhibit significant promise as HIV-1 RT inhibitors.