Nisha Bajiya - Academia.edu (original) (raw)
Papers by Nisha Bajiya
Protein Science, Oct 15, 2023
Frontiers in bioinformatics, Feb 6, 2024
In the past, several methods have been developed for predicting the single-label subcellular loca... more In the past, several methods have been developed for predicting the single-label subcellular localization of messenger RNA (mRNA). However, only limited methods are designed to predict the multi-label subcellular localization of mRNA. Furthermore, the existing methods are slow and cannot be implemented at a transcriptome scale. In this study, a fast and reliable method has been developed for predicting the multi-label subcellular localization of mRNA that can be implemented at a genome scale. Machine learning-based methods have been developed using mRNA sequence composition, where the XGBoost-based classifier achieved an average area under the receiver operator characteristic (AUROC) of 0.709 (0.668-0.732). In addition to alignment-free methods, we developed alignment-based methods using motif search techniques. Finally, a hybrid technique that combines the XGBoost model and the motif-based approach has been developed, achieving an average AUROC of 0.742 (0.708-0.816). Our method-MRSLpred-outperforms the existing stateof-the-art classifier in terms of performance and computation efficiency. A publicly accessible webserver and a standalone tool have been developed to facilitate researchers (webserver: https://webs.iiitd.edu.in/raghava/mrslpred/).
bioRxiv (Cold Spring Harbor Laboratory), Jan 19, 2023
In the past, number of methods have been developed for predicting single label subcellular locali... more In the past, number of methods have been developed for predicting single label subcellular localization of mRNA in a cell. Only limited methods had been built to predict multi-label subcellular localization of mRNA. Most of the existing methods are slow and cannot be implemented at transcriptome scale. In this study, a fast and reliable method had been developed for predicting multi-label subcellular localization of mRNA that can be implemented at genome scale. Firstly, deep learning method based on convolutional neural network method have been developed using one-hot encoding and attained an average AUROC-0.584 (0.543-0.605). Secondly, machine learning based methods have been developed using mRNA sequence composition, our XGBoost classifier achieved an average AUROC-0.709 (0.668-0.732). In addition to alignment free methods, we also developed alignment-based methods using similarity and motif search techniques. Finally, a hybrid technique has been developed that combine XGBoost models and motif-based searching and achieved an average AUROC 0.742 (0.708-0.816). Our method-MRSLpred, developed in this study is complementary to the existing method. One of the major advantages of our method over existing methods is its speed, it can scan all mRNA of a transcriptome in few hours. A publicly accessible webserver and a standalone tool has been developed to facilitate researchers (Webserver: https://webs.iiitd.edu.in/raghava/mrslpred/).
Briefings in Bioinformatics
In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-res... more In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-resistant strains of bacteria. Phage therapy, a century-old technique, may serve as an alternative to antibiotics in treating bacterial infections caused by drug-resistant strains of bacteria. In this review, a systematic attempt has been made to summarize phage-based therapy in depth. This review has been divided into the following two sections: general information and computer-aided phage therapy (CAPT). In the case of general information, we cover the history of phage therapy, the mechanism of action, the status of phage-based products (approved and clinical trials) and the challenges. This review emphasizes CAPT, where we have covered primary phage-associated resources, phage prediction methods and pipelines. This review covers a wide range of databases and resources, including viral genomes and proteins, phage receptors, host genomes of phages, phage–host interactions and lytic protein...
In the past, number of methods have been developed for predicting single label subcellular locali... more In the past, number of methods have been developed for predicting single label subcellular localization of mRNA in a cell. Only limited methods had been built to predict multi-label subcellular localization of mRNA. Most of the existing methods are slow and cannot be implemented at transcriptome scale. In this study, a fast and reliable method had been developed for predicting multi-label subcellular localization of mRNA that can be implemented at genome scale. Firstly, deep learning method based on convolutional neural network method have been developed using one-hot encoding and attained an average AUROC - 0.584 (0.543 – 0.605). Secondly, machine learning based methods have been developed using mRNA sequence composition, our XGBoost classifier achieved an average AUROC - 0.709 (0.668 - 0.732). In addition to alignment free methods, we also developed alignment-based methods using similarity and motif search techniques. Finally, a hybrid technique has been developed that combine XGB...
This study focuses on the development of in silico models for predicting antibacterial peptides a... more This study focuses on the development of in silico models for predicting antibacterial peptides as a potential solution for combating antibiotic-resistant strains of bacteria. Existing methods for predicting antibacterial peptides are mostly designed to target either gram-positive or gram-negative bacteria. In this study, we introduce a novel approach that enables the prediction of antibacterial peptides against several bacterial groups, including gram-positive, gram-negative, and gram-variable bacteria. Firstly, we developed an alignment-based approach using BLAST to identify antibacterial peptides and achieved poor sensitivity. Secondly, we employed a motif-based approach to predict antibacterial peptides and obtained high precision with low sensitivity. To address the similarity issue, we developed machine learning-based models using a variety of compositional and binary features. Our machine learning-based model developed using the amino acid binary profile of terminal residues ...
Protein Science, Oct 15, 2023
Frontiers in bioinformatics, Feb 6, 2024
In the past, several methods have been developed for predicting the single-label subcellular loca... more In the past, several methods have been developed for predicting the single-label subcellular localization of messenger RNA (mRNA). However, only limited methods are designed to predict the multi-label subcellular localization of mRNA. Furthermore, the existing methods are slow and cannot be implemented at a transcriptome scale. In this study, a fast and reliable method has been developed for predicting the multi-label subcellular localization of mRNA that can be implemented at a genome scale. Machine learning-based methods have been developed using mRNA sequence composition, where the XGBoost-based classifier achieved an average area under the receiver operator characteristic (AUROC) of 0.709 (0.668-0.732). In addition to alignment-free methods, we developed alignment-based methods using motif search techniques. Finally, a hybrid technique that combines the XGBoost model and the motif-based approach has been developed, achieving an average AUROC of 0.742 (0.708-0.816). Our method-MRSLpred-outperforms the existing stateof-the-art classifier in terms of performance and computation efficiency. A publicly accessible webserver and a standalone tool have been developed to facilitate researchers (webserver: https://webs.iiitd.edu.in/raghava/mrslpred/).
bioRxiv (Cold Spring Harbor Laboratory), Jan 19, 2023
In the past, number of methods have been developed for predicting single label subcellular locali... more In the past, number of methods have been developed for predicting single label subcellular localization of mRNA in a cell. Only limited methods had been built to predict multi-label subcellular localization of mRNA. Most of the existing methods are slow and cannot be implemented at transcriptome scale. In this study, a fast and reliable method had been developed for predicting multi-label subcellular localization of mRNA that can be implemented at genome scale. Firstly, deep learning method based on convolutional neural network method have been developed using one-hot encoding and attained an average AUROC-0.584 (0.543-0.605). Secondly, machine learning based methods have been developed using mRNA sequence composition, our XGBoost classifier achieved an average AUROC-0.709 (0.668-0.732). In addition to alignment free methods, we also developed alignment-based methods using similarity and motif search techniques. Finally, a hybrid technique has been developed that combine XGBoost models and motif-based searching and achieved an average AUROC 0.742 (0.708-0.816). Our method-MRSLpred, developed in this study is complementary to the existing method. One of the major advantages of our method over existing methods is its speed, it can scan all mRNA of a transcriptome in few hours. A publicly accessible webserver and a standalone tool has been developed to facilitate researchers (Webserver: https://webs.iiitd.edu.in/raghava/mrslpred/).
Briefings in Bioinformatics
In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-res... more In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-resistant strains of bacteria. Phage therapy, a century-old technique, may serve as an alternative to antibiotics in treating bacterial infections caused by drug-resistant strains of bacteria. In this review, a systematic attempt has been made to summarize phage-based therapy in depth. This review has been divided into the following two sections: general information and computer-aided phage therapy (CAPT). In the case of general information, we cover the history of phage therapy, the mechanism of action, the status of phage-based products (approved and clinical trials) and the challenges. This review emphasizes CAPT, where we have covered primary phage-associated resources, phage prediction methods and pipelines. This review covers a wide range of databases and resources, including viral genomes and proteins, phage receptors, host genomes of phages, phage–host interactions and lytic protein...
In the past, number of methods have been developed for predicting single label subcellular locali... more In the past, number of methods have been developed for predicting single label subcellular localization of mRNA in a cell. Only limited methods had been built to predict multi-label subcellular localization of mRNA. Most of the existing methods are slow and cannot be implemented at transcriptome scale. In this study, a fast and reliable method had been developed for predicting multi-label subcellular localization of mRNA that can be implemented at genome scale. Firstly, deep learning method based on convolutional neural network method have been developed using one-hot encoding and attained an average AUROC - 0.584 (0.543 – 0.605). Secondly, machine learning based methods have been developed using mRNA sequence composition, our XGBoost classifier achieved an average AUROC - 0.709 (0.668 - 0.732). In addition to alignment free methods, we also developed alignment-based methods using similarity and motif search techniques. Finally, a hybrid technique has been developed that combine XGB...
This study focuses on the development of in silico models for predicting antibacterial peptides a... more This study focuses on the development of in silico models for predicting antibacterial peptides as a potential solution for combating antibiotic-resistant strains of bacteria. Existing methods for predicting antibacterial peptides are mostly designed to target either gram-positive or gram-negative bacteria. In this study, we introduce a novel approach that enables the prediction of antibacterial peptides against several bacterial groups, including gram-positive, gram-negative, and gram-variable bacteria. Firstly, we developed an alignment-based approach using BLAST to identify antibacterial peptides and achieved poor sensitivity. Secondly, we employed a motif-based approach to predict antibacterial peptides and obtained high precision with low sensitivity. To address the similarity issue, we developed machine learning-based models using a variety of compositional and binary features. Our machine learning-based model developed using the amino acid binary profile of terminal residues ...