Lung Cancer Predictive Analysis Using Optimized Ensemble and Hybrid Machine Learning Techniques (original) (raw)

Enhancing lung cancer disease diagnosis by employing ensemble deep learning approaches

Indonesian Journal of Electrical Engineering and Computer Science

Cancer is a disease that results from the unnatural proliferation of aberrant cells that infest the body’s healthy cells and spread throughout the body. Lung cancer is characterized by an imbalance in the cells of the affected organs, namely the lungs. The prediction of lung cancer at an early stage is very important, particularly in countries that are densely populated and have lower incomes. Clinically conventional approaches, such as blood tests and other types of treatments, are used by specialists. The age of artificial intelligence (AI) has begun, and today, it is feasible to construct a computer-aided diagnostic mechanism with the assistance of machine learning and deep learning algorithms. In this particular piece of research, one deep learning algorithm, an artificial neural network (ANN), has been investigated to determine whether or not lung cancer could be detected at an earlier stage. In addition to conventional ANN, ensemble ANN with weighted averaging and soft and har...

Performance evaluation of deep learning techniques for lung cancer prediction

Springer - Soft Computing, 2023

Due to the increase in pollution, the number of deaths caused by lung disease is rising rapidly. It is essential to predict the disease in earlier stages by means of high-level knowledge and acquaintance. Deep learning-based lung cancer prediction plays a vital role in assisting the medical practioners for diagnosing lung cancer in earlier stage. Computer-Aided diagnosis is considered to bring a boost to the field of medicine by tying it to automated systems. In this research paper, several models are experimented by using chest X-ray image or CT scan as an input to detect a particular disease. This research work is carried out to identify the best performing deep learning techniques for lung disease prediction. The performance of the method is evaluated using various performance metrics, such as precision, recall, accuracy and Jaccard index.

Predicting Lung Cancer Survivability: a Machine Learning Ensemble Method on Seer Data

Ensemble methods are powerful techniques used in machine learning to improve the prediction accuracy of classifier learning systems. In this study, different ensemble learning methods for lung cancer survival prediction were evaluated on the Surveillance, Epidemiology and End Results (SEER) dataset. Data were preprocessed in several steps before applying classification models. The popular ensemble methods Bagging, Adaboost and three classification algorithms, K-Nearest Neighbours, Decision Tree and Neural Networks as base classifiers were evaluated for lung cancer survival prediction. The results empirically showed that ensemble methods are able to evaluate the performance of their base classifiers and they are appropriate methods for analysis of cancer survival.

Literature Survey for Lung Cancer Analysis and Prediction

International Journal for Research in Applied Science & Engineering Technology (IJRASET), 2022

Lung cancer is one of the most common and deadly cancers worldwide which can be cured only if it is discovered at an early stage. Lung cancer can be diagnosed using various technologies, including MRI, isotopes, X-rays, and CT. One of the most effective ways to fight cancer is to discover it early enough to significantly improve the patient's chances of survival which can be done by the means of Artificial Intelligence. The proposed approach uses past medical records to determine if the patient has lung cancer. The CT scans are analyzed by a Convolutional Neural Network (CNN) model to determine the stage of cancer. Finally, the suggested model would forecast the patient's estimated medical insurance costs. Machine learning (ML) and Deep Learning (DL) approaches will be used to train and test the models by utilizing open-source datasets.

Artificial Intelligence–Based Prediction of Lung Cancer Risk Using Nonimaging Electronic Medical Records: Deep Learning Approach

Journal of Medical Internet Research, 2021

Background Artificial intelligence approaches can integrate complex features and can be used to predict a patient’s risk of developing lung cancer, thereby decreasing the need for unnecessary and expensive diagnostic interventions. Objective The aim of this study was to use electronic medical records to prescreen patients who are at risk of developing lung cancer. Methods We randomly selected 2 million participants from the Taiwan National Health Insurance Research Database who received care between 1999 and 2013. We built a predictive lung cancer screening model with neural networks that were trained and validated using pre-2012 data, and we tested the model prospectively on post-2012 data. An age- and gender-matched subgroup that was 10 times larger than the original lung cancer group was used to assess the predictive power of the electronic medical record. Discrimination (area under the receiver operating characteristic curve [AUC]) and calibration analyses were performed. Result...

Lung Cancer Prediction Using Ensemble Learning

International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 2021

Lung Cancer is the most commonly occurring type of cancer in the world. Despite all the research in the field of lung cancer is still maintains a extremely high mortality rate and a cure rate of of less than 15%. Majority of lung cancer patients are diagnosed at a very advanced stage which is why randomized clinical trials have come under intense scrutiny from the medical practitioners and have led to a new resurgence of interest in its screening methods and development of newer techniques to improve its efficiency. The existing screening and detection techniques have known to be slow, cost ineffective and have other discrepancies such as false positives. Keeping this in mind we propose to use ensemble learning methods to train our data-set to overcome the drawbacks and improve upon the individual algorithms.

Lung cancer disease identification using hybrid models

The Scientific Temper

Using hybrid models, we present a novel method for detecting lung cancer in this study. Our method uses the random forest and convolutional neural network (CNN) techniques to incorporate machine learning and deep learning advantages. The proposed composite method combines structured clinical data with unprocessed imaging data for a more complete lung cancer diagnosis. The CNN component of our hybrid model excels at extracting features from images of lung cancer, while the random forest component excels at capturing complex relationships in structured data. For greater precision and consistency, the results of the two models may be averaged. The hybrid model outperforms the existing methods. The hybrid model acquired an accuracy rate of 98%. Future lung cancer detection will be rapid and accurate due to the hybrid model’s improved performance and decreased inference periods.

A Comparative Study of Machine Learning Based Classifications on Lung Cancer Detection

Journal of Artificial Intelligence and System Modelling, 2024

This research addresses a significant gap in lung cancer prediction, focusing on the critical need for highly accurate models to improve early detection and treatment outcomes. Despite advances in machine learning, achieving higher accuracy in classification models for lung cancer remains a persistent challenge. This study aims to fill this gap by developing advanced machine-learning models specifically designed to enhance the prediction of lung cancer outcomes. The research employs a rigorous methodology that involves the systematic exploration of various machine learning algorithms applied to a comprehensive lung cancer dataset. Through meticulous phases of model development, training, validation, and testing, this study evaluates the performance and accuracy of the proposed models. The findings underscore the effectiveness of these models in addressing the challenge of early lung cancer detection, offering more precise and reliable classification outcomes than existing approaches. The novelty of this research lies in the extensive experimental work, which demonstrates the superiority of the selected algorithm for lung cancer data classification. This not only highlights the potential of the algorithm for clinical application but also provides a concrete solution to the ongoing issue of achieving higher accuracy in lung cancer prediction models. By bridging this research gap, the study offers promising new avenues for early-stage lung cancer prognosis and treatment strategies, with the potential to significantly impact clinical practices.

Analysis of Lung Disease Prediction using Machine Learning Algorithms

International Journal of Advanced Research in Science, Communication and Technology (IJARSCT), 2024

Lung diseases are a notable in global health concern, requiring early diagnosis for better recovery and survival rates. Deep learning strategies, especially CNNs, have shown great promise in self learning lung disease diagnosis from medical images like chest X-rays. Ensemble learning methods using pretrained networks such as VGG16, InceptionV3, and MobileNetV2 have achieved up to 94% accuracy in identifying conditions like COVID-19, pneumonia, and lung opacity. Lightweight CNN models also performed well, with accuracy up to 89.89%. Traditional machine learning algorithms, including Random Forest and Logistic Regression, yielded accuracy rates between 88% and 90%. A hybrid deep learning approach, combining CNN based feature extraction with classifiers like AdaBoost, SVM, and Random Forest, improved classification accuracy by 3.1% and reduced computational complexity by 16.91%. This hybrid method highlights the main feature for integrating deep learning with traditional classifiers to enhance lung disease detection efficiency

A Selected Deep Learning Cancer Prediction Framework

IEEE Access

Deep learning (DL) algorithms are crucial for predicting various diseases because they can analyze a large amount of healthcare data within a short prediction time. One of these diseases is cancer, which causes one out of six deaths worldwide. Many researchers have adopted predictive frameworks such as machine learning and DL to predict cancer prognosis, in addition to the probability of its recurrence, progression, and the patients' survival estimation. Currently, all stakeholders are interested in the accuracy of cancer prognosis prediction. This study selected a framework within high accuracy and short prediction time from three DL frameworks for improving the performance of cancer prognosis prediction. This prediction requires a quick and high-accuracy optimizer, so we propose a binary version of the continuous AC-parametric whale optimization algorithm. This version is built on S-shaped transfer functions to identify the minimal optimal subset of features and maximize the classification accuracy. These frameworks proposed have the following forms: the first is a Feed-Forward Neural Network (FFNN) in which the input is the optimal set of feature selection. The second is an optimized parameter FFNN. The third is composed of a feature selection layer in which the best subset of selected features is for use as inputs in the optimized FFNN. We compared these frameworks using a comparative study. Our results show that, under all conditions, the third framework is superior to the others with an average accuracy of 100%, whereas the first and second frameworks achieved 94.97% and 93.12% accuracy, respectively.