CNN-Based Covid-19 Data Analysis and Case Prediction (original) (raw)
This project aims to propose a machine learning model to detect COVID-19 positive cases more precisely utilizing chest X-ray images. I have collected and merged all the publicly available chest X-ray datasets of COVID-19 infected patients from Kaggle and Github, and pre-processed it using random sampling approach. Then, I proposed and applied an enhanced convolutional neural network (CNN) model to this dataset and obtained a 94.03% accuracy, 95.52% AUC and 94.03% f-measure for detecting COVID-19 positive patients. I have also performed a comparative performance between our proposed CNN model with several state-of-the-art machine learning classifiers including support vector machine, random forest, k-nearest neighbor, logistic regression, gaussian naïve bayes, bernoulli naïve bayes, decision tree, Xgboost, multilayer perceptron, nearest centroid and perceptron as well as deep learning and pretrained models such as deep neural network, residual neural network, visual geometry group network 16, and inception network V3 were employed, where our model yielded outperforming results compared to all other models. While evaluating the performance of our models, we have emphasized on specificity along with accuracy to identify non-COVID-19 individuals more accurately, which may potentially facilitate the early detection of COVID-19 patients for their preliminary screening, especially in under-resourced health infrastructure with insufficient PCR testing systems and testing facilities. Moreover, this model could also be applicable to the cases of other lung infections.