IJERT-Diabetes Prediction using Machine Learning Techniques (original) (raw)

Prediction Model for Diabetes Mellitus Using Machine Learning Techniques

— In today " s world diabetes is the major health challenges in India. It is a group of a syndrome that results in too much sugar in the blood. It is a protracted condition that affects the way the body mechanizes the blood sugar. Prevention and prediction of diabetes mellitus is increasingly gaining interest in medical sciences. The aim is how to predict at an early stage of diabetes using different machine learning techniques. In this paper basically, we use well-known classification that are Decision tree, K-Nearest Neighbors, Support Vector Machine, and Random forest. These classification techniques used with Pima Indians diabetes dataset. Therefore, we predict diabetes at different stage and analyze the performance of different classification techniques. We Also proposed a conceptual model for the prediction of diabetes mellitus using different machine learning techniques. In this paper we also compare the accuracy of the different machine learning techniques to finding the diabetes mellitus at early stage.

Advanced Diabetes Prediction using Supervised Machine Learning Technique: Random Forest

Tropical Journal of Applied Natural Science, 2024

Diabetes remains one of the major causes of untimely death globally. Over 11% of the global population is diabetic, possibly due to late disease detection, inadequate interventions, and lifestyle choices etc. The growing severity of diabetes is driving scientific interest in leveraging Digital Health Technologies (DHTs) for improved management and treatment. Early diagnosis of diabetes is essential for effective interventions, reducing complications, and lowering the mortality rate associated with the disease. Thus, this study focuses on prediction of diabetes using supervised machine learning technique, specifically Random Forest Algorithm (RFA) for timely detection and prevention of the disease. The model was trained using Pima Indian dataset (diabetes), which is freely available on Kaggle database. Trial result indicate that the model was promising, with an accuracy of 92%, 89% precision, 88% recall, and a 90% F1-score. The study shows that applying the Random Forest algorithm significantly improves the accuracy and efficiency of early diabetes detection and diagnosis. However, in spite of the prospects of ML models in diabetes management, there are still concerns about its drawbacks including algorithmic bias, legal and ethical issues, and implementation challenges in clinical environment. Thus, we recommend that legal framework should be put in place to guide the use of ML algorithms, and other digital health technologies in clinical diabetes care delivery.

Review on Diabetes mellitus prediction using machine learning

altinbas univeersity, 2023

Predictive analytics on large datasets is a common application of several machine learning methods. The application of predictive analytics in healthcare is complex, but it has the potential to improve the quality-of-care patients receive by allowing doctors to make more informed decisions, faster, based on information gleaned from large datasets. Six different machine learning algorithms are used to discuss predictive analytics in healthcare in this paper. A dataset consisting of medical records from actual patients is obtained and subjected to six distinct machine learning algorithms for the sake of this experiment. Algorithms are compared in terms of their performance and accuracy. In this review, we compare several machine learning methods to determine which one is most effective for diabetes prediction. The purpose of this review paper is to aid medical professionals in making accurate diabetes risk assessments using machine learning methods. The methods for machine learning are dissected in detail in this review study. Some examples of these classes are logistic regression, support vector machines with a linear or nonlinear kernel, a random forest, a decision tree, an adaptive boosting classifier, a K-nearest neighbor, and a naive bayes.

Prediction of Diabetes Mellitus using Machine Learning Algorithms: Comparative Analysis of K-Nearest Neighbor, Random Forest and Logistic Regression

G.C Ogwume , 2023

Diabetes Mellitus is a chronic and one of the deadliest diseases. Diabetes disease increases the risk of long-term complications, including heart diseases and kidney failures, among others. Undoubtedly, Diabetes Mellitus patients may live longer and lead healthier lives if the disease is detected early. Over the years, several efforts have been on more accurate and early detection procedures to safe patients of Diabetes Mellitus. Interestingly, with the applications of Information Technology to the disease diagnoses and therapy managements, more attention has been on using machine learning in the predictions and early detection of Diabetes Mellitus. Unfortunately, determining the most appropriate machine learning algorithm with the best performance in terms of optimum accuracy still remains a challenge. The study proposes a framework for Diabetes Mellitus detection using Machine Learning Algorithms. The proposed framework was evaluated using K-nearest neighbor (KNN), Random Forest (RF), and Logistic Regression (LR). Extensive experiments were conducted to analyze the performance of the framework focusing on four distinct different clinical datasets. To ensure robust, web compatible framework, Python and its popular data science related packages, Pandas, Numpy, Seaborn, Matplotlib and Pickle were used for the implementation. Significantly, using the standard datasets obtained from the National Institute of Diabetes and Kidney Disease, Random Forest was able to predict Diabetes Mellitus in the datasets with the best accuracy of 93.4 %.

PREDICTION OF DIABETES MELLITUS THROUGH MACHINE LEARNING

Diabetes Mellitus (or) Diabetes, it is a chronic disease and an increasing health problem. Diabetes is a disease that has the potential to cause a worldwide health crisis. The main cause of the type 2 diabetes is due to the absence of insulin. When pancreas is unable to produce insulin, the body develops type 2 diabetes. There are also other contributing environmental factors for the cause of diabetes such as inactive and overweight. Due to the rapid growth potentiality of the disease, it is our responsibility to decrease the range of it. In the current project, we propose to involve experts from various fields to generate data and perform interdisciplinary studies to obtain knowledge about Insulin Resistance. The aim of this project is to develop a model that performs the prediction of diabetes with better accuracy by combining the results of four supervised machine learning techniques including: Decision tree, Random forest, ANN, Logistic regression. Dataset we used in this project is "Diabetes" which is taken from Kaggle.

Prediction of Diabetes Mellitus Using Machine Learning Techniques

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING AND TECHNOLOGY

Diabetes mellitus is a common disease caused by a set of metabolic ailments where the sugar stages over drawn-out period is very high. It touches diverse organs of the human body which therefore harm a huge number of the body's system, in precise the blood strains and nerves. Early prediction in such disease can be exact and save human life. To achieve the goal, this research work mainly discovers numerous factors associated to this disease using machine learning techniques. Machine learning methods provide effectual outcome to extract knowledge by building predicting models from diagnostic medical datasets together from the diabetic patients. Quarrying knowledge from such data can be valuable to predict diabetic patients. In this research, six popular used machine learning techniques, namely Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are compared in order to get outstanding machine learning techniques to forecast diabetic mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved higher accuracy compared to other machine learning techniques.

Diabetes Prediction Using Machine Learning

International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 2020

Diabetes is a chronic disease with the potential to cause a worldwide health care crisis. According to International Diabetes Federation 382 million people are living with diabetes across the whole world. By 2035, this will be doubled as 592 million. Diabetes is a disease caused due to the increase level of blood glucose. This high blood glucose produces the symptoms of frequent urination, increased thirst, and increased hunger. Diabetes is a one of the leading cause of blindness, kidney failure, amputations, heart failure and stroke. When we eat, our body turns food into sugars, or glucose. At that point, our pancreas is supposed to release insulin. Insulin serves as a key to open our cells, to allow the glucose to enter and allow us to use the glucose for energy. But with diabetes, this system does not work. Type 1 and type 2 diabetes are the most common forms of the disease, but there are also other kinds, such as gestational diabetes, which occurs during pregnancy, as well as other forms. Machine learning is an emerging scientific field in data science dealing with the ways in which machines learn from experience. The aim of this project is to develop a system which can perform early prediction of diabetes for a patient with a higher accuracy by combining the results of different machine learning techniques. The algorithms like K nearest neighbour, Logistic Regression, Random forest, Support vector machine and Decision tree are used. The accuracy of the model using each of the algorithms is calculated. Then the one with a good accuracy is taken as the model for predicting the diabetes.

Machine Learning Approach For Diabetes Prediction

International Journal of Information Systems and Computer Sciences, 2019

Diabetes the silent killer which kills part by part of our life Diabetes can strike anyone, from any walk of life. It is not only a disease but also a creator of different kinds of diseases like heart attack, blindness, kidney diseases etc. It is found that in last decades the cases of people living with diabetes jumped almost 350 million worldwide. Early detection of diabetes can reduce the health risk in patients. With rise of new technology like machine learning we have a solution to this issue, a system for predicting the chance of diabetes. Machine learning techniques increase medical diagnosis accuracy and reduce medical cost. This paper aims to predict diabetes via supervised machine learning algorithms such as decision tree. In our work we have used the Pima Indians Diabetes Dataset from UCI Machine Learning Repository.

Utilising Machine Learning Algorithms for Predictive Analysis of Diabetes

Diabetes is a category of metabolic illnesses characterised by persistently elevated blood sugar levels. Frequent urination, increased thirst, and increased appetite are all signs of elevated blood sugar. Diabetes can lead to a wide range of repercussions if ignored. Hyperosmolar hyperglycemia, diabetic ketoacidosis, and even mortality are examples of acute complications. With 537 million sufferers worldwide, diabetes is both the deadliest and the most prevalent noncommunicable illness. Diabetes may be brought on by several causes, including being overweight, having high cholesterol, having a family history, being physically inactive, having improper eating habits, etc. One of this disease's most prevalent symptoms is increased urination-those who have diabetes. Big data analytics are very important in the healthcare industry. The size of databases in the healthcare industry is enormous. By employing big data analytics to analyze massive datasets, one may find hidden patterns and information, learn from the data, and make precise predictions. The present strategy has poor classification and prediction accuracy. In this work, we proposed a diabetes prediction model that incorporates a few extrinsic factors that contribute to the development of diabetes together with other widely used indicators like blood sugar, pregnancies, body mass index (BMI), age, skin thickness, outcome diabetes pedigree function etc. The new dataset enhances classification accuracy over the previous one. To improve classification accuracy, a pipeline model for diabetes prediction has also been implemented. Among all the algorithms that have been used such as Logistic Regression, Ada boost, Decision tree classifier, K Neighbor classifier and Random Forest Classifier, the best accuracy is in Random Forest. Estimates of variable significance, or neural networks, are presented via random forests. They also provide a better way to deal with missing data. The variable that appears the most frequently in a given node fills in the gaps left by missing values. Random forests offer the best accuracy of all the categorisation techniques that are currently available. The random forest method can also handle large amounts of data with hundreds of variables. When a class in the data is less frequent than other classes, it can automatically balance data sets. The approach is appropriate for challenging assignments since it handles variables quickly. To further discuss previous research, the paradigm used in this study will be used for ensemble and hybridization Machine Learning. In the future, a more thorough comparison analysis between different datasets and their features may be carried out to pinpoint all the essential characteristics for predicting diabetes. The proposed work has achieved a more thorough comparative analysis between different datasets and their features that may be carried o u t t o pinpoint all the essential characteristics for predicting diabetes. The best and most accurate diabetes prediction algorithm may be found by comparing a wide range of algorithms and algorithm combinations.

An Effective Machine Learning Approach for Diabetes Prediction

International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 2023

Diabetes is a chronic condition that could lead to a global health care disaster. 382 million people worldwide have diabetes, according to the International Diabetes Federation. This will double to 592 million by 2035. Diabetes is a condition brought on by elevated blood glucose levels. The symptoms of this elevated blood sugar level include frequent urination, increased thirst, and increased hunger. One of the main causes of stroke, kidney failure, heart failure, amputations, blindness, and kidney failure is diabetes. Our bodies convert food into sugars, such as glucose, when we eat. Our pancreas is then expected to release insulin. Insulin acts as a key to unlock our cells, allowing glucose to enter and be used by us as fuel. However, this mechanism does not function in diabetes. The most prevalent forms of the disease are type 1 and type 2, but there are other varieties as well, including gestational diabetes, which develops during pregnancy. Data science has an emerging topic called machine learning that studies how machines learn from experience. The goal of this study is to create a system that, by fusing the findings of several machine learning approaches, can more accurately conduct early diabetes prediction for a patient. K closest neighbour, Logistic Regression, Random Forest, Support Vector Machine, and Decision Tree are some of the techniques employed. Each algorithm's accuracy is calculated along with the model's accuracy. The model for predicting diabetes is then chosen from those with good accuracy.