Diabetes-Prediction-using-ensemble-techniques

This study aims to improve the accuracy of diabetes mellitus prediction using machine learning, in particular the ensemble methods Stacking, Hard Voting, and Soft Voting. The base classifiers are AdaBoost, Logistic Regression, Random Forest, Gradient Boosting, Linear Discriminant Analysis, Extra Trees, and CatBoost. We use the Pima Indians Diabetes dataset, which records details of patients with and without diabetes, to build and evaluate each model before selecting the best ensemble. The best-performing model was the soft-voting ensemble. It showed high bias and low variance, however, which we addressed through calibration. The final model achieved an accuracy of 93.75%, precision of 95.24%, recall of 86.96%, and an F1 score of 90.91%. The study highlights the potential of machine learning for predicting diabetes and the importance of calibration for improving model performance.

CSE4068_DIABETES_PREDICTION_REPORT.pdf
HC Final.html
README.md

YuvashreeRchan/Diabetes-Prediction-using-ensemble-techniques
