Identification of Diabetes Mellitus Risk in Women using Random Forest
DOI:
https://doi.org/10.63017/jdsi.v3i1.95Keywords:
Diabetes, Algorithm, Classification, Women, Random ForestAbstract
Diabetes Mellitus (DM) is one of the chronic diseases that can cause various serious complications, especially in women. Early risk identification is an important step in preventing the progression of this disease. This study aims to identify the factors influencing the risk of diabetes in women by analyzing data from several parameters, namely the number of pregnancies, glucose level, blood pressure, skin thickness, insulin level, body mass index (BMI), diabetes pedigree function, and age. A quantitative approach was used in this study with descriptive and inferential statistical analysis methods. The research results show that glucose levels and BMI are the most significant factors in increasing the risk of diabetes, followed by family history of diabetes and age. In addition, the number of pregnancies also has an impact on the risk of diabetes, especially in women with a history of gestational diabetes. This research concludes that the combination of several parameters can be used to predict the risk of diabetes more accurately, especially in women. These results are expected to support early prevention efforts and better clinical decision-making in the management of diabetes.
References
E. Dritsas and M. Trigka, “Data-Driven Machine-Learning Methods for Diabetes Risk Prediction,” Sensors, vol. 22, no. 14, 2022, doi: 10.3390/s22145304.
Q. Zou, K. Qu, Y. Luo, D. Yin, Y. Ju, and H. Tang, “Predicting Diabetes Mellitus With Machine Learning Techniques,” Front. Genet., vol. 9, no. November, pp. 1–10, 2018, doi: 10.3389/fgene.2018.00515.
A. Tuppad and S. D. Patil, “Machine learning for diabetes clinical decision support: a review,” Adv. Comput. Intell., vol. 2, no. 2, pp. 1–24, 2022, doi: 10.1007/s43674-022-00034-y.
S. Yadu, R. Chandra, and V. K. Sinha, “Comparing Different Machine Learning Techniques in Predicting Diabetes on Early Stage,” p. 20, 2024, doi: 10.3390/engproc2024062020.
Y. Zhao et al., “Using Machine Learning Techniques to Develop Risk Prediction Models for the Risk of Incident Diabetic Retinopathy Among Patients With Type 2 Diabetes Mellitus: A Cohort Study,” Front. Endocrinol. (Lausanne)., vol. 13, no. May, pp. 1–8, 2022, doi: 10.3389/fendo.2022.876559.
P. D. Petridis, A. S. Kristo, and A. K. Sikalidis, “A Review on Trending Machine Learning Techniques for Type 2 Diabetes Mellitus Management,” pp. 1–24, 2024.
N. Ghaffar Nia, E. Kaplanoglu, and A. Nasab, “Evaluation of artificial intelligence techniques in disease diagnosis and prediction,” Discov. Artif. Intell., vol. 3, no. 1, 2023, doi: 10.1007/s44163-023-00049-5.
C. N. Noviyanti and A. Alamsyah, “Early Detection of Diabetes Using Random Forest Algorithm,” J. Inf. Syst. Explor. Res., vol. 2, no. 1, 2024, doi: 10.52465/joiser.v2i1.245.
R. Qureshi et al., “Artificial Intelligence and Biosensors in Healthcare and Its Clinical Relevance: A Review,” IEEE Access, vol. 11, pp. 61600–61620, 2023, doi: 10.1109/ACCESS.2023.3285596.
K. Wu, “Optimizing Diabetes Prediction with Machine Learning: Model Comparisons and Insights,” J. Sci. Technol., vol. 5, no. 4, pp. 41–51, 2024, doi: 10.55662/jst.2024.5403.
D. Lamba, W. H. Hsu, and M. Alsadhan, Predictive analytics and machine learning for medical informatics: A survey of tasks and techniques, no. February. 2021. doi: 10.1016/B978-0-12-821777-1.00023-9.
M. S. Reza, A. R. S. Fakir, M. R. Islam, A. M. T. Alom, T. Sen, and M. S. Islam, “Early Stage Diabetes Prediction Using Machine Learning Techniques,” 2023 26th Int. Conf. Comput. Inf. Technol. ICCIT 2023, pp. 1–21, 2023, doi: 10.1109/ICCIT60459.2023.10441427.
S. Ucha Putri, E. Irawan, F. Rizky, S. Tunas Bangsa, P. A. -Indonesia Jln Sudirman Blok No, and S. Utara, “Implementasi Data Mining Untuk Prediksi Penyakit Diabetes Dengan Algoritma C4.5,” Januari, vol. 2, no. 1, pp. 39–46, 2021.
A. Veronica Agustin and A. Voutama, “Implementasi Data Mining Klasifikasi Penyakit Diabetes Pada Perempuan Menggunakan Naïve Bayes,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 2, pp. 1002–1007, 2023, doi: 10.36040/jati.v7i2.6808.
C. Y. Chou, D. Y. Hsu, and C. H. Chou, “Predicting the Onset of Diabetes with Machine Learning Methods,” J. Pers. Med., vol. 13, no. 3, 2023, doi: 10.3390/jpm13030406.

Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Data Science Insights

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.