INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND MATHEMATICAL THEORY (IJCSMT )
E-ISSN 2545-5699
P-ISSN 2695-1924
VOL. 10 NO. 3 2024
DOI: 10.56201/ijcsmt.v10.no3.2024.pg173.186
Daniel Edem Thompson
In middle-income countries as well as advanced economies of the world, one of the leading health problems is heart disease. Heart disease, otherwise known as cardiovascular disease, is a category of diseases and disorders usually characterized by abnormalities of the heart and the blood vessels, including coronary artery disease, heart failure, arrhythmias, and congenital heart defects. This study is very important as it aims to fill a knowledge gap and tackle the special challenges that primary healthcare systems in areas with few resources face leading to improved health results and more effective healthcare resource usage. In this study, we attempt to assess the efficacy of various machine learning (ML) methods for early heart diseases detection in Primary healthcare. We tested some ML algorithms – logistic regression, random forest, support vector machines (SVMs), K-nearest neighbors, CatBoost and XGBoost (XGB) using medical records from 2022 Primary Healthcare Facilities in Southern Nigeria. Each model showed different strengths in accuracy, precision, recall, and F1 score with logistic regression model achieving an overall accuracy of 85.61%. The Synthetic Minority Over- Sampling Technique (SMOTE) enabled us to mitigate class imbalance which boosted recall from 0.04 to 0.6016 and also balanced the F1 score from 0.08 to 0.3356; thus accurately identifies heart disease cases while maintaining fewer false negatives. Age (0.570193), daily smokes (0.370494), and blood pressure (0.365896) topped the list of heart risk factors. Blood sugar (0.189080), heart rate (0.096976), BMI (0.052322), and cholesterol (0.047680) also play a part in predicting overall risk. We recommend adding ML tools into routine healthcare, supported by policies, community outreach, targeted interventions, and continuous research to manage heart disease worldwide.
Heart Disease, Machine Learning, Primary Healthcare, Feature Importance,
Chawla, N.V., Bowyer, K.W., Hall, L.O., & Kegelmeyer, W.P. (2002). SMOTE: Synthetic
Minority Over-sampling Technique. Journal of Artificial Intelligence Research, 16,
321-357.
Ekle, F., Bakpo, F., Udanor, C., & Eneh, A. (2023). A machine learning model and application
for heart disease prediction using prevalent risk factors in Nigeria. International
Journal of Mathematical Analysis and Modelling, 6(2).
He, H., & Garcia, E.A. (2009). Learning from imbalanced data. IEEE Transactions on
Knowledge and Data Engineering, 21(9), 1263-1284.
Kebede Kassaw, A., Yimer, A., Abey, W. et al. (2023). The application of machine learning
approaches to determine the predictors of anemia among under-five children in
Ethiopia. Scientific Reports, 13(22919).
Krittanawong, C., Zhang, H., Wang, Z., Aydar, M., & Kitai, T. (2017). Artificial intelligence
in precision cardiovascular medicine. Journal of the American College of Cardiology,
69(21), 2657-2664.
Miotto, R., Wang, F., Wang, S., Jiang, X., & Dudley, J.T. (2018). Deep learning for healthcare:
review, opportunities and challenges. Briefings in Bioinformatics, 19(6), 1236-1246.
Samuel, O., Zewotir, T., & North, D. (2024). Application of machine learning methods for
predicting under-five mortality: analysis of Nigerian demographic health survey 2018
dataset. BMC Medical Informatics and Decision Making, 24(86).
World Health Organization. (2021). Cardiovascular diseases (CVDs). Retrieved from
https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds).