Data Handling Algorithms for the Healthcare System for the Prediction of Diabetes in Health Data Science (HDS): A Review Report

Year : 2024 | Volume : 11 | Issue : 02 | Page : 1 10
    By

    Vinay Bhatt,

  • Mayank Kumar,

  1. Research Scholar, Department of Computer Science and Engineering, Asian International University, Imphal West, Manipur, India
  2. Associate professor, Department of Computer Science and Engineering, Asian International University, Imphal West, Manipur, India

Abstract

In recent years, diabetes has become the biggest disease in different countries around the world. This disease is caused by adulteration in food ingredients, unhealthy food habits, a lack of physical exercise, and changing the lifestyle every time without a routine chart. The main objective of this review paper is to provide a proper understanding of the machine learning algorithm used in the healthcare system to handle diabetic patients’ data. We use a previous research article for the idea of machine learning (ML) algorithm implementation in diabetic prediction to find more accuracy. The handling of diabetes patients’ data is very difficult in the healthcare system, so use the modern technology of computer science as data science. Data science technology is used for handling data in healthcare systems, so the term is introduced as health data science (HDS). In this survey paper, we review different data handling algorithms as ML algorithms for handling diabetic patient data. This paper presents diabetes prediction based on previous research, which is discussed in the literature review section of this paper. In previous work, different ML algorithms such as Support vector machine (SVM), K-nearest neighbor (KNN), Logistic regression (LR), Random forest (RF), Decision tree (DT), Deep neural network (DNN), and Naïve Bayesian classifiers were used for handling the diabetes data, but different challenges faced researchers, so we focused on the challenges of previous research works. In this review paper, we focus on the research challenges of previous research and set research goals on behalf of research gaps for the next research directions. The future work of this paper is to analyze different data handling algorithms for the prediction of diabetes in different cases.

Keywords: Data science, health data science (HDS) machine learning (ML), supervised learning algorithm, data handling algorithms, Support vector machine (SVM), K-nearest neighbor (KNN), Random forest (RF), Decision tree (DT), DNN, healthcare system, diabetic prediction

[This article belongs to Journal of Artificial Intelligence Research & Advances ]

How to cite this article:
Vinay Bhatt, Mayank Kumar. Data Handling Algorithms for the Healthcare System for the Prediction of Diabetes in Health Data Science (HDS): A Review Report. Journal of Artificial Intelligence Research & Advances. 2024; 11(02):1-10.
How to cite this URL:
Vinay Bhatt, Mayank Kumar. Data Handling Algorithms for the Healthcare System for the Prediction of Diabetes in Health Data Science (HDS): A Review Report. Journal of Artificial Intelligence Research & Advances. 2024; 11(02):1-10. Available from: https://journals.stmjournals.com/joaira/article=2024/view=155828


References

  1. Gharaibeh M, Alzu’bi D, Abdullah M, Hmeidi I, Al Nasar MR, Abualigah L, Gandomi AH. Radiology imaging scans for early diagnosis of kidney tumors: A review of data analytics-based machine learning and deep learning approaches. Big Data Cogn Comput. 2022;6:29. DOI: 10.3390/bdcc6010029.
  2. Akpan IJ, Aguolu OG. The application of data science techniques and algorithms in women’s health studies. medRxiv. 2022.
  3. Sanchez-Pinto LN, Luo Y, Churpek MM. Big data and data science in critical care. Chest. 2018;154:1239–1248. DOI: 10.1016/j.chest.2018.04.037, PubMed: 29752973.
  4. Khan FA, Zeb K, Al-Rakhami M, Derhab A, Bukhari SAC. Detection and prediction of diabetes using data mining: A comprehensive review. IEEE Access. 2021;9:43711–43735. DOI: 10.1109/ACCESS.2021.3059343.
  5. Sisodia D, Sisodia DS. Prediction of diabetes using classification algorithms. Procedia Comput Sci. 2018;132:1578–1585. DOI: 10.1016/j.procs.2018.05.122.
  6. Kaur G, Chhabra A. Improved J48 Classification Algorithm for the Prediction of Diabetes. Int J Comput Appl. 2014;98:13–17. DOI: 10.5120/17314-7433.
  7. Selvin E, Steffes MW, Gregg E, Brancati FL, Coresh J. Performance of A1C for the classification and prediction of diabetes. Diabetes Care. 2011;34:84–89. DOI: 10.2337/dc10-1235, PubMed: 20855549.
  8. Alam TM, Iqbal MA, Ali Y, Wahab A, Ijaz S, Baig TI, Hussain A, Malik MA, Raza MM, Ibrar S, Abbas Z. A model for early prediction of diabetes. Inform Med Unlocked. 2019;16:100204. DOI: 10.1016/j.imu.2019.100204.
  9. Srivastava S, Sharma L, Sharma V, Kumar A, Darbari H. Prediction of diabetes using artificial neural network approach. In: Engineering Vibration, Communication and Information Processing: ICoEVCI 2018. Springer: Singapore; 2019. pp. 679–687. DOI: 10.1007/978-981-13-1642-5_59.
  10. Sharma A, Mishra PK. Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. Int J Inf Technol. 2021;1–4.
  11. Sharma A, Mishra PK. State-of-the-art in performance metrics and future directions for data science algorithms. J Sci Res. 2020;64:221–238. DOI: 10.37398/JSR.2020.640232.
  12. Kakoly IJ, Hoque MR, Hasan N. Data-driven diabetes risk factor prediction using machine learning algorithms with feature selection technique. Sustainability. 2023;15:4930. DOI: 10.3390/su15064930.
  13. Arumugam K, Naved M, Shinde PP, Leiva-Chauca O, Huaman-Osorio A, Gonzales-Yanac T. Multiple disease prediction using machine learning algorithms. Mater Today: Proc. 2023;80:3682–3685. DOI: 10.1016/j.matpr.2021.07.361.
  14. Febrian ME, Ferdinan FX, Sendani GP, Suryanigrum KM, Yunanda R. Diabetes prediction using supervised machine learning. Proc Comput Sci. 2023;216:21–30. DOI: 10.1016/j.procs.2022.12.
  15. Kulkarni AR, Patel AA, Pipal KV, Jaiswal SG, Jaisinghani MT, Thulkar V, Gajbhiye L, Gondane P, Patel AB, Mamtani M, Kulkarni H. Machine-learning algorithm to non-invasively detect diabetes and pre-diabetes from electrocardiogram. BMJ Innovations. 2023;9:32–42. DOI: 10.1136/bmjinnov-2021-000759.
  16. Tasin I, Nabil TU, Islam S, Khan R. Diabetes prediction using machine learning and explainable AI techniques. Healthcare Technol Lett. 2023;10:1–10. DOI: 10.1049/htl2.12039, PubMed: 37077883.
  17. Chou CY, Hsu DY, Chou CH. Predicting the onset of diabetes with machine learning methods. J Pers Med. 2023;13:406. DOI: 10.3390/jpm13030406, PubMed: 36983587.
  18. Hama Saeed MA. Diabetes type 2 classification using machine learning algorithms with up-sampling technique. J Electr Syst Inf Technol. 2023;10:1–10.
  19. Alwakid G, Gouda W, Humayun M. Deep learning-based prediction of diabetic retinopathy using CLAHE and ESRGAN for enhancement. Inhealthcare. 2023;11. DOI: 10.3390/healthcare11060863, PubMed: 36981520.
  20. Tan KR, Seng JJB, Kwan YH, Chen YJ, Zainudin SB, Loh DHF, Liu N, Low LL. Evaluation of machine learning methods developed for prediction of diabetes complications: A systematic review. J Diabetes Sci Technol. 2023;17:474–489. DOI: 10.1177/19322968211056917, PubMed: 34727783.
  21. Rastogi R, Bansal M. Diabetes prediction model using data mining techniques. Meas Sensors. 2023;25:100605. DOI: 10.1016/j.measen.2022.100605.
  22. Kee OT, Harun H, Mustafa N, Abdul Murad NA, Chin SF, Jaafar R, Abdullah N. Cardiovascular complications in a diabetes prediction model using machine learning: A systematic review. Cardiovasc Diabetol. 2023;22:13. DOI: 10.1186/s12933-023-01741-7, PubMed: 36658644.
  23. ElSayed NA, Aleppo G, Aroda VR, Bannuru RR, Brown FM, Bruemmer D, Collins BS, Gaglia JL, Hilliard ME, Isaacs D, Johnson EL, Kahan S, Khunti K, Leon J, Lyons SK, Perry ML, Prahalad P, Pratley RE, Seley JJ, Stanton RC, Gabbay RA. Classification and diagnosis of diabetes: Standards of care in diabetes—2023. Diabetes Care. 2023;46(Supplement 1):S19–S40. DOI: 10.2337/dc23-S002.
  24. Nti IK, Quarcoo JA, Aning J, Fosu GK. A mini-review of machine learning in big data analytics: Applications, challenges, and prospects. Big Data Min Analytics. 2022;5:81–97. DOI: 10.26599/BDMA.2021.9020028.
  25. Gourisaria MK, Jee G, Harshvardhan GM, Singh V, Singh PK, Workneh TC. Data science appositeness in diabetes mellitus diagnosis for healthcare systems of developing nations. IET Commun. 2022;16:532–547. DOI: 10.1049/cmu2.12338.
  26. Maier-Hein L, Eisenmann M, Sarikaya D, März K, Collins T, Malpani A, Fallert J, Feussner H, Giannarou S, Mascagni P, Nakawala H, Park A, Pugh C, Stoyanov D, Vedula SS, Cleary K, Fichtinger G, Forestier G, Gibaud B, Grantcharov T, Hashizume M, Heckmann-Nötzel D, Kenngott HG, Kikinis R, Mündermann L, Navab N, Onogur S, Roß T, Sznitman R, Taylor RH, Tizabi MD, Wagner M, Hager GD, Neumuth T, Padoy N, Collins J, Gockel I, Goedeke J, Hashimoto DA, Joyeux L, Lam K, Leff DR, Madani A, Marcus HJ, Meireles O, Seitel A, Teber D, Ückert F, Müller-Stich BP, Jannin P, Speidel S. Surgical data science – from concepts toward clinical translation. Med Image Anal. 2022;76:102306. DOI: 10.1016/j.media.2021.102306, PubMed: 34879287.
  27. Mohamed I, Fouda MM, Hosny KM. Machine learning algorithms for COPD patients readmission prediction: A data analytics approach. IEEE Access. 2022;10:15279–15287. DOI: 10.1109/ 2022.3148600.
  28. Krishnamoorthi R, Joshi S, Almarzouki HZ, Shukla PK, Rizwan A, Kalpana C, Tiwari B. A novel diabetes healthcare disease prediction framework using machine learning techniques. J Healthc Eng. 2022;2022:1684017. DOI: 10.1155/2022/1684017, PubMed: 35070225.
  29. Awotunde JB, Adeniyi AE, Ajagbe SA, González-Briones A. Natural computing and unsupervised learning methods in smart healthcare data-centric operations. In: Cognitive and Soft Computing Techniques for the Analysis of Healthcare Data. Cambridge, USA: Academic Press; 2022. pp. 165–190.
  30. Hassan M, Awan FM, Naz A, deAndrés-Galiana EJ, Alvarez O, Cernea A, Fernández-Brillet L, Fernández-Martínez JL, Kloczkowski A. Innovations in genomics and big data analytics for personalized medicine and health care: A review. Int J Mol Sci. 2022;23:4645. DOI: 10.3390/ijms23094645, PubMed: 35563034.
  31. Oyeleye M, Chen T, Titarenko S, Antoniou G. A predictive analysis of heart rates using machine learning techniques. Int J Environ Res Public Health. 2022;19:2417. DOI: 10.3390/ijerph19042417, PubMed: 35206603.
  32. Zubair M, Iqbal MD, Shil A, Chowdhury MJM, Moni MA, Sarker IH. An improved K-means clustering algorithm towards an efficient data-driven modeling. Ann Data Sci. 2022;1–3.
  33. Abdollahi J, Nouri-Moghaddam B. Hybrid stacked ensemble combined with genetic algorithms for diabetes prediction. Iran J Comput Sci. 2022;5:205–220. DOI: 10.1007/s42044-022-00100-1.
  34. Sann R, Lai PC, Liaw SY, Chen CT. Predicting online complaining behavior in the hospitality industry: Application of big data analytics to online reviews. Sustainability. 2022;14:1800. DOI: 10.3390/su14031800.
  35. Di Sotto SD, Viviani M. Health misinformation detection in the social web: An overview and a data science approach. Int J Environ Res Public Health. 2022;19:2173. DOI: 10.3390/ijerph19042173, PubMed: 35206359.
  36. Cerrato P, Halamka J, Pencina M. A proposal for developing a platform that evaluates algorithmic equity and accuracy. BMJ Health Care Inform. 2022;29. DOI: 10.1136/bmjhci-2021-100423, PubMed: 35410952.

Regular Issue Subscription Review Article
Volume 11
Issue 02
Received 31/01/2024
Accepted 06/02/2024
Published 20/06/2024


Login


My IP

PlumX Metrics