Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach

Notice

This is an unedited manuscript accepted for publication and provided as an Article in Press for early access at the author’s request. The article will undergo copyediting, typesetting, and galley proof review before final publication. Please be aware that errors may be identified during production that could affect the content. All legal disclaimers of the journal apply.

Year : 2025 | Volume : 16 | Issue : 02 | Page : –
    By

    Anamika Rana,

  • Sushma Malik,

  • Kanika,

  • Ashima,

  1. Associate Professor, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
  2. Assistant Professor, Professor, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
  3. Scholar, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
  4. Scholar, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India

Abstract

The increasing prevalence of chronic and life-threatening diseases highlights the need for innovative healthcare solutions that enable early detection and proactive management. The Multiple Disease Prediction Platform is a web-based system utilizing Machine Learning (ML) and Deep Learning (DL) algorithms to analyze user-inputted health data, generating real-time predictions of potential health risks. By leveraging Python’s Stream lit library, the platform provides an interactive and accessible diagnostic experience, eliminating the need for frequent clinical visits and enhancing remote healthcare accessibility. This research focuses on developing a supervised learning-based system trained on credible datasets (e.g., Kaggle) for disease prediction. Exploratory and descriptive research methods were employed, incorporating statistical analysis and data visualization to enhance accuracy. Despite its potential, the platform faces limitations, including data privacy concerns, regional biases, reliance on incomplete user data, and computational demands affecting cost and accessibility in resource- limited settings. The study has significant practical implications in advancing digital healthcare, enabling early disease risk assessment and improving decision-making for individuals and healthcare professionals. Integrating bio-inspired algorithms further enhances predictive performance by optimizing model efficiency. While deep learning improves complex pattern recognition, ongoing research is essential to refine these models for greater accuracy, scalability, and reliability. Addressing ethical AI challenges, model biases, and computational efficiency is crucial for ensuring broad applicability and long-term sustainability.

Keywords: Machine learning, deep learning, disease prediction, digital healthcare, bio-inspired algorithms, supervised learning, data privacy, predictive analytics, health informatics, remote healthcare

[This article belongs to Journal of Computer Technology & Applications ]

How to cite this article:
Anamika Rana, Sushma Malik, Kanika, Ashima. Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach. Journal of Computer Technology & Applications. 2025; 16(02):-.
How to cite this URL:
Anamika Rana, Sushma Malik, Kanika, Ashima. Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach. Journal of Computer Technology & Applications. 2025; 16(02):-. Available from: https://journals.stmjournals.com/jocta/article=2025/view=208004



References

  1. Shah D, Patel S, Bharti SK. Heart disease prediction using machine learning techniques. SN Computer Science. 2020 Nov;1(6):345.
  2. Arumugam K, Naved M, Shinde PP, Leiva-Chauca O, Huaman-Osorio A, Gonzales-Yanac T. Multiple disease prediction using Machine learning algorithms. Materials today: proceedings. 2023 Jan 1;80:3682–5.
  3. Nilashi M, bin Ibrahim O, Ahmadi H, Shahmoradi L. An analytical method for diseases prediction using machine learning techniques. Computers & Chemical Engineering. 2017 Nov 2;106:212–23.
  4. Mujumdar A, Vaidehi V. Diabetes prediction using machine learning algorithms. Procedia Computer Science. 2019 Jan 1;165:292–9.
  5. Shameer K, Johnson KW, Glicksberg BS, Dudley JT, Sengupta PP. Machine learning in cardiovascular medicine: are we there yet?. Heart. 2018 Jul 1;104(14):1156–64.
  6. Pingale K, Surwase S, Kulkarni V, Sarage S, Karve A. Disease prediction using machine learning. International Research Journal of Engineering and Technology (IRJET). 2019 Dec;6(12):831–3.
  7. Khalilia M, Chakraborty S, Popescu M. Predicting disease risks from highly imbalanced data using random forest. BMC medical informatics and decision making. 2011 Dec;11:1–3.
  8. Deepthi Y, Kalyan KP, Vyas M, Radhika K, Babu DK, Krishna Rao NV. Disease prediction based on symptoms using machine learning. InEnergy Systems, Drives and Automations: Proceedings of ESDA 2019 2020 Sep 1 (pp. 561–569). Singapore: Springer Singapore.
  9. Kohli PS, Arora S. Application of machine learning in disease prediction. In2018 4th International conference on computing communication and automation (ICCCA) 2018 Dec 14 (pp. 1–4). IEEE.
  10. Patil S, Jaybhaye S, Bokariya S, Jain P, Phapale S, Hande T. Parkinson’s disease prediction system in machine learning. InITM Web of Conferences 2023 (Vol. 56, p. 05002). EDP Sciences.
  11. Kaptoge S, Pennells L, De Bacquer D, Cooney MT, Kavousi M, Stevens G, Riley LM, Savin S, Khan T, Altay S, Amouyel P. World Health Organization cardiovascular disease risk charts: revised models to estimate risk in 21 global regions. The Lancet global health. 2019 Oct 1;7(10):e1332–45.
  12. Ulianova S. Cardiovascular Disease dataset Kaggle.com. 2019. Available from: https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset
  13. Mehmet Akturk. Diabetes Dataset. Kaggle.com. 2020. Available from: https://www.kaggle.com/datasets/mathchi/diabetes-data-set
  14. Parkinson’s disease – Symptoms and causes. Mayo Clinic. 2025. Available from: https://www.mayoclinic.org/diseases-conditions/parkinsons-disease/symptoms-causes/syc- 20376055
  15. Ukani V. Parkinson’s Disease Data Set. Kaggle.com. 2019. Available from: https://www.kaggle.com/datasets/vikasukani/parkinsons-disease-data-set
  16. Islam MA, Majumder MZ, Hussein MA. Chronic kidney disease prediction based on machine learning algorithms. Journal of pathology informatics. 2023 Jan 1;14:100189.
  17. UCI Machine Learning Repository. Uci.edu. 2015. Available from: https://archive.ics.uci.edu/dataset/336/chronic+kidney+disease
  18. Jović A, Brkić K, Bogunović . A review of feature selection methods with applications. n2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO) 2015 May 25 (pp. 1200–1205). Ieee.
  19. Sarker, I. H. (2021). Machine learning: Algorithms, real-world applications and research directions. SN computer science, 2(3), 160.
  20. LeCessie S, Van Houwelingen JC. Ridge estimators in logistic regression. J R Stat Soc Ser C (Appl Stat). 1992;41(1):191–201.
  21. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.
  22. Steinbach M, Tan PN. kNN: k-nearest neighbors. InThe top ten algorithms in data mining 2009 Apr 9 (pp. 165–176). Chapman and Hall/CRC.
  23. IBM. Support Vector Machine. Ibm.com. 2023. Available from: https://www.ibm.com/think/topics/support-vector-machine
  24. Podgorelec V, Kokol P, Stiglic B, Rozman I. Decision trees: an overview and their use in medicine. Journal of medical systems. 2002 Oct;26:445–63.

Regular Issue Subscription Review Article
Volume 16
Issue 02
Received 01/03/2025
Accepted 08/03/2025
Published 15/04/2025
Publication Time 45 Days


My IP

PlumX Metrics