Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach

Year : 2025 | Volume : 16 | Issue : 02 | Page : 16 35
    By

    Sushma Malik,

  • Anamika Rana,

  • Kanika,

  • Ashima,

  1. Associate Professor, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
  2. Assistant Professor, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
  3. Scholar, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
  4. Scholar, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India

Abstract

The increasing prevalence of chronic and life-threatening diseases highlights the need for innovative healthcare solutions that enable early detection and proactive management. The Multiple Disease Prediction Platform is a web-based system utilizing machine learning (ML) and deep learning (DL) algorithms to analyze user-inputted health data, generating real-time predictions of potential health risks. By leveraging Python’s Streamlit library, the platform provides an interactive and accessible diagnostic experience, eliminating the need for frequent clinical visits and enhancing remote healthcare accessibility. This research focuses on developing a supervised learning-based system trained on credible datasets (e.g., Kaggle) for disease prediction. Exploratory and descriptive research methods were employed, incorporating statistical analysis and data visualization to enhance accuracy. Despite its potential, the platform faces limitations, including data privacy concerns, regional biases, reliance on incomplete user data, and computational demands affecting cost and accessibility in resource-limited settings. The study has significant practical implications in advancing digital healthcare, enabling early disease risk assessment and improving decision making for individuals and healthcare professionals. Integrating bio-inspired algorithms further enhances predictive performance by optimizing model efficiency. While deep learning improves complex pattern recognition, ongoing research is essential to refine these models for greater accuracy, scalability, and reliability. Addressing ethical artificial intelligence challenges, model biases, and computational efficiency is crucial for ensuring broad applicability and long-term sustainability.

Keywords: Machine learning, deep learning, disease prediction, digital healthcare, bio-inspired algorithms, supervised learning, data privacy, predictive analytics, health informatics, remote healthcare

[This article belongs to Journal of Computer Technology & Applications ]

How to cite this article:
Sushma Malik, Anamika Rana, Kanika, Ashima. Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach. Journal of Computer Technology & Applications. 2025; 16(02):16-35.
How to cite this URL:
Sushma Malik, Anamika Rana, Kanika, Ashima. Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach. Journal of Computer Technology & Applications. 2025; 16(02):16-35. Available from: https://journals.stmjournals.com/jocta/article=2025/view=208714


References

  1. Shah D, Patel S, Bharti SK. Heart disease prediction using machine learning techniques. SN Computer Sci. 2020; 1 (6): 345.
  2. Arumugam K, Naved M, Shinde PP, Leiva-Chauca O, Huaman-Osorio A, Gonzales-Yanac T. Multiple disease prediction using machine learning algorithms. Mater Today Proc. 2023; 80: 3682–3685.
  3. Nilashi M, bin Ibrahim O, Ahmadi H, Shahmoradi L. An analytical method for diseases prediction using machine learning techniques. Computers Chem Eng. 2017; 106: 212–223.
  4. Mujumdar A, Vaidehi V. Diabetes prediction using machine learning algorithms. Procedia Computer Sci. 2019; 165: 292–299.
  5. Shameer K, Johnson KW, Glicksberg BS, Dudley JT, Sengupta PP. Machine learning in cardiovascular medicine: are we there yet? Heart. 2018; 104 (14): 1156–1164.
  6. Pingale K, Surwase S, Kulkarni V, Sarage S, Karve A. Disease prediction using machine learning. Int R J Eng Technol. 2019; 6 (12): 831–833.
  7. Khalilia M, Chakraborty S, Popescu M. Predicting disease risks from highly imbalanced data using random forest. BMC Med Informatics Decis Making. 2011; 11: 1–3.
  8. Deepthi Y, Kalyan KP, Vyas M, Radhika K, Babu DK, Krishna Rao NV. Disease prediction based on symptoms using machine learning. In: Sikander A, Acharjee D, Chanda CK, Mondal PK, Verma P, editors. Energy Systems, Drives and Automations: Proceedings of ESDA 2019. Singapore: Springer; 2020. pp. 561–569.
  9. Kohli PS, Arora S. Application of machine learning in disease prediction. In: 2018 4th International Conference on Computing Communication and Automation (ICCCA), Greater Noida, India, December 14–15, 2018. pp. 1–4.
  10. Patil S, Jaybhaye S, Bokariya S, Jain P, Phapale S, Hande T. Parkinson’s disease prediction system in machine learning. ITM Web Conf. 2023; 56: 05002.
  11. Kaptoge S, Pennells L, De Bacquer D, Cooney MT, Kavousi M, Stevens G, Riley LM, Savin S, Khan T, Altay S, Amouyel P. World Health Organization cardiovascular disease risk charts: revised models to estimate risk in 21 global regions. Lancet Global Health. 2019; 7 (10): e1332–e1345.
  12. Ulianova S. Cardiovascular disease dataset. [Online]. Kaggle.com. 2019. Available at https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset
  13. Akturk M. Diabetes dataset. [Online]. Kaggle.com. 2020. Available at https://www.kaggle.com/datasets/mathchi/diabetes-data-set
  14. Mayo Clinic. Parkinson’s disease – symptoms and causes. [Online]. Mayo Clinic. 2025. Available at https://www.mayoclinic.org/diseases-conditions/parkinsons-disease/symptoms-causes/syc-20376055
  15. Ukani V. Parkinson’s disease dataset. [Online]. Kaggle.com. 2019. Available at https://www.kaggle.com/datasets/vikasukani/parkinsons-disease-data-set
  16. Islam MA, Majumder MZ, Hussein MA. Chronic kidney disease prediction based on machine learning algorithms. J Pathol Informatics. 2023; 14: 100189.
  17. University of California, Irvine. UCI machine learning repository. [Online]. Uci.edu. 2015. Available at https://archive.ics.uci.edu/dataset/336/chronic+kidney+disease
  18. Jović A, Brkić K, Bogunović N. A review of feature selection methods with applications. In: 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia, May 25–29, 2015. pp. 1200–1205.
  19. Sarker IH.. Machine learning: algorithms, real-world applications and research directions. SN Computer Sci. 2021; 2 (3): 160.
  20. LeCessie S, Van Houwelingen JC. Ridge estimators in logistic regression. J R Stat Soc Ser C (Appl Stat). 1992; 41 (1): 191–201.
  21. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011; 12: 2825–2830.
  22. Steinbach M, Tan PN. kNN: k-nearest neighbors. In: The Top Ten Algorithms in Data Mining 2009. London, UK: Chapman & Hall/CRC Press; 2009. pp. 165–176.
  23. IBM. Support vector machine. [Online]. Ibm.com. 2023. Available at https://www.ibm.com/think/topics/support-vector-machine
  24. Podgorelec V, Kokol P, Stiglic B, Rozman I. Decision trees: an overview and their use in medicine. J Med Syst. 2002; 26: 445–463.
  25. Mahesh B. Machine learning algorithms – a review. Int J Sci Res. 2020; 9 (1): 381–386.
  26. Chiu MH, Yu YR, Liaw HL, Chun-Hao L. The use of facial micro-expression state and tree-forest model for predicting conceptual-conflict based conceptual change. In: Lavonen J, Juuti K, Lampiselkä J, Uitto A, Hahl K, editors. Science Education Research: Engaging Learners for a Sustainable Future (ESERA Eproceeding). Helsinki, Finland: ESERA; 2015.,
  27. Han J, Pei J, Tong H. Data Mining: Concepts and Techniques. San Francisco, CA, USA: Morgan Kaufmann; 2022.
  28. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011; 12: 2825–2830.
  29. Wang W, Chakraborty G, Chakraborty B. Predicting the risk of chronic kidney disease (CKD) using machine learning algorithm. Appl Sci. 2020; 11 (1): 202. Available at https://www.researchgate.net/publication/348025909/figure/fig2/AS:1020217916416002@1620250314481/Simplified-structure-of-XGBoost.ppm
  30. Anshul. Guide on AdaBoost algorithm. [Online]. Analytics Vidhya. 2021. Available at https://www.analyticsvidhya.com/blog/2021/09/adaboost-algorithm-a-complete-guide-for-beginners/

Regular Issue Subscription Review Article
Volume 16
Issue 02
Received 01/03/2025
Accepted 08/03/2025
Published 15/04/2025
Publication Time 45 Days


Login


My IP

PlumX Metrics