This is an unedited manuscript accepted for publication and provided as an Article in Press for early access at the author’s request. The article will undergo copyediting, typesetting, and galley proof review before final publication. Please be aware that errors may be identified during production that could affect the content. All legal disclaimers of the journal apply.
Anamika Rana
Sushma Malik
Kanika
Ashima
- Associate Professor, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
- Assistant Professor, Professor, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
- Scholar, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
- Scholar, Department of Computer Applications, Maharaja Surajmal Institute, Janakpuri, Delhi, India
Abstract
The increasing prevalence of chronic and life-threatening diseases highlights the need for healthcare solutions that enable early detection and proactive management. The Multiple Disease Prediction Platform is a web-based system that uses Machine Learning (ML) and Deep Learning (DL) algorithms to analyze user-entered health data and generate real-time predictions of potential health risks. Built with Python's Streamlit library, the platform provides an interactive and accessible diagnostic experience that reduces the need for frequent clinical visits and improves access to remote healthcare. This research focuses on developing a supervised learning-based system for disease prediction, trained on publicly available datasets (e.g., from Kaggle). Exploratory and descriptive research methods were employed, with statistical analysis and data visualization used to improve accuracy. The platform has limitations, including data privacy concerns, regional biases, reliance on incomplete user data, and computational demands that affect cost and accessibility in resource-limited settings. The study has practical implications for digital healthcare: it supports early disease risk assessment and better-informed decisions by individuals and healthcare professionals. Integrating bio-inspired algorithms further improves predictive performance by optimizing model efficiency. While deep learning improves recognition of complex patterns, ongoing research is needed to refine these models for greater accuracy, scalability, and reliability. Addressing ethical AI challenges, model biases, and computational efficiency is crucial for ensuring broad applicability and long-term sustainability.
Keywords: Machine learning, deep learning, disease prediction, digital healthcare, bio-inspired algorithms, supervised learning, data privacy, predictive analytics, health informatics, remote healthcare
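The supervised-learning workflow summarized in the abstract (train a classifier on labeled health records, then score a new user's input) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration rather than the authors' implementation: the abstract does not name the model, so a plain logistic regression trained by gradient descent is swapped in, the two features (`glucose`, `bmi`) are hypothetical, and the training data are synthetic rather than drawn from the Kaggle datasets the paper uses.

```python
import math
import random


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def make_synthetic_data(n: int, seed: int = 0):
    """Generate toy records (two standardized features) with a binary label.

    Purely synthetic: the label is 1 when a noisy weighted sum of the
    features is positive. No real patient data is involved.
    """
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        glucose = rng.gauss(0.0, 1.0)  # hypothetical standardized feature
        bmi = rng.gauss(0.0, 1.0)      # hypothetical standardized feature
        score = 2.0 * glucose + 1.0 * bmi + rng.gauss(0.0, 0.5)
        X.append([glucose, bmi])
        y.append(1 if score > 0 else 0)
    return X, y


def train_logistic_regression(X, y, lr=0.1, epochs=200):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(X[0])
    w = [0.0] * n_features
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for xi, yi in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
            err = p - yi  # derivative of the log loss w.r.t. the logit
            for j in range(n_features):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_risk(w, b, features):
    """Return the predicted probability of the positive class."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, features)) + b)


def accuracy(w, b, X, y):
    """Fraction of records whose thresholded prediction matches the label."""
    hits = sum((predict_risk(w, b, xi) >= 0.5) == bool(yi)
               for xi, yi in zip(X, y))
    return hits / len(X)


# Train on one synthetic sample, evaluate on an independent one.
X_train, y_train = make_synthetic_data(400, seed=0)
X_test, y_test = make_synthetic_data(200, seed=1)
w, b = train_logistic_regression(X_train, y_train)
test_accuracy = accuracy(w, b, X_test, y_test)

# Score two hypothetical user inputs.
high = predict_risk(w, b, [2.0, 1.0])
low = predict_risk(w, b, [-2.0, -1.0])
```

In a Streamlit front end of the kind the abstract describes, the feature values would presumably be collected with input widgets such as `st.number_input` and the probability from `predict_risk` displayed with `st.write`; that wiring is omitted here so the sketch runs with the standard library alone.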
[This article belongs to Journal of Computer Technology & Applications]
Anamika Rana, Sushma Malik, Kanika, Ashima. Predicting Multiple Diseases Using Machine Learning: A Data-Driven Approach. Journal of Computer Technology & Applications. 2025; 16(02):-. Available from: https://journals.stmjournals.com/jocta/article=2025/view=208004
Journal of Computer Technology & Applications
Volume | 16 |
Issue | 02 |
Received | 01/03/2025 |
Accepted | 08/03/2025 |
Published | 15/04/2025 |
Publication Time | 45 Days |