Disease Prediction Using Ensemble Learning Models: A Comprehensive Approach

[{“box”:0,”content”:”n[if 992 equals=”Open Access”]n

n

n

n

Open Access

nn

n

n[/if 992]n[if 2704 equals=”Yes”]n

n

Notice

nThis is an unedited manuscript accepted for publication and provided as an Article in Press for early access at the author’s request. The article will undergo copyediting, typesetting, and galley proof review before final publication. Please be aware that errors may be identified during production that could affect the content. All legal disclaimers of the journal apply.n

n[/if 2704]n

n

Year : 2025 [if 2224 equals=””]27/09/2025 at 1:59 PM[/if 2224] | [if 1553 equals=””] Volume : 15 [else] Volume : 15[/if 1553] | [if 424 equals=”Regular Issue”]Issue : [/if 424][if 424 equals=”Special Issue”]Special Issue[/if 424] [if 424 equals=”Conference”][/if 424] 03 | Page : 26 33

n

n

nn

n

n

n

    By

    n

    [foreach 286]n

    n

    Altaf O. Mulani, Swapnil R. Takale,

    n t

  • n

    n[/foreach]

    n

n[if 2099 not_equal=”Yes”]n

    [foreach 286] [if 1175 not_equal=””]n t

  1. Professor, Assistant Professor, Department of Electronics & Telecommunication Engineering, SKN Sinhgad College of Engineering, Pandharpur, Department of Electronics & Telecommunication Engineering, SKN Sinhgad College of Engineering, Pandharpur, Maharashtra, Maharashtra, India, India
  2. n[/if 1175][/foreach]

n[/if 2099][if 2099 equals=”Yes”][/if 2099]n

n

Abstract

n

n

nIn recent years, ensemble learning techniques have become pivotal in advancing predictive analytics within healthcare, particularly for early disease detection. The inherent variability and complexity of medical data, often characterized by high dimensionality, class imbalance, and noise, make it challenging for standalone classifiers to maintain high predictive accuracy. Ensemble learning, by integrating multiple models through bagging, boosting, or stacking, offers a more robust and generalizable approach. This study explores the practical application of ensemble learning algorithms to predict diseases such as diabetes, cancer, and cardiovascular conditions using diverse, publicly available datasets. We employed data preprocessing techniques including normalization, imputation, and encoding to prepare clinical datasets. Recursive Feature Elimination (RFE) was applied to identify the most significant features and improve the quality of model inputs. Three ensemble learning methods: Random Forest, Gradient Boosting, and a meta-model-based Stacking Ensemble, were then trained and evaluated. Their performance was evaluated using metrics such as accuracy, precision, recall, and F1-score through multi-fold cross-validation. The results demonstrate that ensemble methods surpass individual models, with the stacking ensemble achieving 90% accuracy and delivering strong results across all evaluation criteria. Furthermore, we discuss deployment challenges such as model interpretability, training time, and integration with clinical decision support systems. In conclusion, ensemble learning models offer a scalable and effective pathway for building intelligent, data-driven diagnostic systems, capable of assisting healthcare professionals in making early and reliable disease predictions.nn

n

n

n

Keywords: Ensemble learning, disease prediction, machine learning, medical diagnosis, classification models

n[if 424 equals=”Regular Issue”][This article belongs to Journal of Communication Engineering & Systems ]

n

[/if 424][if 424 equals=”Special Issue”][This article belongs to Special Issue under section in Journal of Communication Engineering & Systems (joces)][/if 424][if 424 equals=”Conference”]This article belongs to Conference [/if 424]

n

n

n

How to cite this article:
nAltaf O. Mulani, Swapnil R. Takale. [if 2584 equals=”][226 wpautop=0 striphtml=1][else]Disease Prediction Using Ensemble Learning Models: A Comprehensive Approach[/if 2584]. Journal of Communication Engineering & Systems. 22/09/2025; 15(03):26-33.

n

How to cite this URL:
nAltaf O. Mulani, Swapnil R. Takale. [if 2584 equals=”][226 striphtml=1][else]Disease Prediction Using Ensemble Learning Models: A Comprehensive Approach[/if 2584]. Journal of Communication Engineering & Systems. 22/09/2025; 15(03):26-33. Available from: https://journals.stmjournals.com/joces/article=22/09/2025/view=0

nn

n

n[if 992 equals=”Open Access”]Full Text PDF[/if 992]n

n

n[if 992 not_equal=”Open Access”]n

n

n[/if 992]n

nn

nnn

n[if 379 not_equal=””]nn

Browse Figures

n

n

n[foreach 379]

figures

[/foreach]n

n

n

n[/if 379]

n

n

n

n

n

References n

n[if 1104 equals=””]n

  1. Sharma A, Sharma N, Kaushik I, Kumar S, Khatoon N. Predictive analysis of type 2 diabetes using hybrid ML model and IoT. In: IoT security paradigms and applications. CRC Press; 2020 Oct 7; 303–320.
  2. El Hamdaoui H, Boujraf S, El Houda Chaoui N, Alami B, Maaroufi M. Improving Heart Disease Prediction Using Random Forest and AdaBoost Algorithms. Int J Online Biomed Eng. 2021 Nov 1; 17(11): 60–75.
  3. Yurttakal AH, Erbay H, İkizceli T, Karaçavuş S, Biçer C. Diagnosing breast cancer tumors using stacked ensemble model. J Intell Fuzzy Syst. 2021 Dec 31; 42(1): 77–85.
  4. Khan M, Zaman A, Khan SS, Arshad M. A hybrid approach for automatic segmentation and classification to detect tuberculosis. Digit Health. 2024 Aug; 10: 20552076241271869.
  5. Makka S, Reddy VN, Gnaneshwar R, Kumar MA. AI-Powered Alzheimer’s Prediction System using Deep Learning for Early Diagnosis. In 2024 IEEE 4th International Conference on Advancement in Electronics & Communication Engineering (AECE). 2024 Nov 22; 1169–1173.
  6. Tiwari A, Gupta AK, Gupta T. A robust approach to shear strength prediction of reinforced concrete deep beams using ensemble learning with SHAP interpretability. Soft Comput. 2024 Apr; 28(7): 6343–65.
  7. Zhou ZH. Ensemble methods: foundations and algorithms. CRC press; 2025 Feb 15.
  8. Dietterich TG. Ensemble methods in machine learning. In International workshop on multiple classifier systems. Berlin, Heidelberg: Springer Berlin Heidelberg; 2000 Jun 21; 1–15.
  9. Breiman L. Random forests. Mach Learn. 2001 Oct; 45(1): 5–32.
  10. Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Statist. 2001 Oct 1; 29(5): 1189–232.
  11. Godase MV, Mulani A, Ghodak R, Birajadar G, Takale S, Kolte M. A MapReduce and Kalman filter based secure IIoT environment in Hadoop. Sanshodhak. 2024 Jun; 25: 38–47.
  12. Gadade B, Mulani AO, Harale AD. IoT Based Smart School Bus and Student Tracking System. Sanshodhak. 2024 Jun; 19.
  13. Dhanawadel A, Mulani AO, Pise AC. IOT based Smart farming using Agri BOT. Sanshodhak. 2024 Jun; 26: 117–125.
  14. Mulani A, Mane PB. DWT based robust invisible watermarking. Scholars’ Press; 2016.
  15. Tiple AH, Wadhel VB, Sawant HS, Sawant TT, Sawant SS. Design of surveillance-based quadcopter using Arduino. SSRG Int J Electr Electron Eng. 2019; 6(3): 1–4.
  16. Champaty B, Nayak SK, Thakur G, Mohapatra B, Tibarewala DN, Pal K. Development of Bluetooth, Xbee, and Wi-Fi-Based wireless control systems for controlling electric-powered robotic vehicle wheelchair prototype. In: Classification and Clustering in Biomedical Signal Processing. IGI Global Scientific Publishing; 2016; 356–387.
  17. Swami SS, Mulani AO. An efficient FPGA implementation of discrete wavelet transform for image compression. In 2017 IEEE International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS). 2017 Aug 1; 3385–3389.
  18. Mane PB, Mulani AO. High speed area efficient FPGA implementation of AES algorithm. Int J Reconfigurable Embed Syst. 2018 Nov; 7(3): 157–65.
  19. Mulani AO, Mane PB. Area efficient high speed FPGA based invisible watermarking for image authentication. Indian J Sci Technol. 2016 Oct; 9(39): 1–6.
  20. Kashid MM, Karande KJ, Mulani AO. IoT-based environmental parameter monitoring using machine learning approach. In Proceedings of the International Conference on Cognitive and Intelligent Computing: ICCIC 2021, Vol. 1. Singapore: Springer Nature Singapore; 2022 Nov 1; 43–51.

nn[/if 1104][if 1104 not_equal=””]n

    [foreach 1102]n t

  1. [if 1106 equals=””], [/if 1106][if 1106 not_equal=””],[/if 1106]
  2. n[/foreach]

n[/if 1104]

n


nn[if 1114 equals=”Yes”]n

n[/if 1114]

n

n

[if 424 not_equal=””]Regular Issue[else]Published[/if 424] Subscription Review Article

n

n

[if 2146 equals=”Yes”][/if 2146][if 2146 not_equal=”Yes”][/if 2146]n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n[if 1748 not_equal=””]

[else]

[/if 1748]n

n[if 1746 equals=”Retracted”]n

n

n

n

[/if 1746]n[if 4734 not_equal=””]

n

n

n

[/if 4734]n

n

Volume 15
[if 424 equals=”Regular Issue”]Issue[/if 424][if 424 equals=”Special Issue”]Special Issue[/if 424] [if 424 equals=”Conference”][/if 424] 03
Received 11/07/2025
Accepted 04/09/2025
Published 22/09/2025
Retracted
Publication Time 73 Days

n

n

nn


n

Login

n
My IP
n

PlumX Metrics

nn

n

n

n[if 1746 equals=”Retracted”]n

[/if 1746]nnn

nnn”}]