๐Ÿง‘๐Ÿผโ€๐Ÿ’ป Research - March 27, 2026

SGA-DT: An adaptive fusion framework for missing data imputation and interpretable healthcare classification.

๐ŸŒŸ Stay Updated!
Join AI Health Hub to receive the latest insights in health and AI.

โšก Quick Summary

The study introduces SGA-DT, an innovative framework designed to tackle the challenge of missing data imputation in healthcare analytics. By combining genetically optimized support vector regression with decision tree classification, SGA-DT demonstrates superior performance in accuracy, precision, recall, and F-measure across various healthcare datasets.

๐Ÿ” Key Details

  • ๐Ÿ“Š Datasets Used: Breast Cancer, Mammographic, and Hepatitis datasets, along with real-world and synthetic datasets.
  • ๐Ÿงฉ Features: Non-class attributes with varying levels of missingness.
  • โš™๏ธ Technology: SGA-DT framework utilizing SVR and decision trees.
  • ๐Ÿ† Performance Metrics: Outperformed multiple integrated frameworks in accuracy, precision, recall, and F-measure.

๐Ÿ”‘ Key Takeaways

  • ๐Ÿง  SGA-DT adapts its imputation strategy based on the level of missing data.
  • ๐Ÿ” Genetic algorithms enhance the selection of SVR kernels and hyperparameter tuning.
  • ๐Ÿ“ˆ Robustness and transparency are achieved through decision tree classification.
  • ๐ŸŒŸ Consistent performance across various datasets highlights its generalizability.
  • ๐Ÿ’ก Interpretability analysis supports clinical transparency in predictions.
  • ๐Ÿฅ Potential applications in sensitive healthcare domains where data integrity is crucial.

๐Ÿ“š Background

In healthcare analytics, missing data poses a significant challenge, often leading to compromised model accuracy and clinical reliability. Traditional imputation methods can introduce bias, affecting patient outcomes. As machine learning continues to evolve, the need for robust and interpretable solutions becomes increasingly critical, especially in sensitive areas like healthcare.

๐Ÿ—’๏ธ Study

The research team developed the SGA-DT framework to address the complexities of missing data in healthcare. By integrating support vector regression with decision tree classification, the framework adapts its imputation strategy based on the extent of missingness, ensuring that predictions remain accurate and interpretable.

๐Ÿ“ˆ Results

The SGA-DT framework was rigorously evaluated on multiple healthcare datasets, demonstrating a consistent ability to outperform existing integrated frameworks. Key performance metrics such as accuracy, precision, recall, and F-measure were significantly improved, showcasing the framework’s robustness and adaptability in various scenarios.

๐ŸŒ Impact and Implications

The implications of this study are profound. By providing a reliable method for handling missing data, SGA-DT can enhance the quality of healthcare predictions, leading to better clinical decision-making and patient outcomes. This framework not only improves the accuracy of healthcare analytics but also fosters greater trust and transparency in machine learning applications within the medical field.

๐Ÿ”ฎ Conclusion

The introduction of the SGA-DT framework marks a significant advancement in the field of healthcare analytics. By effectively addressing the challenge of missing data, it paves the way for more accurate and interpretable predictions. As we continue to explore the integration of machine learning in healthcare, frameworks like SGA-DT will play a crucial role in enhancing patient care and clinical outcomes.

๐Ÿ’ฌ Your comments

What are your thoughts on the SGA-DT framework and its potential to transform healthcare analytics? We invite you to share your insights in the comments below or connect with us on social media! ๐Ÿ’ฌ

SGA-DT: An adaptive fusion framework for missing data imputation and interpretable healthcare classification.

Abstract

Despite advances in machine learning and medical data processing, handling missing values remains a critical and complex challenge in healthcare analytics. Missing data, especially in non-class attributes can severely compromise model accuracy, clinical reliability, and interpretability. In sensitive domains such as healthcare, improper imputation may lead to biased outcomes or delayed interventions. To address this challenge, we propose SGA-DT, an adaptive and interpretable learning framework that combines the best features of genetically optimized support vector regression (SVR) with a decision tree (DT) classifier for robust healthcare prediction. The framework adaptively selects an imputation strategy based on the level of missingness. It uses standard SVR for low, iterative SVR for moderate, and k-Nearest Neighbor (KNN) followed by SVR refinement for high missingness. Genetic algorithm (GA) is used to select the best SVR kernel and tune its hyperparameters, enhancing imputation accuracy across different data patterns. The complete dataset is then classified using DT, providing both robustness and transparency in prediction. The SGA-DT framework is evaluated on three healthcare datasets, Breast Cancer, Mammographic, and Hepatitis, along with other real-world and synthetic datasets. For interpretability analysis, decision trees are generated under varying missingness levels to support clinical transparency. Comparative results show that SGA-DT consistently outperforms multiple integrated frameworks across accuracy, precision, recall, and F-measure, demonstrating its robustness, interpretability, and generalizability in healthcare prediction tasks.

Author: [‘Jena M’, ‘Dehuri S’, ‘Cho SB’]

Journal: PLoS One

Citation: Jena M, et al. SGA-DT: An adaptive fusion framework for missing data imputation and interpretable healthcare classification. SGA-DT: An adaptive fusion framework for missing data imputation and interpretable healthcare classification. 2026; 21:e0343619. doi: 10.1371/journal.pone.0343619

Share on facebook
Facebook
Share on twitter
Twitter
Share on linkedin
LinkedIn
Share on whatsapp
WhatsApp

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.