Consistency and Stability in Feature Selection for High-Dimensional Microarray Survival Data in Diffuse Large B-Cell Lymphoma Cancer

dc.contributor.author: Kazeem A. Dauda
dc.contributor.author: Rasheed K. Lamidi
dc.date.accessioned: 2025-04-07T12:29:13Z
dc.date.available: 2025-04-07T12:29:13Z
dc.date.issued: 2025-02-18
dc.description.abstract: High-dimensional survival data, such as microarray datasets, present significant challenges in variable selection and model performance due to their complexity and dimensionality. Identifying important genes and understanding how these genes influence the survival of patients with cancer are of great interest and a major challenge to biomedical scientists, healthcare practitioners, and oncologists. Therefore, this study combined the strengths of two complementary feature selection methodologies: a filtering (correlation-based) approach and a wrapper method based on Iterative Bayesian Model Averaging (IBMA). This new approach, termed Correlation-Based IBMA, offers a highly efficient and effective means of selecting the most important and influential genes for predicting the survival of patients with cancer. The efficiency and consistency of the method were demonstrated using diffuse large B-cell lymphoma cancer data. The results revealed that the 15 most important genes out of 3835 gene features were consistently selected at a threshold p-value of 0.001, with genes with posterior probabilities below 1% being removed. The influence of these 15 genes on patient survival was assessed using the Cox Proportional Hazards (Cox-PH) Model. The results further revealed that eight genes were highly associated with patient survival at a 0.05 level of significance. Finally, these findings underscore the importance of integrating feature selection with robust modeling approaches to enhance accuracy and interpretability in high-dimensional survival data analysis.
dc.identifier.uri: https://doi.org/10.3390/data10020026
dc.identifier.uri: https://kwasuspace.kwasu.edu.ng/handle/123456789/4865
dc.language.iso: en
dc.publisher: data
dc.title: Consistency and Stability in Feature Selection for High-Dimensional Microarray Survival Data in Diffuse Large B-Cell Lymphoma Cancer
dc.type: Article
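
The abstract describes a correlation-based filtering step applied before IBMA, with a p-value threshold of 0.001. The sketch below illustrates what such a filter could look like; it is not the authors' implementation. It assumes, for illustration only, a Pearson correlation between each gene and log survival time, a Fisher z normal approximation for the p-value, and no handling of censoring (which a real survival analysis would need). The function names, the synthetic data, and the reading of 0.001 as the filter's cutoff are all assumptions.

```python
import math
import random


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant feature carries no correlation signal
    return sxy / math.sqrt(sxx * syy)


def correlation_p_value(r, n):
    """Two-sided p-value for H0: rho = 0, using the Fisher z approximation."""
    r = max(min(r, 0.999999), -0.999999)  # keep atanh finite
    z = math.atanh(r) * math.sqrt(n - 3)
    return math.erfc(abs(z) / math.sqrt(2))


def correlation_filter(expression, outcome, alpha=0.001):
    """Return indices of genes whose correlation p-value is below alpha.

    expression: list of per-gene lists, one value per patient.
    outcome: one numeric value per patient (assumed here: log survival time,
             censoring ignored for simplicity).
    """
    n = len(outcome)
    kept = []
    for j, gene in enumerate(expression):
        if correlation_p_value(pearson_r(gene, outcome), n) < alpha:
            kept.append(j)
    return kept


# Synthetic illustration (hypothetical data, not the DLBCL dataset):
# genes 0 and 1 drive the outcome, gene 20 is constant, the rest are noise.
rng = random.Random(0)
n_patients = 200
genes = [[rng.gauss(0, 1) for _ in range(n_patients)] for _ in range(20)]
genes.append([1.0] * n_patients)
log_time = [
    genes[0][i] - 0.8 * genes[1][i] + rng.gauss(0, 0.5)
    for i in range(n_patients)
]
selected = correlation_filter(genes, log_time)
print(selected)
```

In the workflow the abstract outlines, the genes that survive a filter like this would then go to IBMA, and genes with posterior probability below 1% would be dropped before fitting the Cox-PH model.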
Files
Original bundle
Name: data-10-00026.pdf
Size: 476.51 KB
Format: Adobe Portable Document Format