Improving DNA microarray classification with graph-based gene selection and maximum Clique (MC)

Authors

  • Hadeel Najm, Thi-Qar University, Iraq
  • Firas Sabar Miften

DOI:

https://doi.org/10.32792/jeps.v15i2.453

Keywords:

Microarray data classification, gene selection, graph, LS-SVM, maximum clique

Abstract

Microarray data analysis is now one of the most important applications of molecular biology to disease detection. A key task in processing microarray data is gene selection: finding a subset of genes with the lowest internal similarity (redundancy) and the highest relevance to the target class. Removing redundant, noisy, or irrelevant genes reduces data dimensionality, which lowers computational cost while increasing classification accuracy. To improve diagnostic performance, this study proposes a novel gene selection technique for DNA microarray data that draws on graph theory and social network analysis. The proposed approach has two primary goals: to minimize redundancy among the selected genes and to maximize their relevance. In each iteration, the algorithm finds the maximum clique in the gene graph; node clustering is then used to select relevant genes from the genes in that community. According to the reported results, the method reduces time complexity while improving the classification accuracy of microarray data. The approach was tested on several datasets with the LS-SVM classifier, with classification accuracy as the main evaluation criterion. Average classification accuracy with LS-SVM increased notably. Evaluation with a variety of metrics showed that the proposed method identifies informative genes efficiently and substantially increases classification accuracy.
These findings show that the proposed strategy analyzes microarray data effectively and improves classification accuracy, a significant advance in gene expression-based disease classification.
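
The iterative clique-based selection described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the gene graph is built or how genes are ranked inside a community, so the sketch assumes that edges join genes whose absolute Pearson correlation meets a threshold, and that each maximum clique is represented by the single gene most correlated with the class labels. The function names (`pearson`, `build_gene_graph`, `maximum_clique`, `select_genes`) and the `threshold` parameter are hypothetical, and the LS-SVM classification step is omitted.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def build_gene_graph(genes, threshold):
    """Assumed graph construction: connect two genes when |correlation| >= threshold.

    `genes` maps a gene name to its expression values across samples.
    Returns an adjacency dict: gene name -> set of neighbouring gene names.
    """
    names = list(genes)
    adj = {g: set() for g in names}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if abs(pearson(genes[a], genes[b])) >= threshold:
                adj[a].add(b)
                adj[b].add(a)
    return adj


def maximum_clique(adj):
    """Largest clique via basic Bron-Kerbosch (exponential time; small graphs only)."""
    best = []

    def expand(r, p, x):
        nonlocal best
        if not p and not x:  # r is a maximal clique
            if len(r) > len(best):
                best = list(r)
            return
        for v in list(p):
            expand(r + [v], p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand([], set(adj), set())
    return best


def select_genes(genes, labels, threshold=0.8):
    """Repeatedly take the maximum clique, keep its most class-relevant gene,
    and remove the whole clique from the graph (assumed selection rule)."""
    adj = build_gene_graph(genes, threshold)
    selected = []
    while adj:
        clique = maximum_clique(adj)
        rep = max(clique, key=lambda g: abs(pearson(genes[g], labels)))
        selected.append(rep)
        for g in clique:
            for nb in adj.pop(g):
                if nb in adj:
                    adj[nb].discard(g)
    return selected
```

Under these assumptions, each clique is a group of mutually similar genes, so keeping one representative per clique targets low redundancy, while choosing that representative by correlation with the labels targets high relevance. Exact maximum clique search is exponential in the worst case, so a sketch like this only suits small graphs.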

Published

2025-06-01