Improving DNA microarray classification with graph-based gene selection and maximum Clique (MC)
DOI:
https://doi.org/10.32792/jeps.v15i2.453Keywords:
Microarray data classification ,Gene selection, graph, LS-SVM, maximum CliqueAbstract
Abstract:
Nowadays, one of the most important uses of molecular biology for disease detection is microarray data analysis. Gene selection, which searches for a subset of genes with the lowest internal similarity and highest relatedness to the target class, is an important task in microarray data processing. Data dimensionality can be reduced by removing duplicate, annoying, or unnecessary information. This study uses graph theory to support the gene selection strategy for disease diagnosis. To improve diagnostic skills, this study proposes a gene selection technique to evaluate DNA microarray data that takes advantage of graph theory and social network analysis. Gene selection effectively addresses this problem because it reduces computational complexity while increasing the classification accuracy of microarray data. This research proposes a unique gene selection method based on social network analysis. The two primary goals of the proposed approach are to reduce redundancy and increase its relevance for the selected genes. This algorithm iteratively determines the maximum population size in each cycle. Then, using node clustering, relevant genes are selected from this community's list of genes currently present. According to the published results, the new gene selection method will reduce time complexity while improving microarray data's classification accuracy. The LS-SVM classifier was used to test the suggested approach on several sets of data, with the main focus of the study being classification accuracy. With an average classification accuracy using LS-SVM, the findings demonstrated notable increases in classification accuracy. The study demonstrated that the suggested methodology can efficiently identify genes and greatly increase classification accuracy by utilizing a variety of metrics to assess them. These findings demonstrate how well the suggested strategy analyzes microarray data and increases classification accuracy, which represents a significant advancement in the field of gene expression-based illness categorization.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Journal of Education for Pure Science

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Copyright Policy
Authors retain copyright of their articles published in the Journal of Education for Pure Science (JEPS).
By submitting their work, authors grant the journal a non-exclusive license to publish, distribute, and archive the article in all formats and media.
License
All articles published in JEPS are licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
This license permits unrestricted use, distribution, and reproduction in any medium, provided that the original author(s) and the source are properly credited.
Author Rights
Authors have the right to:
-
Share their articles on personal websites, institutional repositories, and academic platforms
-
Reuse their work in future research and publications
-
Distribute the published version without restriction
Journal Rights
The journal retains the right to:
-
Publish and archive the articles
-
Include them in indexing and archiving systems such as LOCKSS and CLOCKSS
-
Promote and disseminate the published work
Responsibility
The contents of all articles are the sole responsibility of the authors. The journal, editors, and editorial board are not responsible for any errors, opinions, or statements expressed in the published articles.
Open Access Statement
JEPS provides immediate open access to its content, supporting the principle that making research freely available to the public enhances global knowledge exchange.
This work is licensed under a Creative Commons Attribution 4.0 International License.
https://creativecommons.org/licenses/by/4.0/