Abstract
This work develops a mixed-learning method that combines a filter-based metaheuristic searcher with a shallow learning classifier to reduce the feature space while maximizing breast cancer prognosis classification performance. The searcher uses a genetic algorithm together with averaged versions of the symmetrical uncertainty (aSU) and ReliefF (aReliefF) filter functions. Averaging allowed us to measure the per capita relevance of a group of features (genes). The proposed method was validated on a data set with 396 instances. The most effective classification scheme was a random forest model with 60 tree predictors combined with the aReliefF objective function. This configuration achieved an average area under the receiver operating characteristic curve (AUC) of 0.854 and 0.874 in the training and test stages, respectively, making it the best breast cancer prognosis classification strategy among those evaluated. In addition, we identified a set of master genes as the intersection of the features deemed relevant by both objective functions. Nevertheless, evaluating this subset on the test set with the top-performing classification scheme yielded comparatively lower performance (AUC = 0.829), underscoring the need for additional genes to maximize classification effectiveness.
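To make the filter objective more concrete, the sketch below computes symmetrical uncertainty for discrete features and averages it over a candidate gene subset. It is only an illustration under stated assumptions: the abstract does not give the exact aSU formulation, so the plain mean over the subset, the function names (`entropy`, `symmetrical_uncertainty`, `average_su`), and the toy data are all hypothetical and not taken from the paper.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def symmetrical_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    h_x, h_y = entropy(x), entropy(y)
    if h_x + h_y == 0:
        return 0.0  # both variables are constant: nothing to share
    h_xy = entropy(list(zip(x, y)))  # joint entropy H(X, Y)
    mutual_info = h_x + h_y - h_xy   # I(X; Y)
    return 2.0 * mutual_info / (h_x + h_y)


def average_su(columns, subset, labels):
    """Assumed aSU: mean SU between each selected feature and the labels."""
    if not subset:
        return 0.0
    total = sum(symmetrical_uncertainty(columns[j], labels) for j in subset)
    return total / len(subset)


# Toy, made-up data: three discretized "genes" and a binary outcome.
columns = [
    [0, 0, 1, 1, 0, 1],  # gene 0: identical to the labels
    [0, 1, 0, 1, 0, 1],  # gene 1: only weakly related to the labels
    [1, 1, 1, 1, 1, 1],  # gene 2: constant, carries no information
]
labels = [0, 0, 1, 1, 0, 1]

print(average_su(columns, [0], labels))     # 1.0
print(average_su(columns, [0, 2], labels))  # 0.5
```

Under this assumed definition, adding an uninformative gene lowers the subset score, which is the per capita behaviour the abstract attributes to the averaged filters. A genetic algorithm could use such a score as the fitness of a candidate subset encoded as a bit mask.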
| Original language | English |
|---|---|
| Title of host publication | 2024 7th IEEE Biennial Congress of Argentina, ARGENCON 2024 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798350365931 |
| State | Published - 2024 |
| Event | 7th IEEE Biennial Congress of Argentina, ARGENCON 2024, San Nicolas de los Arroyos, Argentina. Duration: 18 Sep 2024 → 20 Sep 2024 |
Publication series
| Name | 2024 7th IEEE Biennial Congress of Argentina, ARGENCON 2024 |
|---|---|
Conference
| Conference | 7th IEEE Biennial Congress of Argentina, ARGENCON 2024 |
|---|---|
| Country/Territory | Argentina |
| City | San Nicolas de los Arroyos |
| Period | 18/09/24 → 20/09/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 3: Good Health and Well-being
Keywords
- Genetic algorithm
- Metaheuristics
- Naive Bayes
- Random forest
- ReliefF
- Shallow learning
- Symmetrical uncertainty
- k-nearest neighbors
Title
Towards a Mixed Learning Strategy for Discovering New Gene Signatures in Breast Cancer Prognosis