Abstract
This work addresses the issue of variable selection within the context of breast cancer classification with mammography. A comprehensive repository of feature vectors was used including a hybrid subset gathering image-based and clinical features. It aimed to gather experimental evidence of variable selection in terms of cardinality, type and find a classification scheme that provides the best performance over the Area Under Receiver Operating Characteristics Curve (AUC) scores using the ranked features subset. We evaluated and classified a total of 300 subsets of features formed by the application of Chi-Square Discretization, Information-Gain, One-Rule and RELIEF methods in association with Feed-Forward Backpropagation Neural Network (FFBP), Support Vector Machine (SVM) and Decision Tree J48 (DTJ48) Machine Learning Algorithms (MLA) for a comparative performance evaluation based on AUC scores. A variable selection analysis was performed for Single-View Ranking and Multi-View Ranking groups of features. Features subsets representing Microcalcifications (MCs), Masses and both MCs and Masses lesions achieved AUC scores of 0.91, 0.954 and 0.934 respectively. Experimental evidence demonstrated that classification performance was improved by combining image-based and clinical features. The most important clinical and image-based features were StromaDistortion and Circularity respectively. Other less important but worth to use due to its consistency were Contrast, Perimeter, Microcalcification, Correlation and Elongation.
| Original language | English |
|---|---|
| Title of host publication | Medical Imaging 2013 |
| Subtitle of host publication | Computer-Aided Diagnosis |
| DOIs | |
| State | Published - 2013 |
| Externally published | Yes |
| Event | Medical Imaging 2013: Computer-Aided Diagnosis - Lake Buena Vista, FL, United States Duration: 12 Feb 2013 → 14 Feb 2013 |
Publication series
| Name | Proceedings of SPIE - The International Society for Optical Engineering |
|---|---|
| Volume | 8670 |
| ISSN (Print) | 0277-786X |
Conference
| Conference | Medical Imaging 2013: Computer-Aided Diagnosis |
|---|---|
| Country/Territory | United States |
| City | Lake Buena Vista, FL |
| Period | 12/02/13 → 14/02/13 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Breast cancer classification
- Clinical descriptors
- Image-based descriptors
- Machine learning algorithms
- Mammography
- Variable selection analysis
Fingerprint
Dive into the research topics of 'Improving breast cancer classification with mammography, supported on an appropriate variable selection analysis'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver