Xiaolong, X., Wen, C. and Xinheng, Wang (2021) RFC: A feature selection algorithm for software defect prediction. Journal of Systems Engineering and Electronics, 32 (2). pp. 389-398.
Preview |
PDF
RFC_A_feature_selection_algorithm_for_software_defect_prediction.pdf - Published Version Download (775kB) | Preview |
Abstract
Software defect prediction (SDP) is used to perform the statistical analysis of historical defect data to find out the distribution rule of historical defects, so as to effectively predictdefects in the new software. However, there are redundant and irrelevant features in the software defect datasets affecting the performance of defect predictors. In order to identify and remove the redundant and irrelevant features in software defectdatasets, we propose Relief F-based clustering (RFC), a cluster-based feature selection algorithm. Then, the correlation between features is calculated based on the symmetric uncertainty. According to the correlation degree, RFC partitions features into kclusters based on the k-medoids algorithm, and finally selects the representative features from each cluster to form the final feature subset. In the experiments, we compare the proposed RFC with classical feature selection algorithms on nine National Aeronautics and Space Administration (NASA) software defectprediction datasets in terms of area under curve (AUC) and F-value. The experimental results show that RFC can effectively improve the performance of SDP.
Item Type: | Article |
---|---|
Identifier: | 10.23919/JSEE.2021.000032 |
Subjects: | Computing |
Depositing User: | Marc Forster |
Date Deposited: | 11 Nov 2024 14:36 |
Last Modified: | 11 Nov 2024 14:45 |
URI: | https://repository.uwl.ac.uk/id/eprint/12879 |
Downloads
Downloads per month over past year
Actions (login required)
View Item |