RFC: A feature selection algorithm for software defect prediction

Xiaolong, X., Wen, C. and Xinheng, Wang (2021) RFC: A feature selection algorithm for software defect prediction. Journal of Systems Engineering and Electronics, 32 (2). pp. 389-398.

[thumbnail of RFC_A_feature_selection_algorithm_for_software_defect_prediction.pdf]
Preview
PDF
RFC_A_feature_selection_algorithm_for_software_defect_prediction.pdf - Published Version

Download (775kB) | Preview

Abstract

Software defect prediction (SDP) is used to perform the statistical analysis of historical defect data to find out the distribution rule of historical defects, so as to effectively predictdefects in the new software. However, there are redundant and irrelevant features in the software defect datasets affecting the performance of defect predictors. In order to identify and remove the redundant and irrelevant features in software defectdatasets, we propose Relief F-based clustering (RFC), a cluster-based feature selection algorithm. Then, the correlation between features is calculated based on the symmetric uncertainty. According to the correlation degree, RFC partitions features into kclusters based on the k-medoids algorithm, and finally selects the representative features from each cluster to form the final feature subset. In the experiments, we compare the proposed RFC with classical feature selection algorithms on nine National Aeronautics and Space Administration (NASA) software defectprediction datasets in terms of area under curve (AUC) and F-value. The experimental results show that RFC can effectively improve the performance of SDP.

Item Type: Article
Identifier: 10.23919/JSEE.2021.000032
Subjects: Computing
Depositing User: Marc Forster
Date Deposited: 11 Nov 2024 14:36
Last Modified: 11 Nov 2024 14:45
URI: https://repository.uwl.ac.uk/id/eprint/12879

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item

Menu