Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/3187
Title: A Study of Filter-Based Feature Selection in Software Fault Prediction
Authors: Ha, Thi Minh Phuong
Le, Thi My Hanh
Nguyen, Thanh Binh
Keywords: Feature selection
Filter
Machine learning algorithms
Fault prediction
Issue Date: Aug-2022
Publisher: Springer Nature
Abstract: Software fault prediction (SFP) assists developers in diagnosing the potential defects in the early stage. In SFP, software metrics have strong influence on the performance of a predictive model. However, high dimensional data impacts negatively on the predictive accuracy. As a solution, feature selection provides a process of selecting the optimal features that combine with machine learning techniques to build SFP models. For feature selection, filter selection is a way of addressing the high dimensionality, reducing computation time and improving prediction performance. In this research, we investigate a comparative analysis to review how different of nine filter feature selection methods on both datasets in PROMISE repository, namely CM1 and KC1. The experimental results show that the performances of classifiers are varying on different datasets, especially, in the CM1 dataset, Gain Ratio and Relief based on XGBoost (XGB) and Extra Trees (ET) achieved the highest accuracy and AUC values. In KC1, Gain Ratio and Mutual Information presented the greatest performance among nine methods.
Description: International Conference on Intelligence of Things (ICIT 2022); Lecture Notes on Data Engineering and Communications Technologies, Vol.148; pp: 58–67
URI: https://doi.org/10.1007/978-3-031-15063-0_5
http://elib.vku.udn.vn/handle/123456789/3187
ISBN: 978-3-031-15063-0 (e)
Appears in Collections:NĂM 2022

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.