12th International Conference on Computer and Knowledge Engineering
Cross-project Defect Prediction with An Enhanced Transfer Boosting Algorithm
Authors :
Nazgol Nikravesh (1)
Mohammad Reza Keyvanpour (2)
1- Data Mining Laboratory, Department of Computer Engineering, Faculty of Engineering, Alzahra University, Tehran, Iran
2- Department of Computer Engineering, Faculty of Engineering, Alzahra University, Tehran, Iran
Keywords :
class imbalance, software defect prediction, cross-project defect prediction, transfer learning, training data selection
Abstract :
As the number of software projects grows, predicting software defects becomes increasingly important. When sufficient historical data is available, within-project defect prediction models can be effective. In the early stages of software development, however, too little data exists to train an effective predictor. Cross-project defect prediction (CPDP) uses information from previous, mature projects (source data) to predict whether new software modules (target data) will be defective. CPDP models must account for the fact that the data distributions of source and target projects differ. They typically reduce these differences either by selecting training data or by applying transfer learning. Recent CPDP models use transfer learning to reduce distribution differences effectively, yet none of them considers that negative transfer may occur because defect data is imbalanced. This paper proposes a four-step model. The first three steps prepare the training data and its initial weights for the fourth step, which applies an enhanced version of the transfer boosting algorithm. This algorithm takes the imbalanced nature of the data into account and updates the weights of the source data to improve prediction performance. Thus, in addition to reducing distribution differences between source and target data, the model addresses the class imbalance of defect data. Compared with four state-of-the-art CPDP models, the proposed model provided consistent and accurate predictions on fifteen projects from PROMISE, AEEEM, and SOFTLAB. It achieved the best average results for both AUC and F-measure, and on some datasets the improvement exceeded 5%.
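The abstract does not give the details of the enhanced algorithm, so the following is only a minimal sketch of the general idea it builds on: a classic TrAdaBoost-style weight update, paired with class-balanced initial weights as one simple way to account for imbalance. The function names `initial_weights` and `tradaboost_update`, the balancing scheme, and the error clamping are assumptions made for illustration and are not taken from the paper.

```python
import math

def initial_weights(labels):
    """Assumed scheme: give each class a total weight of 0.5, so the
    rare (defective) class is not swamped by the majority class."""
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must be present")
    return [0.5 / n_pos if y == 1 else 0.5 / n_neg for y in labels]

def tradaboost_update(w_src, y_src, pred_src, w_tgt, y_tgt, pred_tgt, n_rounds):
    """One round of a classic TrAdaBoost-style update (not the paper's variant).

    Misclassified source instances are down-weighted, since they look
    unlike the target data; misclassified target instances are up-weighted.
    """
    # Weighted error of the current weak learner on the target data only.
    eps = sum(w for w, y, p in zip(w_tgt, y_tgt, pred_tgt) if y != p) / sum(w_tgt)
    eps = min(max(eps, 1e-10), 0.499)  # clamp: the update needs 0 < eps < 0.5
    beta_tgt = eps / (1.0 - eps)
    beta_src = 1.0 / (1.0 + math.sqrt(2.0 * math.log(len(w_src)) / n_rounds))
    new_src = [w * beta_src if y != p else w
               for w, y, p in zip(w_src, y_src, pred_src)]
    new_tgt = [w / beta_tgt if y != p else w
               for w, y, p in zip(w_tgt, y_tgt, pred_tgt)]
    return new_src, new_tgt

# Toy data: 1 = defective, 0 = clean.
y_src, pred_src = [1, 0, 0, 0], [0, 0, 0, 0]
y_tgt, pred_tgt = [1, 0, 0], [1, 1, 0]
w_src, w_tgt = initial_weights(y_src), initial_weights(y_tgt)
new_src, new_tgt = tradaboost_update(
    w_src, y_src, pred_src, w_tgt, y_tgt, pred_tgt, n_rounds=10
)
```

In this toy run the misclassified source instance loses weight and the misclassified target instance gains weight, which is the mechanism a transfer boosting method relies on to limit negative transfer from dissimilar source data.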