0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Impact of Oversampling Methods on Imbalanced Dataset for Software Fault Prediction
Authors :
Alireza Abiri
1
Alireza Tajary
2
Mansoor Fateh
3
1- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
2- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
3- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
Keywords :
Software Fault Prediction،Imbalanced Data،Machine Learning،GAN،BugHunter Dataset
Abstract :
In today's world, with the rapid advancement of technology and the increasing use and scale of software systems both in terms of data volume and number of users, the occurrence of software faults has become inevitable. Consequently, software fault prediction has gained significant importance for the early identification of faulty modules during the software development process. However, one of the key challenges in this domain is the class imbalance problem, where the number of faulty and non-faulty instances in software datasets is highly unequal. To address this issue, data oversampling techniques are commonly employed to balance the datasets. In this study, we investigate and compare the performance of three data oversampling methods on the BugHunter software fault dataset. The results indicate that using Generative Adversarial Networks (GANs) for data generation and oversampling is a more effective approach for addressing class imbalance, achieving better performance compared to alternative methods.
Papers List
List of archived papers
A Hybrid Echo State Network for Hypercomplex Pattern Recognition, Classification, and Big Data Analysis
Mohammad Jamshidi - Fatemeh Daneshfar
Attentional Bi-LSTM for Multivariate Time Series Forecasting on Edge Devices: A Case Study on NanoPi Neo Plus2
Navid Hajizadeh - Saeed Yazdani - Sara Ershadi-Nasab
An Improved and Accurate Measure for Mining Correlated High-utility Itemsets
Amir Masoud Heidari Orojloo - Morteza Keshtkaran
Enhancing Lighter Neural Network Performance with Layer-wise Knowledge Distillation and Selective Pixel Attention
Siavash Zaravashan - Sajjad Torabi - Hesam Zaravashan
Generating Hand-Written Symbols With Trajectory Planning Using A Robotic Arm
Arya Parvizi - Armin Salimi-Badr
Cluster Sampling: A Cluster-Driven Sampling Strategy for Deep Metric Learning
Hamideh Rafiee - Ahmad Ali Abin - Seyed Soroush Majd
Multi-Layered Defense Against Modern Phishing: A Dual-Sandbox and CDR Approach
Mahdi Seyfipoor - Mohammad Mahdi Eskandari
Delay Optimization of a Federated Learning-based UAV-aided IoT network
Hossein Mohammadi Firouzjaei - Javad Zeraatkar Moghaddam - Mehrdad Ardebilipour
Non-Functional Requirement Extracting Methods for AI-based Systems: A Survey
Reza Damirchi - Amineh Amini
GAP: Fault tolerance Improvement of Convolutional Neural Networks through GAN-aided Pruning
Pouya Hosseinzadeh - Yasser Sedaghat - Ahad Harati
more
Samin Hamayesh - Version 43.7.0