0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Impact of Oversampling Methods on Imbalanced Dataset for Software Fault Prediction
Authors :
Alireza Abiri
1
Alireza Tajary
2
Mansoor Fateh
3
1- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
2- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
3- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
Keywords :
Software Fault Prediction،Imbalanced Data،Machine Learning،GAN،BugHunter Dataset
Abstract :
In today's world, with the rapid advancement of technology and the increasing use and scale of software systems both in terms of data volume and number of users, the occurrence of software faults has become inevitable. Consequently, software fault prediction has gained significant importance for the early identification of faulty modules during the software development process. However, one of the key challenges in this domain is the class imbalance problem, where the number of faulty and non-faulty instances in software datasets is highly unequal. To address this issue, data oversampling techniques are commonly employed to balance the datasets. In this study, we investigate and compare the performance of three data oversampling methods on the BugHunter software fault dataset. The results indicate that using Generative Adversarial Networks (GANs) for data generation and oversampling is a more effective approach for addressing class imbalance, achieving better performance compared to alternative methods.
Papers List
List of archived papers
Deep Learning Feature Extraction for COVID-19 Detection Algorithm using Computerized Tomography Scan
Maisarah Mohd Sufian - Ervin Gubin Moung - Chong Joon Hou - Ali Farzamnia
DIPT: Diversified Personalized Transformer for QAC systems
Mahdi Dehghani - Samira Vaez Barenji - Saeed Farzi
Blind image quality assessment based on Multi-resolution Local Structures
Seyed Majid Khorashadizadeh - Mehdi Sadeghi Bakhi - Fatemeh Seifishahpar - AliMohammad Latif
Optimizing MR Image Registration for Accurate Brain Volume Measurement in Children with Autism Spectrum Disorder
Shiva Sanati - Mahdi Saadatmand
Blind Load-Balancing Algorithm using Double-Q-learning in the Fog Environment
Niloofar Tahmasebi pouya - Mehdi Agha Sarram
LPCNet: Lane detection by lane points correction network in challenging environments based on deep learning
Sina BaniasadAzad - Seyed Mohammadreza Mousavi mirkolaei
HV-RCE: Reducing Network Bandwidth Usage for Video Transmission via HEVC/VVC Features in Resource-Constrained Environments
Yaghoub Saberi - Mohammadreza Forghani - Sharifeh Sadat Mirkhalaf
DRL-based Decision-Making for Autonomous Vehicle Collision Avoidance
Hoda Gholamrezaee - Seyedreza Taghizadeh - Ali Honarjoo
Hardware-Efficient Pruned CNN Optimized by Neural Architecture Search and Genetic Algorithm for Diabetic Retinopathy Detection on STM32F746
Omid Askari Haddad - Sara Ershadi-Nasab
An effective hybrid algorithm for locating splicing forgery image
Seyed Hesamoddin Hosseini - Amene Vatanparast - Amir Hossein Taherinia
more
Samin Hamayesh - Version 43.7.0