0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Improving the classification of high dimensional class-imbalanced data using the Chaos particle swarm optimization with Levy Flight
Authors :
Mohammad Ali Zarif
1
Javad Hamidzadeh
2
1- Sadjad University of Technology
2- Sadjad University of Technology
Keywords :
Imbalanced data classification; High Dimensional data; Cost-sensitive Learning; Feature Selection; Chaos Particle Swarm Optimization; Levy Flight
Abstract :
Nowadays classification of imbalanced datasets has gone under considerable attention in the field of data mining and machine learning. Classification of imbalanced data is a challenging task due to imbalanced distribution of the data. In addition, conventional algorithms do not perform well in classification of such high dimensional data. Researchers have proposed many techniques for improving the classification of high dimensional class-imbalanced data. These approaches fall into five categories: 1- Data level, 2- Algorithm level, 3- Cost-sensitive learning, 4- Ensemble learning and 5- Feature selection methods. In this paper, we have exploited cost-sensitive and feature selection methods. Our proposed method first uses fuzzy c-means clustering (FCM) to estimate the probability of each sample being a member of the minority class. Using this method, samples which are hard to classify are determined based on their probabilities and then the cost-sensitive learning is employed to estimate their cost. Finally, particle swarm optimization algorithm enhanced by levy flight is applied for feature selection and a SVM model is trained for classification of the data. By using particle swarm optimization alongside with SVM classifier, best features are determined and misclassification rate of minority samples is reduced. The evaluation of the proposed method is done by using G-Mean metric and 10-fold cross validation. The experimental results demonstrate the superiority of the proposed method compared to other approaches in classification of high dimensional class-imbalanced data.
Papers List
List of archived papers
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Elham Shabaninia - Fatemeh sadat Eslami - Ali Afkari Fahandari - Hossein Nezamabadi-pour
Towards Study of Research Topics Evolution in Artificial Intelligence based on Topic Embedding
Seyyed Reza Taher Harikandeh - Sadegh Aliakbary - Soroush Taheri
An overview of Business Intelligence research in healthcare organizations using a topic modeling approach
Mohammad Mehraeen - Laya Mahmoudi - Mohammad Hossein Sharifi
A novel hybrid DMHS-GMDH algorithm to predict COVID-19 pandemic time series
Ahmad Taheri - Shahriar Ghashghaei - Amin Beheshti - Keyvan RahimiZadeh
Adaptive Channel Estimation for MIMO-OFDM Systems in Impulsive Noise Environments
Mojtaba Hajiabadi
Segmentation of Coronary Artery Stenosis in X-ray Angiography using Mamba Models
Fatemeh Fouladi - Ali Rostami - Hedieh Sajedi
A New Inter-layer Similarity metric for link prediction in multilayer networks
Alireza Abdollahpouri - Samira Rafiee
Word-level Persian Lipreading Dataset
Javad Peymanfard - Ali Lashini - Samin Heydarian - Hossein Zeinali - Nasser Mozayani
Information Theoretic Learning-based Deep Embedded Clustering (ITL-DEC)
Hoda Shad - Mona Zamiri - Tahereh Bahreini - Reza Monsefi - Ghoshe Abed Hodtani
Efficient Prediction of Cardiovascular Disease via Extra Tree Feature Selection
Mina Abroodi - Mohammad Reza Keyvanpour - Ghazaleh Kakavand Teimoory
more
Samin Hamayesh - Version 42.2.1