0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
Efficient Prediction of Cardiovascular Disease via Extra Tree Feature Selection
Authors :
Mina Abroodi
1
Mohammad Reza Keyvanpour
2
Ghazaleh Kakavand Teimoory
3
1- Data Mining Laboratory, Department of Computer Engineering Faculty of Engineering, Alzahra University Tehran, Iran
2- Department of Computer Engineering Faculty of Engineering, Alzahra University Tehran, Iran
3- Data Mining Laboratory, Department of Computer Engineering Faculty of Engineering, Alzahra University Tehran, Iran
Keywords :
cardiovascular disease prediction،Logistic Regression algorithm،Extra Tree،Feature Selection،E-health،Cleveland dataset
Abstract :
All abnormalities related to the heart can be classified as heart diseases. These diseases are also known as cardiovascular diseases. Predicting heart disease is crucial due to its status as a leading global cause of mortality. According to predictions, if this disease can be diagnosed earlier, treatment cost can be reduced, prompt initiation of care for patients and millions of lives can be saved. Utilizing machine learning and artificial intelligence for this purpose is paramount. Previous research has explored various methodologies such as KNN and SVM. However, the preprocessing phase significantly influences model performance, especially in e-health domains. This work proposes a groundbreaking method that uses the Extra Tree Classification method for feature selection, optimizing the selection of key attributes from the Cleveland Heart Disease dataset. Our method is chosen for the low computational cost and interpretability of feature selection through randomized scoring. Additionally, we employ logistic regression, renowned for its effectiveness and efficiency in classification tasks and to ensure the robustness of the model, we used the k-fold cross validation method. Our findings demonstrate a notable accuracy of 87.09% when predicting heart disease, with a precision of 87.45% and an F-measure of 86.02%, surpassing previous benchmarks. Considering the high interpretability of models like logistic regression, this level of accuracy is particularly valuable in medical contexts where transparency and explainability are critical, making it a competitive alternative to more complex, less interpretable methods. These findings highlight the effectiveness of combining Extra Tree-based feature selection with logistic regression in improving the predictive accuracy of heart disease diagnostics. Furthermore, we address missing values and their impact on model performance, further strengthening the reliability of our approach.
Papers List
List of archived papers
Distilling Knowledge from CNN-Transformer Models for Enhanced Human Action Recognition
Hamid Ahmadabadi - Omid Nejati Manzari - Ahmad Ayatollahi
Generating Hand-Written Symbols With Trajectory Planning Using A Robotic Arm
Arya Parvizi - Armin Salimi-Badr
DevRanker: An Effective Approach to Rank Developers for Bug Report Assignment
Mohammad Reza Kardoost - Mohammad Reza Moosavi - Reza Akbari
Word-level Persian Lipreading Dataset
Javad Peymanfard - Ali Lashini - Samin Heydarian - Hossein Zeinali - Nasser Mozayani
Virus-Antiviral Prediction Using Machine and Deep Learning Methods
Shayan Majidifar - Fatemeh Nasiri - Mohsen Hooshmand
A novel hybrid DMHS-GMDH algorithm to predict COVID-19 pandemic time series
Ahmad Taheri - Shahriar Ghashghaei - Amin Beheshti - Keyvan RahimiZadeh
Solving the influence maximization problem by using entropy and weight of edges
Farzaneh Kazemzadeh - Amir Karian - Mitra Mirzarezaee - Ali Asghar Safaei
Automated software design using Machine Learning With Natural Language Processing
Fahimeh Khedmatkon - Seyed Mohammad Hossein Hasheminejad - Jaleh Shoshtarian Malak
Optimizing Question-Answering Framework Through Integration of Text Summarization Model and Third-Generation Generative Pre-Trained Transformer
Ervin Gubin Moung - Toh Sin Tong - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Farashazillah Yahya
A Cost-Sensitive Genetic Algorithm for Customer Segmentation in Auto Insurances
Alireza Khajenoori - Mohammad Saniee Abadeh - Mohsen Mohammadzadeh
more
Samin Hamayesh - Version 42.4.1