14th International Conference on Computer and Knowledge Engineering
Attention-Boosted Ensemble of Pre-trained Convolutional Neural Networks for Accurate Diabetic Retinopathy Detection
Authors :
Benyamin Mirab Golkhatmi 1, Mohammad Hossein Moattar 2
1- Department of Computer Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran
2- Department of Computer Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran
Keywords :
Diabetic retinopathy, Deep transfer learning, Fine-tuning, Model ensemble, EfficientNetB0, EfficientNetB1, Attention mechanism
Abstract :
Early and accurate detection of diabetic retinopathy (DR) is crucial for preserving vision, and it depends on extracting informative, global features that support precise and reliable decisions. Convolutional neural networks (CNNs), known for their high accuracy, are well-suited for this application. However, these models are susceptible to data scarcity, a challenge that can be mitigated through transfer learning. Additionally, model ensembles have proven effective in similar domains. This study uses two pre-trained CNNs from the EfficientNet family, EfficientNetB0 and EfficientNetB1, and combines the features from both models to enhance decision-making. A Multi-Head Attention layer is incorporated to extract global and region-independent features, further improving the representation. Consequently, the model can focus on the most critical areas of the image, thereby increasing detection accuracy. The proposed approach is evaluated on two datasets for binary classification (DR or No-DR). On the IDRiD dataset, it achieved an accuracy of 99.07% and an F1 score of 99.02%; on the APTOS dataset, it attained an accuracy of 99.19% and an F1 score of 99.07%. These findings illustrate the effectiveness of combining CNNs with attention mechanisms for the accurate and timely diagnosis of DR.
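For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how accuracy and F1 score are commonly computed for a binary DR / No-DR task. It is not the authors' evaluation code. The function name `binary_metrics` and the sample labels are hypothetical, and it assumes F1 is computed for the positive (DR) class, which the abstract does not specify.

```python
def binary_metrics(y_true, y_pred):
    """Return (accuracy, f1) for binary labels, where 1 = DR and 0 = No-DR.

    Assumption: F1 is taken over the positive (DR) class; the abstract
    does not say whether a per-class or averaged F1 was reported.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0

    # F1 = 2TP / (2TP + FP + FN), the harmonic mean of precision and recall.
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


# Hypothetical labels for illustration only (not data from the paper).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
accuracy, f1 = binary_metrics(y_true, y_pred)
print(f"accuracy={accuracy:.2%}, f1={f1:.2%}")
```

Accuracy counts all correct predictions, while F1 ignores true negatives, so the two can diverge when the DR and No-DR classes are imbalanced.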