0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer
Authors :
Javad Mirzapour Kaleybar
1
Hooman Saadat
2
Hooman Khaloo
3
1- Department of Computer Engineering, University College of Nabi Akram, Tabriz, Iran
2- Department of Electrical Engineering, Qazvin Branch, Iran
3- School of Technology Sharif University, Tehran, Iran
Keywords :
Medical Image Analysis،Transformer،Convolutional Neural Network،Deep Learning
Abstract :
This research paper addresses the challenges associated with traffic sign detection in self-driving vehicles and driver assistance systems. The development of reliable and highly accurate algorithms is crucial for the widespread adoption of traffic sign recognition and detection (TSRD) in diverse real-life scenarios. However, this task is complicated by suboptimal traffic images affected by factors such as camera movement, adverse weather conditions, and inadequate lighting. This study specifically focuses on traffic sign detection methods and introduces the application of the Transformer model, particularly the Vision Transformer variants, to tackle this task. The Transformer's attention mechanism, originally designed for natural language processing, offers improved parallel efficiency. Vision Transformers have demonstrated success in various domains, including autonomous driving, object detection, healthcare, and defense-related applications. To enhance the efficiency of the Transformer model, the research proposes a novel strategy that integrates a locality inductive bias and a transformer module. This includes the introduction of the Efficient Convolution Block and the Local Transformer Block, which effectively capture short-term and long-term dependency information, thereby improving both detection speed and accuracy. Experimental evaluations demonstrate the significant advancements achieved by this approach, particularly when applied to the German Traffic Sign Detection dataset.
Papers List
List of archived papers
CSI-Based Human Activity Recognition using Convolutional Neural Networks
Parisa Fard Moshiri - Mohammad Nabati - Reza Shahbazian - Seyed Ali Ghorashi
Lossless Watermarking in Encrypted Triangular Mesh Models Based on Optimized Vertex Estimation and Error Histogram Shifting
Alireza Ghaemi - Habibollah Danyali - Kamran Kazemi - Zahra Qodrati - Amirhossein Ghaemi - Seyedeh Masoumeh Taji
Hybrid navigation based on GPS data and SIFT-based place recognition using Biologically-inspired SLAM
Sahar Salimpour Kasebi - Hadi Seyedarabi - Javad Musevi Niya
Adaptive Channel Estimation for MIMO-OFDM Systems in Impulsive Noise Environments
Mojtaba Hajiabadi
Graph Representation Learning Towards Patents Network Analysis
Mohammad Heydari - Babak Teimourpour
A parallel CNN-BiGRU network for short-term load forecasting in demand-side management
Arghavan Irankhah - Sahar Rezazadeh Saatlou - Mohammad Hossein Yaghmaee - Sara Ershadi-Nasab - Mohammad Alishahi
Underwater Image Super-Resolution using Generative Adversarial Network-based Model
Alireza Aghelan - Modjtaba Rouhani
Multi-Digit Handwritten Recognition: A CNN-LSTM Hybrid Approach with Wavelet Transforms
Amin Kazempour - Jafar Tanha
Emotion Recognition In Persian Speech Using Deep Neural Networks
Ali Yazdani - Hossein Simchi - Yasser Shekofteh
Distilling Knowledge from CNN-Transformer Models for Enhanced Human Action Recognition
Hamid Ahmadabadi - Omid Nejati Manzari - Ahmad Ayatollahi
more
Samin Hamayesh - Version 41.7.6