0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer
Authors :
Javad Mirzapour Kaleybar
1
Hooman Saadat
2
Hooman Khaloo
3
1- Department of Computer Engineering, University College of Nabi Akram, Tabriz, Iran
2- Department of Electrical Engineering, Qazvin Branch, Iran
3- School of Technology Sharif University, Tehran, Iran
Keywords :
Medical Image Analysis،Transformer،Convolutional Neural Network،Deep Learning
Abstract :
This research paper addresses the challenges associated with traffic sign detection in self-driving vehicles and driver assistance systems. The development of reliable and highly accurate algorithms is crucial for the widespread adoption of traffic sign recognition and detection (TSRD) in diverse real-life scenarios. However, this task is complicated by suboptimal traffic images affected by factors such as camera movement, adverse weather conditions, and inadequate lighting. This study specifically focuses on traffic sign detection methods and introduces the application of the Transformer model, particularly the Vision Transformer variants, to tackle this task. The Transformer's attention mechanism, originally designed for natural language processing, offers improved parallel efficiency. Vision Transformers have demonstrated success in various domains, including autonomous driving, object detection, healthcare, and defense-related applications. To enhance the efficiency of the Transformer model, the research proposes a novel strategy that integrates a locality inductive bias and a transformer module. This includes the introduction of the Efficient Convolution Block and the Local Transformer Block, which effectively capture short-term and long-term dependency information, thereby improving both detection speed and accuracy. Experimental evaluations demonstrate the significant advancements achieved by this approach, particularly when applied to the German Traffic Sign Detection dataset.
Papers List
List of archived papers
A Hybrid Echo State Network for Hypercomplex Pattern Recognition, Classification, and Big Data Analysis
Mohammad Jamshidi - Fatemeh Daneshfar
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
UAV-based Firefighting by Multi-agent Reinforcement Learning
Reza Shami Tanha - Mohsen Hooshmand - Mohsen Afsharchi
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules
Narges Semiromizadeh - Omid Nejati Manzari - Shahriar B. Shokouhi - Sattar Mirzakuchaki
Farsi Optical Character Recognition Using a Transformer-based Model
Fatemeh Asadi Zeydabadi - Elham Shabaninia - Hossein Nezamabadi-pour - Melika Shojaee
MultiPath ViT OCR: A Lightweight Visual Transformer-based License Plate Optical Character Recognition
Alireza Azadbakht - Saeed Reza Kheradpisheh - Hadi Farahani
Real-Time Gender Recognition with a Deep Neural Network
Samad Azimi Abriz - Majid Meghdadi
An Attention-Based Model for Clinical Time Series Prediction: Enhancing ICU Readmission Prediction
Hananeh Sadat Madinei - Mohammad Reza Keyvanpour - Seyed Vahab Shojaedini
A 2D-CNN Architecture for Improving the Classification Accuracy of an Electronic Nose with Different Sensor Positions
Hannaneh Mahdavi - Reza Goldoust - Saeideh Rahbarpour
A Deep Reinforcement Learning Approach Combining Technical and Fundamental Analyses with a Large Language Model for Stock Trading
Mahan Veisi - Sadra Berangi - Mahdi Shahbazi Khojasteh - Armin Salimi-Badr
more
Samin Hamayesh - Version 41.3.1