0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Traffic Sign Recognition Using Local Vision Transformer
Authors :
Ali Farzipour
1
Omid Nejati Manzari
2
Shahriar B. Shokouhi
3
1- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
2- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
3- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
Keywords :
vision transformer،Deep Learning،Traffic Sign Recognition،self-driving vehicles
Abstract :
Abstract—Recognition of traffic signs is a crucial aspect of self- driving cars and driver assistance systems, and machine vision tasks such as traffic sign recognition have gained significant attention. CNNs have been frequently used in machine vision, but introducing vision transformers has provided an alternative approach to global feature learning. This paper proposes a new novel model that blends the advantages of both convolutional and transformer-based networks for traffic sign recognition. The proposed model includes convolutional blocks for capturing local correlations and transformer-based blocks for learning global dependencies. Additionally, a locality module is incorporated to enhance local perception. The performance of the suggested model is evaluated on the Persian Traffic Sign Dataset and German Traffic Sign Recognition Benchmark and compared with SOTA convolutional and transformer-based models. The experimental evaluations demonstrate that the hybrid network with the locality module outperforms pure transformer-based models and some of the best convolutional networks in accuracy. Specifically, our proposed final model reached 99.66% accuracy in the German traffic sign recognition benchmark and 99.8% in the Persian traffic sign dataset, higher than the best convolutional models. Moreover, it outperforms existing CNNs and ViTs while maintaining fast inference speed. Consequently, the proposed model proves to be significantly faster and more suitable for real-world applications.
Papers List
List of archived papers
Distilling Knowledge from CNN-Transformer Models for Enhanced Human Action Recognition
Hamid Ahmadabadi - Omid Nejati Manzari - Ahmad Ayatollahi
Disturbance Rejection in Quadruple-Tank System by Proposing New Method in Reinforcement Learning
Alireza Nezamzadeh - Mohammadreza Esmaeilidehkordi
LPCNet: Lane detection by lane points correction network in challenging environments based on deep learning
Sina BaniasadAzad - Seyed Mohammadreza Mousavi mirkolaei
Hybrid Flow-Rule Placement Method of Proactive and Reactive in SDNs
Mohammadreza Khoobbakht - Mohammadreza Noei - Mohammadreza Parvizimosaed
Performance Evaluation Study of Color Space Selection In Video Based Facial Expression Recognition Using Deep Neural Networks For Sentiment Analysis
Phee Wei Qin - Ervin Gubin Moung - Ali Farzamnia - Farashazillah Yahya - John Julius Danker Khoo - Maisarah Mohd Sufian
A Novel Deformable Registration Method for Cerebral Magnetic Resonance Images
Bahareh Asadpour Dasht Bayaz - Mahdi Saadatmand - Fabrice Wallois
Facial Mask Wearing Condition Detection Using SSD MobileNetV2
Amirhossein Tighkhorshid - Yasamin Borhani - Javad Khoramdel - Esmaeil Najafi
Optimizing MR Image Registration for Accurate Brain Volume Measurement in Children with Autism Spectrum Disorder
Shiva Sanati - Mahdi Saadatmand
AVID: A VARIATIONAL INFERENCE DELIBERATION FOR META-LEARNING
Alireza Javaheri - Arsham Gholamzadeh Khoee - Saeed Reza Kheradpisheh - Hadi Farahani - Mohammad Ganjtabesh
IR-LPR: Large Scale of Iranian License Plate Recognition Dataset
Mahdi Rahmani - Melika Sabaghian - Seyyedeh Mahila Moghadami - Mohammad Mohsen Talaie - Mahdi Naghibi - Mohammad Ali Keyvanrad
more
Samin Hamayesh - Version 42.2.1