0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Traffic Sign Recognition Using Local Vision Transformer
Authors :
Ali Farzipour
1
Omid Nejati Manzari
2
Shahriar B. Shokouhi
3
1- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
2- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
3- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
Keywords :
vision transformer،Deep Learning،Traffic Sign Recognition،self-driving vehicles
Abstract :
Abstract—Recognition of traffic signs is a crucial aspect of self- driving cars and driver assistance systems, and machine vision tasks such as traffic sign recognition have gained significant attention. CNNs have been frequently used in machine vision, but introducing vision transformers has provided an alternative approach to global feature learning. This paper proposes a new novel model that blends the advantages of both convolutional and transformer-based networks for traffic sign recognition. The proposed model includes convolutional blocks for capturing local correlations and transformer-based blocks for learning global dependencies. Additionally, a locality module is incorporated to enhance local perception. The performance of the suggested model is evaluated on the Persian Traffic Sign Dataset and German Traffic Sign Recognition Benchmark and compared with SOTA convolutional and transformer-based models. The experimental evaluations demonstrate that the hybrid network with the locality module outperforms pure transformer-based models and some of the best convolutional networks in accuracy. Specifically, our proposed final model reached 99.66% accuracy in the German traffic sign recognition benchmark and 99.8% in the Persian traffic sign dataset, higher than the best convolutional models. Moreover, it outperforms existing CNNs and ViTs while maintaining fast inference speed. Consequently, the proposed model proves to be significantly faster and more suitable for real-world applications.
Papers List
List of archived papers
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Amir Bidokhti - Shahrokh Ghaemmaghami
Adaptive Ensemble Learning for Software Defect Prediction: A Dynamic Weighted Hybrid Model Using SVM, DT, and ANFIS-PSO
Mohsen EsfandyariDoulabi - Amin Esfandiyari Doulabi - Javad Khaligh
Word-level Persian Lipreading Dataset
Javad Peymanfard - Ali Lashini - Samin Heydarian - Hossein Zeinali - Nasser Mozayani
The process of multi class fake news dataset generation
Sajjad Rezaei - Mohsen Kahani - Behshid Behkamal
Classification of Audio Streaming in Network Traffic Based on Machine Learning Methods
Mohammad Nikbakht - Mehdi Teimouri
SCDS: A Secure Clustering Protocol Using Dempster-Shafer Theory for VANET in Smart City
Hoda Mosadegh - Nazbanoo Farzaneh
Persian Legal Text Simplification Leveraging Transformer-Based Models
Mohammadreza Joneidi Jafari - Saedeh Tahery - Amirhossein Nikoofard
Low-Cost and Hardware Efficient Implementation of Pooling Layers for Stochastic CNN Accelerators
Mobin Vaziri - Hadi Jahanirad
An Efficient Approach for Breast Abnormality Detection through High-Level Features of Thermography Images
Farhad Abedinzadeh Torghabeh - Yeganeh Modaresnia - Seyyed Abed Hosseini
Improvement of Credit Scoring by LSTM Autoencoder Model
Milad Sattari Maleki - Seyedeh Niusha Motevallian - Faezehsadat Hosseini - Mohammad Sabokrou - Hamidreza Soltanalizadeh Maleki
more
Samin Hamayesh - Version 43.7.0