0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Traffic Sign Recognition Using Local Vision Transformer
Authors :
Ali Farzipour
1
Omid Nejati Manzari
2
Shahriar B. Shokouhi
3
1- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
2- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
3- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
Keywords :
vision transformer،Deep Learning،Traffic Sign Recognition،self-driving vehicles
Abstract :
Abstract—Recognition of traffic signs is a crucial aspect of self- driving cars and driver assistance systems, and machine vision tasks such as traffic sign recognition have gained significant attention. CNNs have been frequently used in machine vision, but introducing vision transformers has provided an alternative approach to global feature learning. This paper proposes a new novel model that blends the advantages of both convolutional and transformer-based networks for traffic sign recognition. The proposed model includes convolutional blocks for capturing local correlations and transformer-based blocks for learning global dependencies. Additionally, a locality module is incorporated to enhance local perception. The performance of the suggested model is evaluated on the Persian Traffic Sign Dataset and German Traffic Sign Recognition Benchmark and compared with SOTA convolutional and transformer-based models. The experimental evaluations demonstrate that the hybrid network with the locality module outperforms pure transformer-based models and some of the best convolutional networks in accuracy. Specifically, our proposed final model reached 99.66% accuracy in the German traffic sign recognition benchmark and 99.8% in the Persian traffic sign dataset, higher than the best convolutional models. Moreover, it outperforms existing CNNs and ViTs while maintaining fast inference speed. Consequently, the proposed model proves to be significantly faster and more suitable for real-world applications.
Papers List
List of archived papers
Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT
Seyed Mohammad Hossein Hashemi - Leila Safari - Mohsen Hooshmand - Amirhossein Dadashzadeh Taromi
Swin-RSCBNet: A Transformer-Based Network for Skin Cancer Segmentation with Multi-Scale and Attention Modules
Benyamin Mirab Golkhatmi - Mostafa Heydari - Mahboobeh Houshmand - Seyyed Abed Hosseini
Performance Evaluation Study of Color Space Selection In Video Based Facial Expression Recognition Using Deep Neural Networks For Sentiment Analysis
Phee Wei Qin - Ervin Gubin Moung - Ali Farzamnia - Farashazillah Yahya - John Julius Danker Khoo - Maisarah Mohd Sufian
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Elham Shabaninia - Fatemeh sadat Eslami - Ali Afkari Fahandari - Hossein Nezamabadi-pour
Vaccine Distribution Modelling in Pandemics through Multi-Agent Systems: COVID-19 Case
Hossein Yarahmadi - Mohammad Ebrahim Shiri - Hamid Reza Navidi - Arash Sharifi - Moharram Challenger - Hassan Piriaei
Efficient Vision Transformer for Accurate Traffic Sign Detection
Javad Mirzapour Kaleybar - Hooman Khaloo - Avaz Naghipour
Graph-Theoretic Approach and Advanced Data Balancing for Liver Disease Diagnosis Improvement
Soheib Kiani - Sadegh Sulaimany
Deep Learning-based Processing of Autonomous Vehicle Radar Data to Achieve High Resolution
Nima Abdolrahimi Shahamat - Vahideh Moghtadaiee - Esfandiar Mehrshahi
The Internet of Things-Enabled Smart City: An In-Depth Review of Its Domains and Applications
Amir Meydani - Ali Ramezani - Alireza Meidani
Robat-e-Beheshti: A Persian Wake Word Detection Dataset for Robotic Purposes
Parisa Ahmadzadeh Raji - Yasser Shekofteh
more
Samin Hamayesh - Version 43.7.0