0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules
Authors :
Narges Semiromizadeh
1
Omid Nejati Manzari
2
Shahriar B. Shokouhi
3
Sattar Mirzakuchaki
4
1- School of Electrical Engineering, Iran University of Science and Technology
2- School of Electrical Engineering, Iran University of Science and Technology
3- School of Electrical Engineering, Iran University of Science and Technology
4- School of Electrical Engineering, Iran University of Science and Technology
Keywords :
Deep Learning،Vehicle recognition،Attention module
Abstract :
Vehicle make and model recognition (VMMR) is a crucial component of the Intelligent Transport System, garnering significant attention in recent years. VMMR has been widely utilized for detecting suspicious vehicles, monitoring urban traffic, and autonomous driving systems. The complexity of VMMR arises from the subtle visual distinctions among vehicle models and the wide variety of classes produced by manufacturers. Convolutional Neural Networks (CNNs), a prominent type of deep learning model, have been extensively employed in various computer vision tasks, including VMMR, yielding remarkable results. As VMMR is a fine-grained classification problem, it primarily faces inter-class similarity and intra-class variation challenges. In this study, we implement an attention module to address these challenges and enhance the model’s focus on critical areas containing distinguishing features. This module, which does not increase the parameters of the original model, generates three-dimensional (3-D) attention weights to refine the feature map. Our proposed model integrates the attention module into two different locations within the middle section of a convolutional model, where the feature maps from these sections offer sufficient information about the input frames without being overly detailed or overly coarse. The performance of our proposed model, along with state-of-the-art (SOTA) convolutional and transformer-based models, was evaluated using the Stanford Cars dataset. Our proposed model achieved the highest accuracy, 90.69%, among the compared models.
Papers List
List of archived papers
Optimization Resource Allocation in NOMA-based Fog Computing with a Hybrid Algorithm
Zohreh Torki - S.Mojtaba Matinkhah
The Effect of Network Environment on Traffic Classification
Abolghasem Rezaei Khesal - Mehdi Teimouri
Performance Evaluation Study of Color Space Selection In Video Based Facial Expression Recognition Using Deep Neural Networks For Sentiment Analysis
Phee Wei Qin - Ervin Gubin Moung - Ali Farzamnia - Farashazillah Yahya - John Julius Danker Khoo - Maisarah Mohd Sufian
A Comprehensive Approach to SMS Spam Filtering Integrating Embedded and Statistical Features
Shaghayegh Hosseinpour - Mohammad Reza Keyvanpour
Designing a High Perfomance and High Profit P2P Energy Trading System Using a Consortium Blockchain Network
Poonia Taheri Makhsoos - Behnam Bahrak - Fattaneh Taghiyareh
Multi-Task Transformer for Stock Market Trend Prediction
Seyed Morteza Mirjebreili - Ata Solouki - Hamidreza Soltanalizadeh - Mohammad Sabokrou
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Farzane Abdoli - Hamid Sheikhzade - Vahid Pourahmadi
A 2D-CNN Architecture for Improving the Classification Accuracy of an Electronic Nose with Different Sensor Positions
Hannaneh Mahdavi - Reza Goldoust - Saeideh Rahbarpour
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
FAHP-OF: A New Method for Load Balancing in RPL-based Internet of Things (IoT)
Mohammad Koosha - Behnam Farzaneh - Emad Alizadeh - Shahin Farzaneh
more
Samin Hamayesh - Version 43.7.0