0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules
Authors :
Narges Semiromizadeh
1
Omid Nejati Manzari
2
Shahriar B. Shokouhi
3
Sattar Mirzakuchaki
4
1- School of Electrical Engineering, Iran University of Science and Technology
2- School of Electrical Engineering, Iran University of Science and Technology
3- School of Electrical Engineering, Iran University of Science and Technology
4- School of Electrical Engineering, Iran University of Science and Technology
Keywords :
Deep Learning،Vehicle recognition،Attention module
Abstract :
Vehicle make and model recognition (VMMR) is a crucial component of the Intelligent Transport System, garnering significant attention in recent years. VMMR has been widely utilized for detecting suspicious vehicles, monitoring urban traffic, and autonomous driving systems. The complexity of VMMR arises from the subtle visual distinctions among vehicle models and the wide variety of classes produced by manufacturers. Convolutional Neural Networks (CNNs), a prominent type of deep learning model, have been extensively employed in various computer vision tasks, including VMMR, yielding remarkable results. As VMMR is a fine-grained classification problem, it primarily faces inter-class similarity and intra-class variation challenges. In this study, we implement an attention module to address these challenges and enhance the model’s focus on critical areas containing distinguishing features. This module, which does not increase the parameters of the original model, generates three-dimensional (3-D) attention weights to refine the feature map. Our proposed model integrates the attention module into two different locations within the middle section of a convolutional model, where the feature maps from these sections offer sufficient information about the input frames without being overly detailed or overly coarse. The performance of our proposed model, along with state-of-the-art (SOTA) convolutional and transformer-based models, was evaluated using the Stanford Cars dataset. Our proposed model achieved the highest accuracy, 90.69%, among the compared models.
Papers List
List of archived papers
Paddy Plant Stress Identification Using Few-Shot Learning Framework
Ervin Gubin Moung - Pavindrah Naidu a/l Narayanasamy Naiidu - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Lorita Angeline
An Analysis of Botnet Detection Using Graph Neural Network
Faezeh Alizadeh - Mohammad Khansari
Predicting cascading failure with machine learning methods in the interdependent networks
Mohamad Hossein Maghsoodi - Mohamad Khansari
Age Estimation Based on Facial Images Using Hybrid Features and Particle Swarm Optimization
NILOUFAR MEHRABI - SAYED PEDRAM HAERI BOROUJENI
Camouflage Object Segmentation with Attention-Guided Pix2Pix and Boundary Awareness
Erfan Akbarnezhad Sany - Fatemeh Naserizadeh - Parsa Sinichi - Seyyed Abed Hosseini
A large input-space-margin approach for adversarial training
Reihaneh Nikouei - Mohammad Taheri
A Deep CNN Model Based Ensemble Approach for Semantic and Instance Segmentation of Indoor Environment
Sajad Rezaei - Jafar Tanha - Zahra Jafari - SeyedEhsan Roshan - Mohammad-Amin Memar Kochebagh
Improved TrustChain for Lightweight Devices
Seyed Salar Ghazi - Haleh Amintoosi
A Review on Secure Data Storage and Data Sharing Technics in Blockchain-based IoT Healthcare Systems
Seyedeh Somayeh Fatemi Nasab - Davoud Bahrepour - Seyed Reza Kamel Tabbakh
Stock market prediction using multi-objective optimization
Mahshid Zolfaghari - Hamid Fadishei - Mohsen Tajgardan - Reza Khoshkangini
more
Samin Hamayesh - Version 41.7.6