0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
MultiPath ViT OCR: A Lightweight Visual Transformer-based License Plate Optical Character Recognition
Authors :
Alireza Azadbakht
1
Saeed Reza Kheradpisheh
2
Hadi Farahani
3
1- Shahid Beheshti University
2- Shahid Beheshti University
3- Shahid Beheshti University
Keywords :
Visual Transformer،Optical Character Recognition (OCR)،License Plate OCR،Persian License Plate OCR
Abstract :
Because of natural conditions of license plates images, the Optical Character Recognition (OCR) of these images is generally a challenging problem, and it is utilized in edge devices with limited computation power. Despite the considerable progress of deep neural networks, the state-of-the-art models are not always a good solution for this problem. Most of the models have a large number of parameters and in practice, they need a lot of resources to train, maintain and implement on edge devices. We propose a lightweight model based on Visual Transformer architecture and we achieve competitive results against traditional CRNN models, due to the lack of a rich and large scale dataset for Persian license plates we gather and annotate 1.3M images of license plates in various natural conditions from a different point of views and different cameras. We call this dataset as LicenseNet. Our proposed model achieves 77.25% accuracy against CNN models with 75.18% accuracy and embedded OCR models in cameras with 60.37% accuracy on the LicenseNet test set. Furthermore, we achieved better accuracy with 3.21 times fewer number of training parameters in comparison to previously proposed models.
Papers List
List of archived papers
FedFog: A Serverless and Privacy-Aware Federated Learning Simulator for Edge–Fog Networks
Seyed Vahid Hashemi Nik - Seyed Mohammad Mahdi Asaadi - Somayeh Sobati-M
A parallel CNN-BiGRU network for short-term load forecasting in demand-side management
Arghavan Irankhah - Sahar Rezazadeh Saatlou - Mohammad Hossein Yaghmaee - Sara Ershadi-Nasab - Mohammad Alishahi
A Novel Method For Fake News Detection Based on Propagation Tree
Mansour Davoudi - Mohammad Reza Moosavi - Mohammad Hadi Sadreddini
Deep Learning-Based Malaysian Sign Language (MSL) Recognition: Exploring the Impact of Color Spaces
Ervin Gubin Moung - Precilla Fiona Suwek - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Wei Leong Khong
Novel Insights in Deep Learning for Predicting Climate Phenomena
Mohammad Naisipour - Saghar Ganji - Iraj Saeedpanah - Behnam Mehrakizadeh - Ahmad Reza Labibzadeh
Fine-tuned Generative Adversarial Network-based Model for Medical Image Super-Resolution
Alireza Aghelan - Modjtaba Rouhani
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules
Narges Semiromizadeh - Omid Nejati Manzari - Shahriar B. Shokouhi - Sattar Mirzakuchaki
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
A routing method with the approach of reducing energy consumption in WSNs with the Jellyfish Search (JS) optimizer algorithm and unequal clustering
Ehsan Gholami - Javad Hamidzadeh
Lempel-Ziv-based Hyper-Heuristic Solution for Longest Common Subsequence Problem
Mahdi Nasrollahi - Reza Shami Tanha - Mohsen Hooshmand
more
Samin Hamayesh - Version 43.7.0