0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
MultiPath ViT OCR: A Lightweight Visual Transformer-based License Plate Optical Character Recognition
Authors :
Alireza Azadbakht
1
Saeed Reza Kheradpisheh
2
Hadi Farahani
3
1- Shahid Beheshti University
2- Shahid Beheshti University
3- Shahid Beheshti University
Keywords :
Visual Transformer،Optical Character Recognition (OCR)،License Plate OCR،Persian License Plate OCR
Abstract :
Because of natural conditions of license plates images, the Optical Character Recognition (OCR) of these images is generally a challenging problem, and it is utilized in edge devices with limited computation power. Despite the considerable progress of deep neural networks, the state-of-the-art models are not always a good solution for this problem. Most of the models have a large number of parameters and in practice, they need a lot of resources to train, maintain and implement on edge devices. We propose a lightweight model based on Visual Transformer architecture and we achieve competitive results against traditional CRNN models, due to the lack of a rich and large scale dataset for Persian license plates we gather and annotate 1.3M images of license plates in various natural conditions from a different point of views and different cameras. We call this dataset as LicenseNet. Our proposed model achieves 77.25% accuracy against CNN models with 75.18% accuracy and embedded OCR models in cameras with 60.37% accuracy on the LicenseNet test set. Furthermore, we achieved better accuracy with 3.21 times fewer number of training parameters in comparison to previously proposed models.
Papers List
List of archived papers
Learning to Classify Messier Astronomical Objects with Limited Data: A Few-Shot Learning Approach
AMIRREZA ROUHBAKHSHMEGHRAZI - Shayan Nalbandian - Ghazal Alizadeh - Sheida Shadman - Shuyuan Yang - Bo Li
Data-Optimized Dry Rock Property Prediction Using Ensemble and Kernel-Based ML Methods
Esmael Makarian - Hassanreza Ghasemitabar - Alireza Behinrad - Mahdi Fathi - Andisheh Alimoradi - Ayub Elyasi
Instance Selection from Skewed Class Distributions by Using the multi-objective optimizer
Mona Moradi - Javad Hamidzadeh
Virus-Antiviral Prediction Using Machine and Deep Learning Methods
Shayan Majidifar - Fatemeh Nasiri - Mohsen Hooshmand
Time Series Analysis by Bi-GRU for Forecasting Bitcoin Trends based on Sentiment Analysis
Fatemeh Saadatmand - Mohammad Ali Zare Chahoki
Characterizing Microsatellite Distribution Patterns Across Distinct Gene Categories in Human
Elahe Mehrazin - Mahmoud Naghibzadeh - Sara Jamali
Machine Learning-Driven Prediction of Anti-Alzheimer Drug Efficacy Using PubChem Molecular Fingerprints
Mohammad Javad Sadeghi - Mohammad Javad Nemati - AliAsghar Zare - Mohammadreza Shams
Solving the influence maximization problem by using entropy and weight of edges
Farzaneh Kazemzadeh - Amir Karian - Mitra Mirzarezaee - Ali Asghar Safaei
Multi-Layered Defense Against Modern Phishing: A Dual-Sandbox and CDR Approach
Mahdi Seyfipoor - Mohammad Mahdi Eskandari
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
more
Samin Hamayesh - Version 43.7.0