0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
MultiPath ViT OCR: A Lightweight Visual Transformer-based License Plate Optical Character Recognition
Authors :
Alireza Azadbakht
1
Saeed Reza Kheradpisheh
2
Hadi Farahani
3
1- Shahid Beheshti University
2- Shahid Beheshti University
3- Shahid Beheshti University
Keywords :
Visual Transformer،Optical Character Recognition (OCR)،License Plate OCR،Persian License Plate OCR
Abstract :
Because of natural conditions of license plates images, the Optical Character Recognition (OCR) of these images is generally a challenging problem, and it is utilized in edge devices with limited computation power. Despite the considerable progress of deep neural networks, the state-of-the-art models are not always a good solution for this problem. Most of the models have a large number of parameters and in practice, they need a lot of resources to train, maintain and implement on edge devices. We propose a lightweight model based on Visual Transformer architecture and we achieve competitive results against traditional CRNN models, due to the lack of a rich and large scale dataset for Persian license plates we gather and annotate 1.3M images of license plates in various natural conditions from a different point of views and different cameras. We call this dataset as LicenseNet. Our proposed model achieves 77.25% accuracy against CNN models with 75.18% accuracy and embedded OCR models in cameras with 60.37% accuracy on the LicenseNet test set. Furthermore, we achieved better accuracy with 3.21 times fewer number of training parameters in comparison to previously proposed models.
Papers List
List of archived papers
Improvement of Credit Scoring by LSTM Autoencoder Model
Milad Sattari Maleki - Seyedeh Niusha Motevallian - Faezehsadat Hosseini - Mohammad Sabokrou - Hamidreza Soltanalizadeh Maleki
Spatio-Temporal Graph Neural Networks for Accurate Crime Prediction
Rojan Roshankar - Mohammad Reza Keyvanpour
Optimizing Question-Answering Framework Through Integration of Text Summarization Model and Third-Generation Generative Pre-Trained Transformer
Ervin Gubin Moung - Toh Sin Tong - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Farashazillah Yahya
Time Series Analysis by Bi-GRU for Forecasting Bitcoin Trends based on Sentiment Analysis
Fatemeh Saadatmand - Mohammad Ali Zare Chahoki
Delay Optimization of a Federated Learning-based UAV-aided IoT network
Hossein Mohammadi Firouzjaei - Javad Zeraatkar Moghaddam - Mehrdad Ardebilipour
Evolutionary Approach to GAN Hyperparameter Tuning: Minimizing Discriminator and Generator Loss Functions
Sajad Haghzad Klidbary - Anahita Babaei - Ramin Ghorbani
A supervised approach using transformer networks for the detection of turning-related anomalies in urban intersections
Mohammad Mahdi HajiAbadi - Manoochehr Nahvi
Trust Management Enhancement for the Internet of Things: a Smart Contract Approach
Amin Rouzbahani - Fattaneh Taghiyareh
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
An Efficient Planning Method for Autonomous Navigation of a Wheeled-Robot based on Deep Reinforcement Learning
Ali Salimi Sadr - Mahdi Shahbazi Khojasteh - Hamed Malek - Armin Salimi-Badr
more
Samin Hamayesh - Version 42.4.1