0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
MultiPath ViT OCR: A Lightweight Visual Transformer-based License Plate Optical Character Recognition
Authors :
Alireza Azadbakht
1
Saeed Reza Kheradpisheh
2
Hadi Farahani
3
1- Shahid Beheshti University
2- Shahid Beheshti University
3- Shahid Beheshti University
Keywords :
Visual Transformer،Optical Character Recognition (OCR)،License Plate OCR،Persian License Plate OCR
Abstract :
Because of natural conditions of license plates images, the Optical Character Recognition (OCR) of these images is generally a challenging problem, and it is utilized in edge devices with limited computation power. Despite the considerable progress of deep neural networks, the state-of-the-art models are not always a good solution for this problem. Most of the models have a large number of parameters and in practice, they need a lot of resources to train, maintain and implement on edge devices. We propose a lightweight model based on Visual Transformer architecture and we achieve competitive results against traditional CRNN models, due to the lack of a rich and large scale dataset for Persian license plates we gather and annotate 1.3M images of license plates in various natural conditions from a different point of views and different cameras. We call this dataset as LicenseNet. Our proposed model achieves 77.25% accuracy against CNN models with 75.18% accuracy and embedded OCR models in cameras with 60.37% accuracy on the LicenseNet test set. Furthermore, we achieved better accuracy with 3.21 times fewer number of training parameters in comparison to previously proposed models.
Papers List
List of archived papers
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules
Narges Semiromizadeh - Omid Nejati Manzari - Shahriar B. Shokouhi - Sattar Mirzakuchaki
TriFuse-PdM: High-Fidelity Machine Failure Prediction Using Hybrid Resampling and Model Calibration
Saghar Shafaati - Javad Mohammadzadeh
An Adaptive Budget and Deadline-aware Algorithm for Scheduling Workflows Ensemble in IaaS Clouds
Negin Shafinezhad - Hamid Abrishami - Saeid Abrishami
VVC-AAR: Adaptive Attention-Aware Resolution and Residual Coding for Perceptually Optimized Ultra-Low Bitrate VVC Compression
Yaghoub Saberi - Somayeh Arab Najafabadi - Mohammadreza Hemmati
DevRanker: An Effective Approach to Rank Developers for Bug Report Assignment
Mohammad Reza Kardoost - Mohammad Reza Moosavi - Reza Akbari
Distilling Knowledge from CNN-Transformer Models for Enhanced Human Action Recognition
Hamid Ahmadabadi - Omid Nejati Manzari - Ahmad Ayatollahi
Using Deep Learning for Classification of Lung Cancer on CT Images in Ardabil Province
Mohammad Ali Javadzadeh Barzaki - Jafar Abdollahi - Mohammad Negaresh - Maryam Salimi - Hadi Zolfeghari - Mohsen Mohammadi - Asma Salmani - Rona Jannati - Firouz Amani
SASIAF, An Scalable Accelerator For Seismic Imaging on Amazon AWS FPGAs
Mostafa Koraei - S.Omid Fatemi
A Survey on Semi-Automated and Automated Approaches for Video Annotation
Samin Zare - Mehran Yazdi
more
Samin Hamayesh - Version 42.7.0