0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Optical Character Recognition Using a Transformer-based Model
Authors :
Fatemeh Asadi Zeydabadi
1
Elham Shabaninia
2
Hossein Nezamabadi-pour
3
Melika Shojaee
4
1- Department of Electrical Engineering, Shahid Bahonar University of Kerman
2- Department of Applied Mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman
4- Department of Computer Engineering, Shahid Bahonar University of Kerman
Keywords :
Optical Character Recognition (OCR)،Deep learning method،transformer،Farsi language
Abstract :
Optical Character Recognition (OCR) techniques have made significant advances in recent years using new technologies such as Transformers for Latin languages. However, research on under-resourced languages, such as Farsi, remains limited. This is partly due to the complex nature of the Farsi script, which poses unique challenges for OCR. Farsi OCR is essential for various applications, such as document management, digital archiving, and automated data entry. This study introduces a transformer-based deep neural network to recognize Farsi words, achieving promising results. Specifically, we evaluate the performance of our method against state-of-the-art techniques on two datasets, Shotor and Sadri, and demonstrate accuracies of 99.75% and 99.23%, respectively. Our results outperform other methods and highlight the potential of transformer-based approaches for Farsi OCR.
Papers List
List of archived papers
Lempel-Ziv-based Hyper-Heuristic Solution for Longest Common Subsequence Problem
Mahdi Nasrollahi - Reza Shami Tanha - Mohsen Hooshmand
A Cost-Sensitive Genetic Algorithm for Customer Segmentation in Auto Insurances
Alireza Khajenoori - Mohammad Saniee Abadeh - Mohsen Mohammadzadeh
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer
Javad Mirzapour Kaleybar - Hooman Saadat - Hooman Khaloo
An Effective Connectomics Approach for Diagnosing ADHD using Eyes-open Resting-state MEG
Nastaran Hamedi - Ali Khadem - Sajjad Vardast - Mehdi Delrobaei - Abbas Babajani-Feremi
Low-Cost and Hardware Efficient Implementation of Pooling Layers for Stochastic CNN Accelerators
Mobin Vaziri - Hadi Jahanirad
Towards Efficient Video Object Detection on Embedded Devices
Mohammad Hajizadeh - Adel Rahmani - Mohammad Sabokrou
An intelligent linguistic error detection approach to automated diagnosis of Dyslexia disorder in Persian speaking children
Fatemeh Asghari - Mahsa Khorasani - Mohsen Kahani - Seyed Amir Amin Yazdi - Mahdi Arkhodi Ghalenoei
Multi Model CNN Based Gas Meter Characters Recognition
Sanaz Tarhib - Jafar Tanha - Soodabeh Imanzadeh - Sahar Hassanzadeh Mostafaei
AgeNet-AT: An End-to-End Model for Robust Joint Speaker Age Estimation and Gender Recognition Based on Attention Mechanism and Titanet
Mahsa Zamani Tarashandeh - Amirhossein Torkanloo - Mohammad Hossein Moattar
more
Samin Hamayesh - Version 42.4.1