0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Optical Character Recognition Using a Transformer-based Model
Authors :
Fatemeh Asadi Zeydabadi
1
Elham Shabaninia
2
Hossein Nezamabadi-pour
3
Melika Shojaee
4
1- Department of Electrical Engineering, Shahid Bahonar University of Kerman
2- Department of Applied Mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman
4- Department of Computer Engineering, Shahid Bahonar University of Kerman
Keywords :
Optical Character Recognition (OCR)،Deep learning method،transformer،Farsi language
Abstract :
Optical Character Recognition (OCR) techniques have made significant advances in recent years using new technologies such as Transformers for Latin languages. However, research on under-resourced languages, such as Farsi, remains limited. This is partly due to the complex nature of the Farsi script, which poses unique challenges for OCR. Farsi OCR is essential for various applications, such as document management, digital archiving, and automated data entry. This study introduces a transformer-based deep neural network to recognize Farsi words, achieving promising results. Specifically, we evaluate the performance of our method against state-of-the-art techniques on two datasets, Shotor and Sadri, and demonstrate accuracies of 99.75% and 99.23%, respectively. Our results outperform other methods and highlight the potential of transformer-based approaches for Farsi OCR.
Papers List
List of archived papers
Simulation-Based Data Augmentation for Apple Leaf Disease Using Statistical Moments and HSV Color Features
Seyedeh Maryam Moosavi - Morteza Gholipour - Yasser Baleghi
A Vision-Based Method for Human Activity Recognition Using Local Binary Pattern
Babak Goodarzi - Reza Javidan - Mohammad Sadegh Rezaei
Efficient T-Count Fault-tolerant Quantum Clifford+T Multiplexer
Negin Mashayekhi - Shekoofeh Moghimi - Mohammad Reza Reshadinezhad
Multi-source Ensemble Model for Scene Recognition
Amir Hossein Saleknia - Ahmad Ayatollahi
An Attention-Based Model for Clinical Time Series Prediction: Enhancing ICU Readmission Prediction
Hananeh Sadat Madinei - Mohammad Reza Keyvanpour - Seyed Vahab Shojaedini
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
Speech Emotion Recognition Using a Hierarchical Adaptive Weighted Multi-Layer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
Fatemeh Daneshfar - Seyed Jahanshah Kabudian
Evaluating the Impact of Traveling on COVID-19 Prevalence and Predicting the New Confirmed Cases According to the Travel Rate Using Machine Learning: A Case Study in Iran
Anita Ghandehari - Soheil Shirvani - Hadi Moradi
Brain Age Estimation with Twin Vision Transformer using Hippocampus Information Applicable to Alzheimer Dementia Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Amirhossein Ghaemi - Habibollah Danyali - Kamran Kazemi - Alireza Ghaemi
Multi-Layer Collaborative Graph with BPR Similarity Embedding for Recommender System
Mostafa Ghorbani - Azadeh Mansouri
more
Samin Hamayesh - Version 43.7.0