0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Optical Character Recognition Using a Transformer-based Model
Authors :
Fatemeh Asadi Zeydabadi
1
Elham Shabaninia
2
Hossein Nezamabadi-pour
3
Melika Shojaee
4
1- Department of Electrical Engineering, Shahid Bahonar University of Kerman
2- Department of Applied Mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman
4- Department of Computer Engineering, Shahid Bahonar University of Kerman
Keywords :
Optical Character Recognition (OCR)،Deep learning method،transformer،Farsi language
Abstract :
Optical Character Recognition (OCR) techniques have made significant advances in recent years using new technologies such as Transformers for Latin languages. However, research on under-resourced languages, such as Farsi, remains limited. This is partly due to the complex nature of the Farsi script, which poses unique challenges for OCR. Farsi OCR is essential for various applications, such as document management, digital archiving, and automated data entry. This study introduces a transformer-based deep neural network to recognize Farsi words, achieving promising results. Specifically, we evaluate the performance of our method against state-of-the-art techniques on two datasets, Shotor and Sadri, and demonstrate accuracies of 99.75% and 99.23%, respectively. Our results outperform other methods and highlight the potential of transformer-based approaches for Farsi OCR.
Papers List
List of archived papers
Improving Machine Learning Classification of Heart Disease Using the Graph-Based Techniques
Abolfazl Dibaji - Sadegh Sulaimany
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Sensitivity Reliability Analysis of Power Distribution Networks Using Fuzzy Logic
Mohammed Wadi - Wisam Elmasry - Ismail Kucuk - Hossein Shahinzadeh
Improvement of Credit Scoring by LSTM Autoencoder Model
Milad Sattari Maleki - Seyedeh Niusha Motevallian - Faezehsadat Hosseini - Mohammad Sabokrou - Hamidreza Soltanalizadeh Maleki
Islamic Geometric algorithms: A survey
Elham Akbari - Azam Bastanfard
An Exploratory Study of the Relationship between SATD and Other Software Development Activities
Shima Esfandiari - Ashkan Sami
SAT Based Analogy Evaluation Framework For Persian Word Embeddings
Seyed Ehsan Mahmoudi - Mehrnoush Shamsfard
A Smart Electrochemical Biosensor for Arsenic Detection in Water
Keyvan Asefpour Vakilian
FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data
Rasoul Jafari Gohari - Laya Aliahmadipour - Ezat Valipour
Introducing E4MT and LMBNC: Persian pre-processing utilities
Zakieh Shakeri - Mehran Ziabary - Behrooz Vedadian - Fatemeh Azadi - Saeed Torabzadeh - Arian Atefi
more
Samin Hamayesh - Version 41.7.6