13th International Conference on Computer and Knowledge Engineering
Farsi Optical Character Recognition Using a Transformer-based Model
Authors :
Fatemeh Asadi Zeydabadi (1), Elham Shabaninia (2), Hossein Nezamabadi-pour (3), Melika Shojaee (4)
1- Department of Electrical Engineering, Shahid Bahonar University of Kerman
2- Department of Applied Mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman
4- Department of Computer Engineering, Shahid Bahonar University of Kerman
Keywords :
Optical Character Recognition (OCR), Deep learning method, Transformer, Farsi language
Abstract :
Optical Character Recognition (OCR) for Latin-script languages has advanced significantly in recent years with the adoption of new architectures such as Transformers. However, research on under-resourced languages, such as Farsi, remains limited. This is partly due to the complex nature of the Farsi script, which poses unique challenges for OCR. Farsi OCR is essential for applications such as document management, digital archiving, and automated data entry. This study introduces a transformer-based deep neural network for recognizing Farsi words. We compare our method with state-of-the-art techniques on two datasets, Shotor and Sadri, where it achieves accuracies of 99.75% and 99.23%, respectively. These results exceed those of the compared methods and highlight the potential of transformer-based approaches for Farsi OCR.