0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Optical Character Recognition Using a Transformer-based Model
Authors :
Fatemeh Asadi Zeydabadi
1
Elham Shabaninia
2
Hossein Nezamabadi-pour
3
Melika Shojaee
4
1- Department of Electrical Engineering, Shahid Bahonar University of Kerman
2- Department of Applied Mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman
4- Department of Computer Engineering, Shahid Bahonar University of Kerman
Keywords :
Optical Character Recognition (OCR)،Deep learning method،transformer،Farsi language
Abstract :
Optical Character Recognition (OCR) techniques have made significant advances in recent years using new technologies such as Transformers for Latin languages. However, research on under-resourced languages, such as Farsi, remains limited. This is partly due to the complex nature of the Farsi script, which poses unique challenges for OCR. Farsi OCR is essential for various applications, such as document management, digital archiving, and automated data entry. This study introduces a transformer-based deep neural network to recognize Farsi words, achieving promising results. Specifically, we evaluate the performance of our method against state-of-the-art techniques on two datasets, Shotor and Sadri, and demonstrate accuracies of 99.75% and 99.23%, respectively. Our results outperform other methods and highlight the potential of transformer-based approaches for Farsi OCR.
Papers List
List of archived papers
Histopathology Image-Based Cancer Classification Utilizing Transfer Learning Approach
Amir Meydani - Alireza Meidani - Ali Ramezani - Maryam Shabani - Mohammad Mehdi Kazeminasab - Shahriar Shahablavasani
Improving Soft Error Reliability of FPGA-based Deep Neural Networks with Reduced Approximate TMR
Anahita Hosseinkhani - Behnam Ghavami
Soccer Video Event Detection Using Metric Learning
Ali Karimi - Ramin Toosi - Mohammad Ali Akhaee
Deep Learning-Based Malaysian Sign Language (MSL) Recognition: Exploring the Impact of Color Spaces
Ervin Gubin Moung - Precilla Fiona Suwek - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Wei Leong Khong
A Hybrid Echo State Network for Hypercomplex Pattern Recognition, Classification, and Big Data Analysis
Mohammad Jamshidi - Fatemeh Daneshfar
Segmentation of Coronary Artery Stenosis in X-ray Angiography using Mamba Models
Fatemeh Fouladi - Ali Rostami - Hedieh Sajedi
ExaAEC: A New Multi-label Emotion Classification Corpus in Arabic Tweets
Saeed Sarbazi-Azad - Ahmad Akbari - Mohsen Khazeni
Compressing Deep Neural Networks Using Explainable AI
Kimia Soroush - Mohsen Raji - Behnam Ghavami
Dynamic Hand Gesture Recognition with 2DCNN-LSTM and Improved Keyframe Extraction
Narjes Heidari - Javid Norouzi - Mohammad Sadegh Helfroush - Habibollah Danyal
Improving ADHD Detection with Cost-Sensitive LightGBM
Behnam Yousefimehr - Mehdi Ghatee - Ali Heydari
more
Samin Hamayesh - Version 41.5.3