14th International Conference on Computer and Knowledge Engineering
A Comprehensive Dataset of Real-scene Images for Text Detection and Recognition in Persian
Authors :
Iman Souzanchi (1)
Ramin Rahimi (2)
Mohammad Ali Majidi Anvari (3)
Atefeh Baniasadi (4)
Ashkan Sadeghi (5)
Mohammad Reza Mohammadi (6)
1- PART AI Research Center
2- PART AI Research Center
3- PART AI Research Center
4- PART AI Research Center
5- PART AI Research Center
6- School of Computer Engineering, Iran University of Science and Technology
Keywords :
Persian scene text dataset, Scene text recognition, Deep learning
Abstract :
Text extraction from scene images is widely used, owing to the abundance of information in such images and their value in computer vision applications such as self-driving cars, text translation, license plate retrieval, and information extraction from invoices and shopfronts. Nonetheless, the task is challenging because of the varying fonts, styles, sizes, and other characteristics of the text. Although numerous studies apply deep learning models to scene text recognition for languages such as English, a major barrier to applying these models to Persian is the lack of a dataset that is adequate in both quantity and quality. This paper introduces a comprehensive collection of Persian scene images obtained from diverse sources, including newspapers, magazines, books, business cards, road signs, advertising billboards, shopfronts, invoices, and scanned documents. The dataset comprises over 250,000 annotated text lines from 5,000 images, covering various lengths, fonts, and sizes, and prepared under different conditions, including varying brightness and viewing angles. Additionally, because annotating real data is expensive, more than 2,500,000 images of meaningful sentences have been synthesized. To assess the efficacy of the dataset, a scene text recognition model drawn from existing models was trained, achieving a word accuracy of 83.9% on challenging test images.
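The abstract reports word accuracy but does not define it. As a minimal sketch, assuming the common convention that a prediction counts as correct only when the whole recognized string matches the ground-truth annotation exactly, the metric could be computed as follows. The function name `word_accuracy` and the sample strings are illustrative and are not taken from the paper.

```python
def word_accuracy(predictions, references):
    """Fraction of samples whose predicted text equals the reference exactly.

    Assumption: exact string match after trimming surrounding whitespace.
    The paper may normalize text differently before comparing.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    correct = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return correct / len(references)


# Hypothetical recognizer outputs and ground-truth labels.
predicted = ["سلام", "کتاب", "خیابان", "فروشگاه"]
expected = ["سلام", "کتب", "خیابان", "فروشگاه"]
score = word_accuracy(predicted, expected)
print(f"word accuracy: {score:.1%}")
```

Under this definition a single wrong character makes the whole sample incorrect, so the metric is stricter than a character-level error rate.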