0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
A Comprehensive Dataset of Real-scene Images for Text Detection and Recognition in Persian
Authors :
Iman Souzanchi
1
Ramin Rahimi
2
Mohammad Ali Majidi Anvari
3
Atefeh Baniasadi
4
Ashkan Sadeghi
5
Mohammad Reza Mohammadi
6
1- PART AI Research Center
2- PART AI Research Center
3- PART AI Research Center
4- PART AI Research Center
5- PART AI Research Center
6- School of Computer Engineering, Iran University of Science and Technology
Keywords :
Persian scene text dataset،Scene text recognition،Deep learning
Abstract :
Extracting text from scene images is a widely utilized field owing to the abundance of information available in scene images and their potential utilization in computer vision applications such as self-driving cars, text translation, information extraction from invoices, shopfronts, license plate retrieval, etc. Nonetheless, this field presents challenges because of the varying fonts, styles, sizes, and other characteristics of the text. Despite the existence of numerous studies on scene text recognition for languages such as English that employ deep learning models, a major barrier to implementing these models in Persian is the lack of an appropriate and sufficient dataset both in terms of quantity and quality. This paper aims to introduce a comprehensive collection of Persian scene images obtained from diverse sources, including newspapers, magazines, books, business cards, road signs, advertising billboards, shopfronts, invoices, and scanned documents. This dataset comprises over 250k of annotated text lines from 5000 images, including various lengths, fonts, and sizes that have been prepared under different conditions, including varying brightness and viewing angles. Additionally, more than 2,500,000 images of meaningful sentences have been synthesized since the annotation of real data is so expensive. In order to assess the efficacy of our dataset, a scene text recognition model was trained from existing models, and a word-accuracy of 83.9% was achieved on challenging test images.
Papers List
List of archived papers
An Attention-Based Model for Clinical Time Series Prediction: Enhancing ICU Readmission Prediction
Hananeh Sadat Madinei - Mohammad Reza Keyvanpour - Seyed Vahab Shojaedini
Optimization Resource Allocation in NOMA-based Fog Computing with a Hybrid Algorithm
Zohreh Torki - S.Mojtaba Matinkhah
Density Estimation Helps Adversarial Robustness
Afsaneh Hasanebrahimi - Bahareh Kaviani Baghbaderani - Reshad Hosseini - Ahmad Kalhor
Brain Age Estimation with Twin Vision Transformer using Hippocampus Information Applicable to Alzheimer Dementia Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Amirhossein Ghaemi - Habibollah Danyali - Kamran Kazemi - Alireza Ghaemi
Detecting Non-Spherical Clusters Using Modified CURE Algorithm
Arezou Safdari - Pedram Salehpour
A Chaotic Crow Search Algorithm for Overlapping Clustering
Mostafa Sabzekar - Seyed Vahid Mousavainejad
Impossible differential and zero-correlatin linear cryptanalysis of Marx, Marx2, Chaskey andSpeck32
Mahshid Saberi - Nasour Bagheri - Sadegh Sadeghi
T-Rank: Graph Data Analytics for Urban Traffic Modeling
Alireza Safarpour - Iman Gholampour - Amirhossain Aghazadeh Fard - Seyed Mohammad Karbasi
Human vs NotebookLM for Educational Podcasts: A Controlled Experiment on Two General Topics
Ali Banihashemi - Amirali Shahriary - Yadollah Yaghoobzadeh
Robust Distributed Learning over Heterogeneous Adaptive Networks based on Federated BSP Model
Fatemeh Barani - MohammadHafez Yari - Abdorreza Savadi - Hadi Sadoghi Yazdi
more
Samin Hamayesh - Version 43.7.0