0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Text in Scene: A new dataset
Authors :
Ali Salmasi
1
Ehsanollah Kabir
2
1- Tarbiat Modares University
2- Tarbiat Modares University
Keywords :
Detection،Farsi،FTS،Persian،Dataset،OCR،Scene text
Abstract :
Due to the recent advancements in computer vision and scene understanding, detection & recognition of text within scene images have attracted significant interest from both academia and industry. To develop a reliable and effective text recognition system, having access to a suitable dataset is essential. While many datasets containing English text in scene images are available, the lack of a comparable dataset for Farsi is evident. To lay the groundwork for the advancement of Farsi text detection and recognition systems in the future, a new dataset of Farsi text in scene images is introduced. Images are taken by a smartphone mainly from shop panels and advertising banners in the street. Printed texts on the images are in different fonts, sizes, and colors on diverse and complex backgrounds which give this dataset variety needed for building a robust text detection and recognition system. In total, 1182 images are collected and annotated. To validate our dataset, we trained a deep object detection model and achieved an mAP50 of 46.4% on the test set.
Papers List
List of archived papers
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks
Mehrdad Mohammadian - Neda Maleki - Tobias Olsson - Fredrik Ahlgren
Prediction of West Texas Intermediate Crude-oil Price Using Hybrid Attention-based Deep Neural Networks: A Comparative Study
Alireza Jahandoost - Mahboobeh Houshmand - Seyyed Abed Hosseini
Enhanced Atrial Fibrillation (AF) Detection via Data Augmentation with Diffusion Model
Arash Vashagh - Amirhossein Akhoondkazemi - Sayed Jalal Zahabi - Davood Shafie
Towards Efficient Video Object Detection on Embedded Devices
Mohammad Hajizadeh - Adel Rahmani - Mohammad Sabokrou
A Survey on Semi-Automated and Automated Approaches for Video Annotation
Samin Zare - Mehran Yazdi
Leveraging the Power of Object Detection Models in Identifying Litter for a Significant Reduction in Environmental Pollution
Lim Zhen Xian - Ervin Gubin Moung - Jason Teo Tze Wi - Nordin Saad - Farashazillah Yahya - Tiong Lin Rui - Ali Farzamnia
Energy Efficient Power Allocation in MIMO-NOMA Systems with ZF Receiver Beamforming in Multiple Clusters
Mahdi Nangir - Abdolrasoul Sakhaei Gharagezlou - Nima Imani
A Survey of the AVOA Metaheuristic Algorithm and its Suitability for Power System Optimization and Damping Controller Design
Aliyu Sabo - Theophilus Ebuka Odoh - Samuel Habu - Hossien Shahinzadeh - Farshad Ebrahimi
An Exploratory Study of the Relationship between SATD and Other Software Development Activities
Shima Esfandiari - Ashkan Sami
Improving Machine Learning Classification of Heart Disease Using the Graph-Based Techniques
Abolfazl Dibaji - Sadegh Sulaimany
more
Samin Hamayesh - Version 42.2.1