0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Text in Scene: A new dataset
Authors :
Ali Salmasi
1
Ehsanollah Kabir
2
1- Tarbiat Modares University
2- Tarbiat Modares University
Keywords :
Detection،Farsi،FTS،Persian،Dataset،OCR،Scene text
Abstract :
Due to the recent advancements in computer vision and scene understanding, detection & recognition of text within scene images have attracted significant interest from both academia and industry. To develop a reliable and effective text recognition system, having access to a suitable dataset is essential. While many datasets containing English text in scene images are available, the lack of a comparable dataset for Farsi is evident. To lay the groundwork for the advancement of Farsi text detection and recognition systems in the future, a new dataset of Farsi text in scene images is introduced. Images are taken by a smartphone mainly from shop panels and advertising banners in the street. Printed texts on the images are in different fonts, sizes, and colors on diverse and complex backgrounds which give this dataset variety needed for building a robust text detection and recognition system. In total, 1182 images are collected and annotated. To validate our dataset, we trained a deep object detection model and achieved an mAP50 of 46.4% on the test set.
Papers List
List of archived papers
A Synergistic Hybrid Architecture with Residual Attention and Mixture-of-Experts for Robust Hour-Ahead Forex Forecasting
Alireza Abbaszadeh - Seyyed Abed Hosseini - Mohammad Reza Akbarzadeh Totonchi
Energy-Aware Dynamic Digital Twin Placement in Mobile Edge Computing
Mahdi Hematyar - Zeinab Movahedi
Cardiology Disease Diagnosis by Analyzing Histological Microscopic Images Using Deep Learning
Maria Salehpanah - Jafar Tanha - Zahra Jafari - SeyedEhsan Roshan - Sajad Rezaei
Optimizing Foreign Exchange Trading Performance Through Reinforcement Machine Learning Framework
Ervin Gubin Moung - Hani Yasmin Binti Murnizam - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Lorita Angeline
Towards Efficient Video Object Detection on Embedded Devices
Mohammad Hajizadeh - Adel Rahmani - Mohammad Sabokrou
CSI-Based Human Activity Recognition using Convolutional Neural Networks
Parisa Fard Moshiri - Mohammad Nabati - Reza Shahbazian - Seyed Ali Ghorashi
Attentional Bi-LSTM for Multivariate Time Series Forecasting on Edge Devices: A Case Study on NanoPi Neo Plus2
Navid Hajizadeh - Saeed Yazdani - Sara Ershadi-Nasab
Leveraging a structure-based and learning-based predictor using various feature groups in bioinformatics (case study: protein-peptide region residue-level interaction)
Shima Shafiee - Abdolhossein Fathi
A Comparative Analysis of Clinical Note Categories for Mortality Prediction in ICU Patients
Maryam Karrabi - Mohsen Kahani - Mina Afzali - Nadieh Armin
An Efficient Planning Method for Autonomous Navigation of a Wheeled-Robot based on Deep Reinforcement Learning
Ali Salimi Sadr - Mahdi Shahbazi Khojasteh - Hamed Malek - Armin Salimi-Badr
more
Samin Hamayesh - Version 43.7.0