0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Farsi Text in Scene: A new dataset
Authors :
Ali Salmasi
1
Ehsanollah Kabir
2
1- Tarbiat Modares University
2- Tarbiat Modares University
Keywords :
Detection،Farsi،FTS،Persian،Dataset،OCR،Scene text
Abstract :
Due to the recent advancements in computer vision and scene understanding, detection & recognition of text within scene images have attracted significant interest from both academia and industry. To develop a reliable and effective text recognition system, having access to a suitable dataset is essential. While many datasets containing English text in scene images are available, the lack of a comparable dataset for Farsi is evident. To lay the groundwork for the advancement of Farsi text detection and recognition systems in the future, a new dataset of Farsi text in scene images is introduced. Images are taken by a smartphone mainly from shop panels and advertising banners in the street. Printed texts on the images are in different fonts, sizes, and colors on diverse and complex backgrounds which give this dataset variety needed for building a robust text detection and recognition system. In total, 1182 images are collected and annotated. To validate our dataset, we trained a deep object detection model and achieved an mAP50 of 46.4% on the test set.
Papers List
List of archived papers
Multi-Layer Collaborative Graph with BPR Similarity Embedding for Recommender System
Mostafa Ghorbani - Azadeh Mansouri
An Effective Connectomics Approach for Diagnosing ADHD using Eyes-open Resting-state MEG
Nastaran Hamedi - Ali Khadem - Sajjad Vardast - Mehdi Delrobaei - Abbas Babajani-Feremi
Capsule Routing over Stacked GCN-GAT Embeddings with Negative Sampling for Graph Link Prediction
Fatemeh Safari Sarvandi - Sayeh Mirzaei - Rooholah Abedian
A Framework for Automated Cardiovascular Magnetic Resonance Image Quality Scoring based on EuroCMR Registry Criteria
Shahabedin Nabavi - Mohsen Ebrahimi Moghaddam - Ahmad Ali Abin - Alejandro Frangi
Deep Learning Feature Extraction for COVID-19 Detection Algorithm using Computerized Tomography Scan
Maisarah Mohd Sufian - Ervin Gubin Moung - Chong Joon Hou - Ali Farzamnia
Time Series Analysis by Bi-GRU for Forecasting Bitcoin Trends based on Sentiment Analysis
Fatemeh Saadatmand - Mohammad Ali Zare Chahoki
Dynamic Hand Gesture Recognition with 2DCNN-LSTM and Improved Keyframe Extraction
Narjes Heidari - Javid Norouzi - Mohammad Sadegh Helfroush - Habibollah Danyal
HiCAP: Hierarchical Clustering-based Attention Pooling for Graph Representation Learning
Parsa Haddadian - Rooholah Abedian - Ali Moeini
A Review on Machine Learning Methods for Workload Prediction in Cloud Computing
Mohammad Yekta - Hadi Shahriar Shahhoseini
Sensitivity Reliability Analysis of Power Distribution Networks Using Fuzzy Logic
Mohammed Wadi - Wisam Elmasry - Ismail Kucuk - Hossein Shahinzadeh
more
Samin Hamayesh - Version 43.7.0