0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Authors :
Elham Shabaninia
1
Fatemeh sadat Eslami
2
Ali Afkari Fahandari
3
Hossein Nezamabadi-pour
4
1- Department of Applied mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced Technology, Kerman, Iran
2- Department of Computer Engineering, Sirjan University of Technology, Sirjan, Iran
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
4- Department of Electrical Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
Keywords :
Document Image Analysis،Farsi database،Document Classification،Optical Character Recognition
Abstract :
This paper introduces a new large-scale dataset for Farsi document images, named SUT, which aims to tackle the challenges associated with obtaining diverse and substantial ground-truth data for supervised models in document image analysis (DIA) tasks, like document image classification, text detection and recognition, and information retrieval. The dataset comprises 62,453 images that have been categorized into 21 distinct classes, including identity documents featuring synthetically generated personal information superimposed on various backgrounds. The dataset also includes corresponding files with labeling information for the images. The ground-truth data is organized in CSV files containing image file paths and associated information about the embedded data. To demonstrate the efficacy of the SUT dataset in DIA tasks, it was utilized for document classification (achieving an accuracy of 86% using a convolutional neural network) and OCR (achieving a CER of 0.083 and 0.072 using Tesseract and EasyOCR engines, respectively). The SUT dataset serves as an esteemed asset for scholars engaged in the development and assessment of supervised models in Farsi document image analysis.
Papers List
List of archived papers
A Novel Hybrid Method for Clustering Text Documents using Evolutionary Optimization
Muhammad Naderi - Maryam Amiri
Distinguishing Abstracts of Human-Written and ChatGPT-Generated Papers in the Field of Computer Science
Mohsen Arzani - Hamed Vahdat-Nejad - Matin Hossein-Pour
Dynamic Knowledge Enhanced Neural Fashion Trend Forecasting with Quantile Loss
Fatemeh Rooholamini - Reza Azmi - Mobina Khademhossein - Maral Zarvani
Analysis of Address Lifespans in Bitcoin and Ethereum
Amir Mohammad Karimi Mamaghan - Amin Setayesh - Behnam Bahrak
Graph Attention Networks for Modeling Multi-Sensor Relationships in Early Prediction of Critical Events in ICU Patients
Amir Akhavan Saffar - Danial Eskandari Faruji - Javad Hamidzadeh
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
Probabilistic Short-Term Load Forecasting Using GBDT-Based Sister Forecasts and Ensemble Methods
Hossein Shahinzadeh - Hamed Nafisi - Amirafshin Zamani - Saiedeh Mehrabani-Najafabadi - Arezou Mahmoudi - Farshad Ebrahimi
Introducing E4MT and LMBNC: Persian pre-processing utilities
Zakieh Shakeri - Mehran Ziabary - Behrooz Vedadian - Fatemeh Azadi - Saeed Torabzadeh - Arian Atefi
A supervised approach using transformer networks for the detection of turning-related anomalies in urban intersections
Mohammad Mahdi HajiAbadi - Manoochehr Nahvi
Machine Learning-Driven Prediction of Anti-Alzheimer Drug Efficacy Using PubChem Molecular Fingerprints
Mohammad Javad Sadeghi - Mohammad Javad Nemati - AliAsghar Zare - Mohammadreza Shams
more
Samin Hamayesh - Version 43.7.0