0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Authors :
Elham Shabaninia
1
Fatemeh sadat Eslami
2
Ali Afkari Fahandari
3
Hossein Nezamabadi-pour
4
1- Department of Applied mathematics, Faculty of Sciences and Modern Technologies, Graduate University of Advanced Technology, Kerman, Iran
2- Department of Computer Engineering, Sirjan University of Technology, Sirjan, Iran
3- Department of Electrical Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
4- Department of Electrical Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
Keywords :
Document Image Analysis،Farsi database،Document Classification،Optical Character Recognition
Abstract :
This paper introduces a new large-scale dataset for Farsi document images, named SUT, which aims to tackle the challenges associated with obtaining diverse and substantial ground-truth data for supervised models in document image analysis (DIA) tasks, like document image classification, text detection and recognition, and information retrieval. The dataset comprises 62,453 images that have been categorized into 21 distinct classes, including identity documents featuring synthetically generated personal information superimposed on various backgrounds. The dataset also includes corresponding files with labeling information for the images. The ground-truth data is organized in CSV files containing image file paths and associated information about the embedded data. To demonstrate the efficacy of the SUT dataset in DIA tasks, it was utilized for document classification (achieving an accuracy of 86% using a convolutional neural network) and OCR (achieving a CER of 0.083 and 0.072 using Tesseract and EasyOCR engines, respectively). The SUT dataset serves as an esteemed asset for scholars engaged in the development and assessment of supervised models in Farsi document image analysis.
Papers List
List of archived papers
Farsi Text in Scene: A new dataset
Ali Salmasi - Ehsanollah Kabir
I-ACS: An Improved Ant Colony System to Solve the Time-Dependent Orienteering Problem
Zahra Bakhshandeh - Morteza Keshtkaran
UAV-based Firefighting by Multi-agent Reinforcement Learning
Reza Shami Tanha - Mohsen Hooshmand - Mohsen Afsharchi
FAST: FPGA Acceleration of Neural Networks Training
Alireza Borhani - Mohammad Hossein Goharinejad - Hamid Reza Zarandi
Design and Simulation of a Low PDP Full Adder by Combining Majority Function and TGDI Technique in CNTFET Technology
Mahsa Mohammadi
Enhanced Skin Cancer Classification Using Deep Learning and Gradient Boosting Techniques
Amir Mohammad Sharafaddini - Najme Mansouri
Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer
Javad Mirzapour Kaleybar - Hooman Saadat - Hooman Khaloo
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
HiCAP: Hierarchical Clustering-based Attention Pooling for Graph Representation Learning
Parsa Haddadian - Rooholah Abedian - Ali Moeini
Evolutionary Approach to GAN Hyperparameter Tuning: Minimizing Discriminator and Generator Loss Functions
Sajad Haghzad Klidbary - Anahita Babaei - Ramin Ghorbani
more
Samin Hamayesh - Version 42.2.1