13th International Conference on Computer and Knowledge Engineering
Multi Model CNN Based Gas Meter Characters Recognition
Authors :
Sanaz Tarhib (1), Jafar Tanha (2), Soodabeh Imanzadeh (3), Sahar Hassanzadeh Mostafaei (4)
1- Faculty of Electrical and Computer Engineering, University of Tabriz
2- Faculty of Electrical and Computer Engineering, University of Tabriz
3- Faculty of Electrical and Computer Engineering, University of Tabriz
4- Faculty of Electrical and Computer Engineering, University of Tabriz
Keywords :
Recognition, Detection, Convolutional Neural Network, Long Short-Term Memory, Gated Recurrent Unit
Abstract :
Recognizing and extracting text from natural scene images is a challenging task in computer vision. Convolutional neural networks (CNNs) are effective at recognizing characters and words in images because they capture the structural patterns of characters and words, which makes them well suited to recognition problems such as text recognition in natural scene images. In this study, we recognize numerical text in images using three kinds of models: CNN models, combined CNN and long short-term memory (CNN-LSTM) models, and combined CNN and gated recurrent unit (CNN-GRU) models. The dataset consists of gas meter images that our team collected using different phones at different times. The CNN, CNN-LSTM, and CNN-GRU models achieve accuracies of 72.9%, 96.6%, and 97.63%, respectively. These findings suggest that the CNN-LSTM and CNN-GRU models are highly effective at recognizing numerical text in images, with the CNN-GRU model achieving the highest accuracy. Overall, the results demonstrate the potential of deep learning models for recognizing numerical text in images, particularly the combination of a CNN with a GRU.
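The abstract reports a single accuracy figure per model without stating how it is computed. As a rough illustration only, the sketch below shows two common ways such a figure could be measured for meter readings: exact match over the whole digit string, and position-by-position character accuracy. The function names and the sample readings are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def exact_match_accuracy(predicted: Sequence[str], actual: Sequence[str]) -> float:
    """Fraction of readings whose entire predicted digit string is correct."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        return 0.0
    hits = sum(p == a for p, a in zip(predicted, actual))
    return hits / len(actual)


def character_accuracy(predicted: Sequence[str], actual: Sequence[str]) -> float:
    """Fraction of character positions that match when compared index by index.

    Each reading contributes max(len(prediction), len(truth)) positions,
    so missing or extra characters count as errors.
    """
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    total = sum(max(len(p), len(a)) for p, a in zip(predicted, actual))
    if total == 0:
        return 0.0
    correct = sum(
        pc == ac
        for p, a in zip(predicted, actual)
        for pc, ac in zip(p, a)
    )
    return correct / total


# Hypothetical meter readings, for illustration only.
sample_predicted = ["00123", "04567", "9981"]
sample_actual = ["00123", "04561", "99810"]

print(exact_match_accuracy(sample_predicted, sample_actual))
print(character_accuracy(sample_predicted, sample_actual))
```

The two measures can differ noticeably: a single wrong digit fails the whole reading under exact match but costs only one position under character accuracy, so a reported accuracy is easier to compare across studies when the metric is named.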