0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
Multi-Digit Handwritten Recognition: A CNN-LSTM Hybrid Approach with Wavelet Transforms
Authors :
Amin Kazempour
1
Jafar Tanha
2
1- University of Tabriz
2- University of Tabriz
Keywords :
Handwritten Digit Recognition،Deep Learning،Convolution Neural Network،Attention Mechanism،Wavelet transforms
Abstract :
Handwritten digit recognition remains a pivotal area in machine learning and computer vision, essential for applications like license plate identification, form processing, and historical document reading. Addressing the challenges of multi-digit and multi-language recognition, including variations in handwriting styles across different languages, we propose a novel model integrating convolutional and recurrent neural networks with an attention mechanism. Unlike conventional methods, our model employs wavelet transforms instead of max pooling to preserve image texture and edges. We created a comprehensive dataset containing both English and Persian digits, featuring 80,000 training and 20,000 test images with 1–5 digit numbers. To demonstrate the superiority of the proposed model, we conducted extensive experiments and compared it to some state-of-the-art models. Our model demonstrated remarkable accuracy, achieving 99.58% for single digits and 98.03% for sequences. Extensive experiments validated the efficacy of our approach, highlighting its potential for future research in multi-digit recognition systems across various languages.
Papers List
List of archived papers
A Novel Hybrid Method for Clustering Text Documents using Evolutionary Optimization
Muhammad Naderi - Maryam Amiri
Prediction of West Texas Intermediate Crude-oil Price Using Hybrid Attention-based Deep Neural Networks: A Comparative Study
Alireza Jahandoost - Mahboobeh Houshmand - Seyyed Abed Hosseini
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
Enhanced Atrial Fibrillation (AF) Detection via Data Augmentation with Diffusion Model
Arash Vashagh - Amirhossein Akhoondkazemi - Sayed Jalal Zahabi - Davood Shafie
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Amir Bidokhti - Shahrokh Ghaemmaghami
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Elham Shabaninia - Fatemeh sadat Eslami - Ali Afkari Fahandari - Hossein Nezamabadi-pour
Leveraging a structure-based and learning-based predictor using various feature groups in bioinformatics (case study: protein-peptide region residue-level interaction)
Shima Shafiee - Abdolhossein Fathi
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
Adaptive Active Queue Management for Time Slot Channel Hopping in Industrial Internet of Things
Mehdi Zirak - Yasser Sedaghat - Mohammad Hossein Yaghmaee Moghaddam
Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm
Zaniar Sharifi - Khabat Soltanian - Ali Amiri
more
Samin Hamayesh - Version 41.7.6