14th International Conference on Computer and Knowledge Engineering
Multi-Digit Handwritten Recognition: A CNN-LSTM Hybrid Approach with Wavelet Transforms
Authors :
Amin Kazempour (1), Jafar Tanha (2)
1- University of Tabriz
2- University of Tabriz
Keywords :
Handwritten Digit Recognition, Deep Learning, Convolutional Neural Network, Attention Mechanism, Wavelet Transforms
Abstract :
Handwritten digit recognition remains a central problem in machine learning and computer vision, with applications such as license plate identification, form processing, and reading historical documents. To address the challenges of multi-digit and multi-language recognition, including variation in handwriting styles across languages, we propose a model that combines convolutional and recurrent neural networks with an attention mechanism. Unlike conventional methods, our model uses wavelet transforms in place of max pooling to preserve image texture and edges. We created a dataset of English and Persian digits comprising 80,000 training and 20,000 test images, each showing a number of 1–5 digits. In extensive experiments against several state-of-the-art models, our model achieved 99.58% accuracy on single digits and 98.03% on sequences. These results support the efficacy of the approach and its potential for future research on multi-digit recognition across languages.
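To illustrate the idea of replacing max pooling with a wavelet transform, the following is a minimal NumPy sketch. The abstract does not state which wavelet family is used or which subbands are kept, so the choice of a single-level Haar transform, the stacking of all four subbands as channels, and the function names `haar_dwt2`, `wavelet_pool`, and `max_pool` are assumptions made for illustration only, not the authors' implementation.

```python
import numpy as np

def haar_dwt2(x):
    """Single-level 2D Haar transform of a 2D array with even height and width.

    Assumption: Haar is used here only because it is the simplest wavelet;
    the abstract does not name the wavelet family.
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-pass approximation
    lh = (a - b + c - d) / 2.0  # difference across columns
    hl = (a + b - c - d) / 2.0  # difference across rows
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh

def wavelet_pool(x):
    """Halve the spatial size and keep all four subbands as channels (hypothetical design)."""
    return np.stack(haar_dwt2(x), axis=0)

def max_pool(x):
    """Ordinary 2x2 max pooling with stride 2, for comparison."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

if __name__ == "__main__":
    x = np.arange(16, dtype=float).reshape(4, 4)
    pooled = wavelet_pool(x)
    print(pooled.shape)       # four subbands, each half the input size
    print(max_pool(x).shape)  # one map, half the input size
```

Both operations halve the spatial resolution, but max pooling keeps one value per 2x2 block, while the Haar transform with this normalization is orthogonal, so the four subbands together retain all the information in the input, including the edge and texture detail carried by the three difference subbands.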