14th International Conference on Computer and Knowledge Engineering
Multi-Digit Handwritten Recognition: A CNN-LSTM Hybrid Approach with Wavelet Transforms
Authors :
Amin Kazempour (1), Jafar Tanha (2)
1- University of Tabriz
2- University of Tabriz
Keywords :
Handwritten Digit Recognition, Deep Learning, Convolutional Neural Network, Attention Mechanism, Wavelet Transforms
Abstract :
Handwritten digit recognition remains a central problem in machine learning and computer vision, with applications such as license plate identification, form processing, and historical document reading. To address the challenges of multi-digit and multi-language recognition, including variation in handwriting styles across languages, we propose a novel model that integrates convolutional and recurrent neural networks with an attention mechanism. Unlike conventional methods, our model replaces max pooling with wavelet transforms to preserve image texture and edges. We created a dataset of English and Persian digits containing 80,000 training and 20,000 test images of 1–5 digit numbers. In extensive experiments, we compared the proposed model with several state-of-the-art models. It achieved 99.58% accuracy on single digits and 98.03% on digit sequences. These results support the efficacy of the approach and highlight its potential for future research on multi-digit recognition systems across languages.
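The idea of downsampling with a wavelet transform rather than max pooling can be illustrated with a minimal NumPy sketch. The abstract does not say which wavelet is used or how the subbands are fed to later layers, so the single-level Haar wavelet and the function names `haar_dwt2` and `max_pool2` below are assumptions made for illustration only, not the authors' implementation.

```python
import numpy as np


def haar_dwt2(x):
    """Single-level 2D Haar transform (illustrative; the wavelet choice is an assumption).

    Takes a 2D array with even height and width and returns four subbands,
    each half the size: (approx, d_cols, d_rows, d_diag).
    """
    x = np.asarray(x, dtype=np.float64)
    if x.ndim != 2 or x.shape[0] % 2 or x.shape[1] % 2:
        raise ValueError("expected a 2D array with even height and width")
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0  # low-pass in both directions
    d_cols = (a - b + c - d) / 2.0  # difference between adjacent columns
    d_rows = (a + b - c - d) / 2.0  # difference between adjacent rows
    d_diag = (a - b - c + d) / 2.0  # diagonal difference
    return approx, d_cols, d_rows, d_diag


def max_pool2(x):
    """Plain 2x2 max pooling with stride 2, for comparison."""
    x = np.asarray(x, dtype=np.float64)
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))


if __name__ == "__main__":
    # Toy 4x4 "image" with a sharp edge between columns 0 and 1.
    img = np.array([[0, 9, 9, 9]] * 4, dtype=np.float64)
    approx, d_cols, d_rows, d_diag = haar_dwt2(img)
    print("max pool:\n", max_pool2(img))
    print("Haar approximation:\n", approx)
    print("Haar column detail:\n", d_cols)
```

Both operations halve the spatial resolution. Max pooling keeps only the largest value in each 2x2 block, whereas the Haar transform is orthonormal, so the four subbands together retain all the information in the input. The detail subbands carry the edge and texture content that pooling would discard, which is the property the abstract attributes to the wavelet-based design.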