14th International Conference on Computer and Knowledge Engineering
Multi-Digit Handwritten Recognition: A CNN-LSTM Hybrid Approach with Wavelet Transforms
Authors :
Amin Kazempour (1), Jafar Tanha (2)
1- University of Tabriz
2- University of Tabriz
Keywords :
Handwritten Digit Recognition, Deep Learning, Convolutional Neural Network, Attention Mechanism, Wavelet Transforms
Abstract :
Handwritten digit recognition remains a central problem in machine learning and computer vision, with applications such as license plate identification, form processing, and historical document reading. To address the challenges of multi-digit and multi-language recognition, including the variation in handwriting styles across languages, we propose a model that combines convolutional and recurrent neural networks with an attention mechanism. Unlike conventional methods, the model uses wavelet transforms instead of max pooling to preserve image texture and edges. We also built a dataset of English and Persian digits containing 80,000 training and 20,000 test images, each showing a number of 1–5 digits. In extensive experiments against several state-of-the-art models, the proposed model achieved 99.58% accuracy on single digits and 98.03% on digit sequences. These results support the effectiveness of the approach and its potential as a basis for future research on multi-digit recognition across languages.
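The abstract does not say which wavelet is used or how the resulting subbands are fed to later layers, so the following is only a minimal sketch of the general idea, assuming a single-level 2D Haar transform. The function names (`haar_dwt2`, `haar_idwt2`, `max_pool2x2`) are hypothetical and chosen for illustration. Like 2x2 max pooling, the transform halves the resolution, but it returns three detail subbands alongside the averaged one, so edge and texture information is kept rather than discarded.

```python
def haar_dwt2(image):
    """Single-level 2D Haar transform (assumed wavelet, not confirmed by the paper).

    `image` is a 2D list with even height and width. Returns four
    half-resolution subbands: approx, row_diff, col_diff, diag.
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("height and width must both be even")
    approx, row_diff, col_diff, diag = [], [], [], []
    for r in range(0, rows, 2):
        a_row, r_row, c_row, d_row = [], [], [], []
        for c in range(0, cols, 2):
            tl, tr = image[r][c], image[r][c + 1]          # top-left, top-right
            bl, br = image[r + 1][c], image[r + 1][c + 1]  # bottom-left, bottom-right
            a_row.append((tl + tr + bl + br) / 2)  # scaled local average
            r_row.append((tl + tr - bl - br) / 2)  # top minus bottom: horizontal edges
            c_row.append((tl - tr + bl - br) / 2)  # left minus right: vertical edges
            d_row.append((tl - tr - bl + br) / 2)  # diagonal detail
        approx.append(a_row)
        row_diff.append(r_row)
        col_diff.append(c_row)
        diag.append(d_row)
    return approx, row_diff, col_diff, diag


def haar_idwt2(approx, row_diff, col_diff, diag):
    """Invert haar_dwt2, rebuilding the full-resolution image from the subbands."""
    image = []
    for a_row, r_row, c_row, d_row in zip(approx, row_diff, col_diff, diag):
        top, bottom = [], []
        for a, r, c, d in zip(a_row, r_row, c_row, d_row):
            top += [(a + r + c + d) / 2, (a + r - c - d) / 2]
            bottom += [(a - r + c - d) / 2, (a - r - c + d) / 2]
        image += [top, bottom]
    return image


def max_pool2x2(image):
    """Plain 2x2 max pooling with stride 2, shown only for comparison."""
    return [
        [max(image[r][c], image[r][c + 1], image[r + 1][c], image[r + 1][c + 1])
         for c in range(0, len(image[0]), 2)]
        for r in range(0, len(image), 2)
    ]
```

On the 2x2 block `[[1, 3], [5, 7]]`, max pooling keeps only the value 7, whereas the four Haar coefficients (8, -4, -2, 0) are enough for `haar_idwt2` to recover all four pixels. In a network, the subbands would typically be stacked as extra channels, though that choice is also an assumption here.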