0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Adaptive Pronunciation Scoring: Aligning Automated Assessments with Human Expert Evaluations
Authors :
Omid Aghdaei
1
Mohammad Sadegh Safari
2
Mohammad Hassan Rasoolizadeh
3
Abedeh Mirzaee
4
1- Part AI Research Center
2- Part AI Research Center
3- Part AI Research Center
4- Part AI Research Center
Keywords :
Pronunciation evaluation،adaptive scoring،speech assessment،Computer-assisted Pronunciation Training (CAPT،phonetic error analysis
Abstract :
Conventional pronunciation evaluation systems aggregate metrics such as accuracy, fluency, completeness, and prosody into a composite score. These systems apply uniform penalization when assessing accuracy, neglecting the linguistic and perceptual significance of different errors. Consequently, minor phonetic variations are penalized as severely as critical intelligibility errors, leading to inaccurate assessments. This paper introduces an adaptive scoring framework that adjusts penalty weights based on the severity and linguistic relevance of errors. By categorizing errors into contextual, structural, phonetic, and alignment types, the framework enables more precise, context-sensitive evaluation. The approach is evaluated on the Speechocean762 and proprietary datasets. Experimental results show stronger alignment with expert judgments compared to traditional models, improving the reliability of Computer-Assisted Pronunciation Training (CAPT) and automated speech evaluation.
Papers List
List of archived papers
Sports News Summarization Using Ensebmle Learning
Moein Sartakhti.salimi@gmail.com - Mohammad Javad Maleki Kahaki - Ahmad Yoosofan - Seyyed Vahid Moravvej
An interactive user groups recommender system based on reinforcement learning
Hediyeh Naderi Allaf - Mohsen Kahani
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Elham Shabaninia - Fatemeh sadat Eslami - Ali Afkari Fahandari - Hossein Nezamabadi-pour
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
A 2D-CNN Architecture for Improving the Classification Accuracy of an Electronic Nose with Different Sensor Positions
Hannaneh Mahdavi - Reza Goldoust - Saeideh Rahbarpour
A Synergistic Hybrid Architecture with Residual Attention and Mixture-of-Experts for Robust Hour-Ahead Forex Forecasting
Alireza Abbaszadeh - Seyyed Abed Hosseini - Mohammad Reza Akbarzadeh Totonchi
DRL-based Decision-Making for Autonomous Vehicle Collision Avoidance
Hoda Gholamrezaee - Seyedreza Taghizadeh - Ali Honarjoo
Zone-Based Federated Learning in Indoor Positioning
Omid Tasbaz - Vahideh Moghtadaiee - Bahar Farahani
A Genetic-based Fusion Approach of Persian and Universal Phonetic results for Spoken Language Identification
Ashkan Moradi - Yasser Shekofteh - Saeed Zarei
Improving Soft Error Reliability of FPGA-based Deep Neural Networks with Reduced Approximate TMR
Anahita Hosseinkhani - Behnam Ghavami
more
Samin Hamayesh - Version 43.7.0