0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
A Genetic-based Fusion Approach of Persian and Universal Phonetic results for Spoken Language Identification
Authors :
Ashkan Moradi
1
Yasser Shekofteh
2
Saeed Zarei
3
1- Computer Science and Engineering Department, Shahid Beheshti University, Tehran, Iran
2- Computer Science and Engineering Department, Shahid Beheshti University, Tehran, Iran
3- Computer Science and Engineering Department, Shahid Beheshti University, Tehran, Iran
Keywords :
Spoken language identification, Phonetic-based approach, perplexity, Classifier fusion, Genetic Algorithm
Abstract :
Automatic Spoken language identification (LID) refers to the automatic process of identifying languages spoken in the audio files. Pure acoustic approaches have shown great potential in LID. Since acoustic approaches have become more and more popular, phonetic information has been largely overlooked. In this paper, we present a fusion approach based on the score probabilities of two phonetic LID systems. There are two SVM classifiers trained on perplexities as their feature vectors which are obtained from phone language models of different phone recognizers. Two phone recognizers are here utilized; one decodes the speech file to a sequence of IPA alphabet, as a universal phone recognizer [1], and the other is a Farsi phone recognizer which is trained on FARSDAT databases. The experimental results conducted on 27 languages within the NIST-LRE09 corpus demonstrated that the proposed fusion approach could greatly increase the classification accuracy of target languages.
Papers List
List of archived papers
Multi-source Ensemble Model for Scene Recognition
Amir Hossein Saleknia - Ahmad Ayatollahi
Spatio-Temporal Graph Neural Networks for Accurate Crime Prediction
Rojan Roshankar - Mohammad Reza Keyvanpour
A Robust Network for Embedded Traffic Sign Recognation.
Omid Nejati Manzari - Shahriar Baradaran Shokouhi
Low-Cost and Hardware Efficient Implementation of Pooling Layers for Stochastic CNN Accelerators
Mobin Vaziri - Hadi Jahanirad
AgeNet-AT: An End-to-End Model for Robust Joint Speaker Age Estimation and Gender Recognition Based on Attention Mechanism and Titanet
Mahsa Zamani Tarashandeh - Amirhossein Torkanloo - Mohammad Hossein Moattar
Enhancing Lighter Neural Network Performance with Layer-wise Knowledge Distillation and Selective Pixel Attention
Siavash Zaravashan - Sajjad Torabi - Hesam Zaravashan
FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data
Rasoul Jafari Gohari - Laya Aliahmadipour - Ezat Valipour
Segmentation of Coronary Artery Stenosis in X-ray Angiography using Mamba Models
Fatemeh Fouladi - Ali Rostami - Hedieh Sajedi
Sum Rate Analysis and Power Allocation in Massive MIMO Systems with Power Constraints
Abdolrasoul Sakhaei Gharagezlou - Mahdi Nangir
Stock market prediction using multi-objective optimization
Mahshid Zolfaghari - Hamid Fadishei - Mohsen Tajgardan - Reza Khoshkangini
more
Samin Hamayesh - Version 41.7.6