0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Speech Emotion Recognition Using a Hierarchical Adaptive Weighted Multi-Layer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
Authors :
Fatemeh Daneshfar
1
Seyed Jahanshah Kabudian
2
1- University of Kurdistan
2- Razi University
Keywords :
speech emotion recognition, extreme learning machine, weighted classification
Abstract :
The importance of doing research into affective computing has multiplied with the growing popularity of intelligent and human-machine interface systems. In this paper, a system for speech emotion recognition (SER) is proposed using new techniques in different parts. The given system extracts speech features from both speech and glottal-waveform signals in feature extraction section including spectro-temporal ones obtained from Gabor filter bank (GBFB) and separate Gabor filter bank (SGBFB) which have not been so far utilized for SER. At the classification step, a hierarchical adaptive weighted multilayer extreme learning machine (H-AWELM) is employed. This hybrid classifier consists of two parts: the first part for sparse unsupervised feature learning using a multi-layer neural network (NN) with sparse extreme learning machine auto-encoder (ELMAE) layers, and the second part for feature classification in the last layer using Tikhonov’s regularized least squares (LS) technique. One of the most important issues in multi-class ELM training process is how to deal with data imbalance problem. This paper presents a new adaptive weighting method to solve this problem that can be more accurate than current weighting methods. Finally, the proposed system is evaluated on a well-known emotional speech database. Experimental results demonstrate that the proposed system outperforms the state-of-the-art ones.
Papers List
List of archived papers
FGM Copula based Analysis of Coverage Region for Wireless Three-User Multiple Access Channel with Correlated Channel Coefficients
Mona Sadat Mohsenzadeh - Ghosheh Abed Hodtani
Detecting Non-Spherical Clusters Using Modified CURE Algorithm
Arezou Safdari - Pedram Salehpour
Mitochondrial Segmentation in Microscopy Images Using UNet-VGG19
Zerek Sediq Hossein - Rojiar Pir Mohammadiani - Saadat Izadi
Robustness Scan of Digital Circuits Using Convolutional Neural Networks
Mobin Vaziri - Mohammad Mehdi Rahimifar - Hadi Jahanirad
AvashoG2P: A multi-module G2P Converter for Persian
Ali Moghadaszadeh - Fatemeh Pasban - Mohsen Mahmoudzadeh - Maryam Vatanparast - Amirmohammad Salehoof
A Language-Independent Approach to Classification of Textual File Fragments: Case Study of Persian, English, and Chinese Languages
Fatemeh Mansouri Hanis - Hamidreza Khoshvaghti - Mehdi Teimouri - Hadi Veisi
Chaotic multi-population ABC algorithm based on memory and levy flight for solving dynamic job shop scheduling problems
Mohammad Ali Zarif - Javad Hamidzadeh
Optimization of quantum secret sharing communication using corresponding bits
Mahsa Khorrampanah - Mohammad Bolokian - Monireh Houshmand
Dynamic Knowledge Enhanced Neural Fashion Trend Forecasting with Quantile Loss
Fatemeh Rooholamini - Reza Azmi - Mobina Khademhossein - Maral Zarvani
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
more
Samin Hamayesh - Version 41.7.6