0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Speech Emotion Recognition Using a Hierarchical Adaptive Weighted Multi-Layer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
Authors :
Fatemeh Daneshfar
1
Seyed Jahanshah Kabudian
2
1- University of Kurdistan
2- Razi University
Keywords :
speech emotion recognition, extreme learning machine, weighted classification
Abstract :
The importance of doing research into affective computing has multiplied with the growing popularity of intelligent and human-machine interface systems. In this paper, a system for speech emotion recognition (SER) is proposed using new techniques in different parts. The given system extracts speech features from both speech and glottal-waveform signals in feature extraction section including spectro-temporal ones obtained from Gabor filter bank (GBFB) and separate Gabor filter bank (SGBFB) which have not been so far utilized for SER. At the classification step, a hierarchical adaptive weighted multilayer extreme learning machine (H-AWELM) is employed. This hybrid classifier consists of two parts: the first part for sparse unsupervised feature learning using a multi-layer neural network (NN) with sparse extreme learning machine auto-encoder (ELMAE) layers, and the second part for feature classification in the last layer using Tikhonov’s regularized least squares (LS) technique. One of the most important issues in multi-class ELM training process is how to deal with data imbalance problem. This paper presents a new adaptive weighting method to solve this problem that can be more accurate than current weighting methods. Finally, the proposed system is evaluated on a well-known emotional speech database. Experimental results demonstrate that the proposed system outperforms the state-of-the-art ones.
Papers List
List of archived papers
An Energy-efficient Clustering Method based on Butterfly Optimization Algorithm by Considering the Criterion of Intra-cluster Distances in WSNs
Fariba Saghi Hadi S. Aghdasi
DPRNN-FORMER: AN EFFICIENT WAY TO DEAL WITH BLIND SOURCE SEPARATION
Ramin Ghorbani - Sajad Haghzad Klidbary
Towards Efficient Video Object Detection on Embedded Devices
Mohammad Hajizadeh - Adel Rahmani - Mohammad Sabokrou
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
Practical Implementation of Real-Time Waste Detection and Recycling based on Deep Learning for Delta Parallel Robot
Hasan Jalali - Shaya Garjani - Ahmad Kalhor - Mehdi Tale Masouleh - Parisa Yousefi
Real-Time Forecasting Using Mixed Frequency Time-Series Data
Armin Khayati - Mohammad Taheri - Koorush Ziarati
Energy Efficient Power Allocation in MIMO-NOMA Systems with ZF Receiver Beamforming in Multiple Clusters
Mahdi Nangir - Abdolrasoul Sakhaei Gharagezlou - Nima Imani
Iris Detection and Segmentation Using Deep Learning
Ali Khaki - Ali Aghagolzadeh - Bagher Rahimpour Cami
Fatty Liver Level Recognition Using Particle Swarm Optimization (PSO) Image Segmentation and Analysis
Seyed Muhammad Hossein Mousavi - Vyacheslav Lyashenko - Atiye Ilanloo - S. Younes Mirinezhad
A Vision-Based Method for Human Activity Recognition Using Local Binary Pattern
Babak Goodarzi - Reza Javidan - Mohammad Sadegh Rezaei
more
Samin Hamayesh - Version 42.4.1