0% Complete
Home
/
11th International Conference on Computer and Knowledge Engineering
Emotion Recognition In Persian Speech Using Deep Neural Networks
Authors :
Ali Yazdani
1
Hossein Simchi
2
Yasser Shekofteh
3
1- Faculty of Computer Science and Engineering, Shahid Beheshti University
2- Faculty of Computer Science and Engineering, Shahid Beheshti University
3- Faculty of Computer Science and Engineering Shahid Beheshti University Tehran, Iran
Keywords :
Speech Emotion Recognition, Feature extraction, Deep Learning, Farsi Language, ShEMO dataset
Abstract :
Speech Emotion Recognition (SER) is of great importance in Human-Computer Interaction (HCI), as it provides a deeper understanding of the situation and results in better interaction. In recent years, various machine learning and Deep Learning (DL) algorithms have been developed to improve SER techniques. Recognition of the spoken emotions depends on the type of expression that varies between different languages. In this paper, to further study important factors in the Farsi language, we examine various DL techniques on a Farsi/Persian dataset, Sharif Emotional Speech Database (ShEMO), which was released in 2018. Using signal features in low- and high-level descriptions and different deep neural networks and machine learning techniques, Unweighted Accuracy (UA) of 65.20% and Weighted Accuracy (WA) of 78.29% is achieved.
Papers List
List of archived papers
FarCQA: A Farsi Community Dataset for Question Classification and Answer Selection
Saba Emami - Maedeh Mosharraf
Innovative Customer Segmentation based on Multi-Step Sequential Deep Clustering in the Telecommunication Industry
Fatemeh Jalali Farahani - Shima Tabibian
Lempel-Ziv-based Hyper-Heuristic Solution for Longest Common Subsequence Problem
Mahdi Nasrollahi - Reza Shami Tanha - Mohsen Hooshmand
Optimizing Text-Based Protocol Clustering in Reverse Engineering with Auto-Encoders and Fine-Tuned Parameters
Shiva Mahmoudzadeh - Mohaddese Nemati - Mehdi Teimouri
A Hybrid Echo State Network for Hypercomplex Pattern Recognition, Classification, and Big Data Analysis
Mohammad Jamshidi - Fatemeh Daneshfar
MCRS-SAE : multi criteria recommender system based on sparse autoencoder
Amir reza Kalantarnezhad - Javad Hamidzadeh
Brain Age Estimation with Twin Vision Transformer using Hippocampus Information Applicable to Alzheimer Dementia Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Amirhossein Ghaemi - Habibollah Danyali - Kamran Kazemi - Alireza Ghaemi
Robust Learning to Learn Graph Topologies
Navid Akhavan Attar - Ali Fahim
Joint ADC-less Analog Demodulator and Decoder for Extended Binary (8, 4, 4) Hamming Channel Code
Mir Mahdi Safari - Jafar Pourrostam - Behzad Mozaffari Tazehkand
An Improved and Accurate Measure for Mining Correlated High-utility Itemsets
Amir Masoud Heidari Orojloo - Morteza Keshtkaran
more
Samin Hamayesh - Version 42.4.1