0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Authors :
Farzane Abdoli
1
Hamid Sheikhzade
2
Vahid Pourahmadi
3
1- Amirkabir university of technology
2- Amirkabir university of technology
3- Amirkabir university of technology
Keywords :
acoustic-to-articulatory inversion،DDPG،Reinforcement learning،Speaker-independent،MFCC
Abstract :
This paper aims to utilize a deep reinforcement learning algorithm for the acoustic-to-articulatory inversion problem. A deep deterministic policy gradient (DDPG) based method is adopted to adjust the articulatory parameters of a speaker to minimize the cepstral difference between original speech and the synthesized one. In traditional methods such as NNs, GMMs,... , parallel acoustic and articulatory training data is needed for each speaker, but the proposed iterative DDPG is used to explore articulatory space for finding the best point, which maximizes the desired reward without any need for joint kinematic and articulatory data for the speaker. Acoustic signals are synthesized by VocalTractLab(VTL), a three-dimensional articulatory synthesizer, and represented by Mel-frequency cepstral coefficients (MFCCs). This method provides estimated parameters very close to the ones which are calculated by MRI and advanced processing.
Papers List
List of archived papers
A New Hypercube Variant: Pruned Shuffle Connected Cube
Reza Latifi - Mahmoud Naghibzadeh
Facial Emotion Recognition Under Mask Coverage Using a Data Augmentation Technique
Aref Farhadipour - Pouya Taghipour
Real-Time Vehicle Detection and Classification in UAV imagery Using Improved YOLOv5
Mohammad Hossein Hamzenejadi - Hadis Mohseni
Forecasting El Niño Six Months in Advance Utilizing Augmented Convolutional Neural Network
Mohammad Naisipour - Iraj Saeedpanah - Arash Adib - Mohammad Hossein Neisi Pour
Analysis of Address Lifespans in Bitcoin and Ethereum
Amir Mohammad Karimi Mamaghan - Amin Setayesh - Behnam Bahrak
An Ensemble CNN for Brain Age Estimation based on Hippocampal Region Applicable to Alzheimer's Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Habibollah Danyali - Kamran Kazemi
A supervised approach using transformer networks for the detection of turning-related anomalies in urban intersections
Mohammad Mahdi HajiAbadi - Manoochehr Nahvi
Histopathology Image-Based Cancer Classification Utilizing Transfer Learning Approach
Amir Meydani - Alireza Meidani - Ali Ramezani - Maryam Shabani - Mohammad Mehdi Kazeminasab - Shahriar Shahablavasani
Automating Theory of Mind Assessment with a LLaMA-3-Powered Chatbot: Enhancing Faux Pas Detection in Autism
Avisa Fallah - Ali Keramati - Mohammad Ali Nazari - Fatemeh Sadat Mirfazeli
Speech Emotion Recognition Using a Hierarchical Adaptive Weighted Multi-Layer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
Fatemeh Daneshfar - Seyed Jahanshah Kabudian
more
Samin Hamayesh - Version 42.2.1