12th International Conference on Computer and Knowledge Engineering
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Authors :
Farzane Abdoli (1), Hamid Sheikhzade (2), Vahid Pourahmadi (3)
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
Keywords :
Acoustic-to-articulatory inversion, DDPG, Reinforcement learning, Speaker-independent, MFCC
Abstract :
This paper applies a deep reinforcement learning algorithm to the acoustic-to-articulatory inversion problem. A method based on deep deterministic policy gradient (DDPG) is adopted to adjust the articulatory parameters of a speaker so as to minimize the cepstral difference between the original speech and the synthesized speech. Traditional methods such as NNs and GMMs need parallel acoustic and articulatory training data for each speaker. The proposed iterative DDPG method instead explores the articulatory space to find the point that maximizes the desired reward, without any need for joint kinematic and articulatory data for the speaker. Acoustic signals are synthesized by VocalTractLab (VTL), a three-dimensional articulatory synthesizer, and are represented by Mel-frequency cepstral coefficients (MFCCs). The method yields estimated parameters very close to those obtained from MRI and advanced processing.
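The abstract does not give the exact reward, so the following is only a minimal sketch of one plausible choice: the negative mean Euclidean distance between frame-aligned MFCC vectors of the original and the synthesized speech. The function names `cepstral_distance` and `cepstral_reward`, the Euclidean metric, and the assumption that both signals have the same number of frames are illustrative and are not taken from the paper. Synthesis with VTL and MFCC extraction are left out; the inputs are assumed to be precomputed MFCC frames.

```python
import math
from typing import Sequence

Frame = Sequence[float]


def cepstral_distance(target: Sequence[Frame], synth: Sequence[Frame]) -> float:
    """Mean Euclidean distance between frame-aligned MFCC vectors.

    Assumption (not from the paper): both inputs have the same number of
    frames and the same number of coefficients per frame.
    """
    if not target or len(target) != len(synth):
        raise ValueError("inputs must be non-empty and have equal frame counts")
    total = 0.0
    for t_frame, s_frame in zip(target, synth):
        if len(t_frame) != len(s_frame):
            raise ValueError("frames must have equal numbers of coefficients")
        total += math.sqrt(sum((a - b) ** 2 for a, b in zip(t_frame, s_frame)))
    return total / len(target)


def cepstral_reward(target: Sequence[Frame], synth: Sequence[Frame]) -> float:
    """Hypothetical reward: a smaller cepstral distance gives a larger reward."""
    return -cepstral_distance(target, synth)
```

Under this assumption, an agent that proposes articulatory parameters would receive its highest reward (zero) when the synthesized MFCCs match the target exactly, so maximizing the reward is equivalent to minimizing the cepstral difference.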