12th International Conference on Computer and Knowledge Engineering
Deep Deterministic Policy Gradient in Acoustic-to-Articulatory Inversion
Authors :
Farzane Abdoli (1), Hamid Sheikhzade (2), Vahid Pourahmadi (3)
1- Amirkabir University of Technology
2- Amirkabir University of Technology
3- Amirkabir University of Technology
Keywords :
acoustic-to-articulatory inversion, DDPG, reinforcement learning, speaker-independent, MFCC
Abstract :
This paper applies a deep reinforcement learning algorithm to the acoustic-to-articulatory inversion problem. A method based on deep deterministic policy gradient (DDPG) adjusts a speaker's articulatory parameters to minimize the cepstral difference between the original speech and the synthesized speech. Traditional methods such as neural networks (NNs) and Gaussian mixture models (GMMs) need parallel acoustic and articulatory training data for each speaker. The proposed iterative DDPG instead explores the articulatory space to find the point that maximizes the desired reward, without requiring parallel acoustic and articulatory data for that speaker. Acoustic signals are synthesized by VocalTractLab (VTL), a three-dimensional articulatory synthesizer, and represented by Mel-frequency cepstral coefficients (MFCCs). The estimated parameters are very close to those obtained from MRI and advanced processing.
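The reward described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact reward formula, so the version below is an assumption: it uses the negative mean Euclidean distance between frame-aligned MFCC vectors of the original and synthesized speech. The names `cepstral_distance` and `reward` are hypothetical, and the MFCC frames are assumed to have been extracted and aligned beforehand; no synthesizer or DDPG agent is included.

```python
import math
from typing import Sequence

Frame = Sequence[float]  # one MFCC vector (assumed already extracted)


def cepstral_distance(target: Sequence[Frame], synth: Sequence[Frame]) -> float:
    """Mean Euclidean distance between two frame-aligned MFCC sequences."""
    if not target or len(target) != len(synth):
        raise ValueError("sequences must be non-empty and have equal length")
    total = 0.0
    for t, s in zip(target, synth):
        if len(t) != len(s):
            raise ValueError("frames must have the same number of coefficients")
        total += math.sqrt(sum((a - b) ** 2 for a, b in zip(t, s)))
    return total / len(target)


def reward(target: Sequence[Frame], synth: Sequence[Frame]) -> float:
    """Assumed reward: a smaller cepstral distance gives a larger reward."""
    return -cepstral_distance(target, synth)
```

Under this assumption, an agent that maximizes `reward` is driven toward articulatory parameters whose synthesized MFCCs match those of the original speech, with a maximum of zero for identical sequences.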