12th International Conference on Computer and Knowledge Engineering
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Authors :
Farzane Abdoli 1, Hamid Sheikhzade 2, Vahid Pourahmadi 3
1, 2, 3- Amirkabir University of Technology
Keywords :
Acoustic-to-articulatory inversion, DDPG, Reinforcement learning, Speaker-independent, MFCC
Abstract :
This paper applies a deep reinforcement learning algorithm to the acoustic-to-articulatory inversion problem. A method based on deep deterministic policy gradient (DDPG) adjusts the articulatory parameters of a speaker to minimize the cepstral difference between the original speech and the synthesized speech. Traditional methods such as neural networks (NNs) and Gaussian mixture models (GMMs) need parallel acoustic and articulatory training data for each speaker. The proposed iterative DDPG instead explores the articulatory space to find the point that maximizes the desired reward, without any need for joint kinematic and articulatory data for the speaker. Acoustic signals are synthesized by VocalTractLab (VTL), a three-dimensional articulatory synthesizer, and represented by Mel-frequency cepstral coefficients (MFCCs). The method provides estimated parameters very close to those calculated by MRI and advanced processing.
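The abstract does not give the exact form of the reward, so the following is only a minimal sketch of one plausible choice: the negative mean Euclidean distance between the MFCC frames of the original and the synthesized speech. The function name `cepstral_reward`, the array shapes, and the choice of distance are assumptions made for illustration, not the authors' definition.

```python
import numpy as np


def cepstral_reward(target_mfcc: np.ndarray, synth_mfcc: np.ndarray) -> float:
    """Hypothetical reward: negative mean per-frame MFCC distance.

    Both inputs are assumed to have shape (frames, coefficients).
    A smaller cepstral difference gives a larger (less negative) reward.
    """
    if target_mfcc.shape != synth_mfcc.shape:
        raise ValueError("MFCC sequences must have the same shape")
    # Euclidean distance between corresponding frames.
    per_frame = np.linalg.norm(target_mfcc - synth_mfcc, axis=1)
    return -float(per_frame.mean())


# Toy data standing in for real MFCCs (assumed values, not from the paper).
target = np.zeros((2, 3))
candidate = np.ones((2, 3))
reward = cepstral_reward(target, candidate)
```

In a loop of the kind the abstract describes, the agent's action would presumably update the articulatory parameters, the synthesizer would produce speech from them, and a reward of this sort would score the resulting MFCCs against those of the original speech.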