0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Authors :
Farzane Abdoli
1
Hamid Sheikhzade
2
Vahid Pourahmadi
3
1- Amirkabir university of technology
2- Amirkabir university of technology
3- Amirkabir university of technology
Keywords :
acoustic-to-articulatory inversion،DDPG،Reinforcement learning،Speaker-independent،MFCC
Abstract :
This paper aims to utilize a deep reinforcement learning algorithm for the acoustic-to-articulatory inversion problem. A deep deterministic policy gradient (DDPG) based method is adopted to adjust the articulatory parameters of a speaker to minimize the cepstral difference between original speech and the synthesized one. In traditional methods such as NNs, GMMs,... , parallel acoustic and articulatory training data is needed for each speaker, but the proposed iterative DDPG is used to explore articulatory space for finding the best point, which maximizes the desired reward without any need for joint kinematic and articulatory data for the speaker. Acoustic signals are synthesized by VocalTractLab(VTL), a three-dimensional articulatory synthesizer, and represented by Mel-frequency cepstral coefficients (MFCCs). This method provides estimated parameters very close to the ones which are calculated by MRI and advanced processing.
Papers List
List of archived papers
Computational Microscopy Based on Fourier Ptychography using Embedded Architecture
Rezvan Mir - Abedin Vahedian
Link Prediction for Recommendation based on Complex Representation of Items Similarities
Masoumeh Alinia - Seyed Mohammad Hossein Hasheminejad - Hadi Shakibian
Diagnosis of Depression Based on New Features Extractive from the Frequency Space of the EEG
Melika Changizi - Saeid Rashidi
InfOnto: An ontology for fashion influencer marketing based on Instagram
Somaye Sultani - Mohsen Kahani
Optimization of quantum secret sharing communication using corresponding bits
Mahsa Khorrampanah - Mohammad Bolokian - Monireh Houshmand
Brain Age Estimation with Twin Vision Transformer using Hippocampus Information Applicable to Alzheimer Dementia Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Amirhossein Ghaemi - Habibollah Danyali - Kamran Kazemi - Alireza Ghaemi
Effect of Tissue Excitation in Breast Cancer Detection from Ultrasound RF Time Series: Phantom studies
Elaheh Norouzi Ghehi - Ali Fallah - Saeid Rashidi - Maryam Mehdizadeh Dastjerdi
Optimizing Question-Answering Framework Through Integration of Text Summarization Model and Third-Generation Generative Pre-Trained Transformer
Ervin Gubin Moung - Toh Sin Tong - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Farashazillah Yahya
The Effect of Network Environment on Traffic Classification
Abolghasem Rezaei Khesal - Mehdi Teimouri
Towards Efficient Capsule Networks through Approximate Squash Function and Layer-wise Quantization
Mohsen Raji - Kimia Soroush - Amir Ghazizadeh
more
Samin Hamayesh - Version 41.7.6