0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
DPRNN-FORMER: AN EFFICIENT WAY TO DEAL WITH BLIND SOURCE SEPARATION
Authors :
Ramin Ghorbani
1
Sajad Haghzad Klidbary
2
1- University of Zanjan
2- University of Zanjan
Keywords :
Blind Source Separation (BSS)،Deep Neural Network (DNN)،Long short term memory(LSTM)،Speech Source Separation،Transformer
Abstract :
Recent advancements in DL have indicated that time-domain methods are more successful than traditional time-frequency-based methods regarding speech separation. However, modeling very long sequences in time-domain separation systems presents some challenges. Recurrent neural networks and 1-D convolutional neural networks are not sufficient for modeling lengthy sequences by themselves. In this paper, a hybrid RNN is proposed, combining a pre-trained DPRNN and transformer. This strategy uses the transformer's ability to perceive context, allowing it to gain insight into the time-evolving data connected to audio signals. To handle extended input sequences, the network partitions them into more manageable sections, performing intra-section and inter-section operations iteratively. The proposed network surpasses current state-of-the-art algorithms, achieving SI-SNR of 11.129 and SDR of 11.285 dB on the public WSj0-3mix dataset.
Papers List
List of archived papers
A Vision-Based Method for Human Activity Recognition Using Local Binary Pattern
Babak Goodarzi - Reza Javidan - Mohammad Sadegh Rezaei
Improvement of Credit Scoring by LSTM Autoencoder Model
Milad Sattari Maleki - Seyedeh Niusha Motevallian - Faezehsadat Hosseini - Mohammad Sabokrou - Hamidreza Soltanalizadeh Maleki
FGM Copula based Analysis of Coverage Region for Wireless Three-User Multiple Access Channel with Correlated Channel Coefficients
Mona Sadat Mohsenzadeh - Ghosheh Abed Hodtani
R2-BAC: A Novel Blockchain and IoT-Based Access Control Model for Supply Chain Management
Sadegh Sohani - Farnaz Kamranfar - Haleh Amintoosi - Mohammad Allahbakhsh
Machine Learning-Driven Prediction of Anti-Alzheimer Drug Efficacy Using PubChem Molecular Fingerprints
Mohammad Javad Sadeghi - Mohammad Javad Nemati - AliAsghar Zare - Mohammadreza Shams
DevRanker: An Effective Approach to Rank Developers for Bug Report Assignment
Mohammad Reza Kardoost - Mohammad Reza Moosavi - Reza Akbari
A Novel Density-Based KNN in Pattern Recognition
Sajad Haghzad Klidbary - Abazar Arabameri
Farsi Optical Character Recognition Using a Transformer-based Model
Fatemeh Asadi Zeydabadi - Elham Shabaninia - Hossein Nezamabadi-pour - Melika Shojaee
Enhanced Duplicate Bug Report Detection in Anonymized Environments: A Parallelized Multi-Task Learning Framework
Alireza Shorafa - Abolfazl Zarghani
Deep Learning-Based Malaysian Sign Language (MSL) Recognition: Exploring the Impact of Color Spaces
Ervin Gubin Moung - Precilla Fiona Suwek - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Wei Leong Khong
more
Samin Hamayesh - Version 43.7.0