0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
DPRNN-FORMER: AN EFFICIENT WAY TO DEAL WITH BLIND SOURCE SEPARATION
Authors :
Ramin Ghorbani
1
Sajad Haghzad Klidbary
2
1- University of Zanjan
2- University of Zanjan
Keywords :
Blind Source Separation (BSS)،Deep Neural Network (DNN)،Long short term memory(LSTM)،Speech Source Separation،Transformer
Abstract :
Recent advancements in DL have indicated that time-domain methods are more successful than traditional time-frequency-based methods regarding speech separation. However, modeling very long sequences in time-domain separation systems presents some challenges. Recurrent neural networks and 1-D convolutional neural networks are not sufficient for modeling lengthy sequences by themselves. In this paper, a hybrid RNN is proposed, combining a pre-trained DPRNN and transformer. This strategy uses the transformer's ability to perceive context, allowing it to gain insight into the time-evolving data connected to audio signals. To handle extended input sequences, the network partitions them into more manageable sections, performing intra-section and inter-section operations iteratively. The proposed network surpasses current state-of-the-art algorithms, achieving SI-SNR of 11.129 and SDR of 11.285 dB on the public WSj0-3mix dataset.
Papers List
List of archived papers
Real-Time Forecasting Using Mixed Frequency Time-Series Data
Armin Khayati - Mohammad Taheri - Koorush Ziarati
Design and Simulation of a Low PDP Full Adder by Combining Majority Function and TGDI Technique in CNTFET Technology
Mahsa Mohammadi
Pyramid Transformer for Traffic Sign Detection
Omid Nejati manzari - Amin Boudesh - Shahriar B. Shokouhi
Extreme Gradient Boosting (XGBoost) Regressor and Shapley Additive Explanation for Crop Yield Prediction in Agriculture
Dennis A/L Mariadass - Ervin Gubin Moung - Maisarah Mohd Sufian - Ali Farzamnia
SUBoost: A Novel Boosting-Based Selective Undersampling for handling Imbalanced Data
Nima Rasi Baghmishe - Jafar Tanha - Ehsan Roshan
An optimal workflow scheduling method in cloud-fog computing using three-objective Harris-Hawks algorithm
Ahmadreza Montazerolghaem - Maryam Khosravi - Fatemeh Rezaee
Crack Segmentation in Civil Structure Images Using a Deep Learning Based Multi-Classifier System
Mohammadreza Asadi - Seyedeh Sogand Hashemi - Mohammad Taghi Sadeghi
Time Series Analysis by Bi-GRU for Forecasting Bitcoin Trends based on Sentiment Analysis
Fatemeh Saadatmand - Mohammad Ali Zare Chahoki
Plant Disease Detection Using Dynamic Knowledge Distillation and Attention Mechanism
Mohammad Ghasemi Arian - Mohammad Hossein Yaghmaee Moghaddam
Improving the classification of high dimensional class-imbalanced data using the Chaos particle swarm optimization with Levy Flight
Mohammad Ali Zarif - Javad Hamidzadeh
more
Samin Hamayesh - Version 43.7.0