0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Authors :
Amir Bidokhti
1
Shahrokh Ghaemmaghami
2
1- Department of Electrical Engineering. Sharif University of Technology. Tehran, Iran
2- Sharif University of Technology
Keywords :
deep learning،memory augmented neural networks،graph neural networks،question-answering،neural Turing machine
Abstract :
Memory is crucial for machine learning tasks on sequential data. From vanilla RNN to LSTM and memory augmented neural networks, researchers have investigated several types of memory structures. However, they suffer from limitations in the capacity or ability to keep track of long-term dependencies. This paper presents an external memory module composed of two distinct submodules that are inspired by memory in the human brain. Besides, a sleep mechanism is incorporated into this memory, which can mimic sleep's effects on improving human memory. The proposed method is fully differentiable; thus, backpropagation can be used for its training. Experiments conducted on the bAbI dataset show that the proposed method is successful in 16 out of 20 tasks, and the average error is 2.8%. The performance of the proposed method is far better than the conventional NTM, and it has the lowest prediction error in 7 out of 20 tasks among baseline systems. Besides, the proposed system is the only system that can solve tasks 16 and 17 of the bAbI dataset.
Papers List
List of archived papers
Improving ADHD Detection with Cost-Sensitive LightGBM
Behnam Yousefimehr - Mehdi Ghatee - Ali Heydari
Efficient Object Detection using Deep Reinforcement Learning and Capsule Networks
Sobhan Siamak - Eghbal Mansoori
Performance Evaluation Study of Color Space Selection In Video Based Facial Expression Recognition Using Deep Neural Networks For Sentiment Analysis
Phee Wei Qin - Ervin Gubin Moung - Ali Farzamnia - Farashazillah Yahya - John Julius Danker Khoo - Maisarah Mohd Sufian
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
Efficient Sub-Carrier Relationship Extraction for Human Activity Recognition via EEGNet in Wireless Sensing
Siavash Zaravashan - Sadegh ArefiZadeh - Sajjad Torabi
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
A Cost-Sensitive Genetic Algorithm for Customer Segmentation in Auto Insurances
Alireza Khajenoori - Mohammad Saniee Abadeh - Mohsen Mohammadzadeh
DPRNN-FORMER: AN EFFICIENT WAY TO DEAL WITH BLIND SOURCE SEPARATION
Ramin Ghorbani - Sajad Haghzad Klidbary
A Deep CNN Model Based Ensemble Approach for Semantic and Instance Segmentation of Indoor Environment
Sajad Rezaei - Jafar Tanha - Zahra Jafari - SeyedEhsan Roshan - Mohammad-Amin Memar Kochebagh
Damage Detection After the Earthquake Using Sentinel-1 and 2 Images and Machine Learning Algorithms (Case Study: Sarpol-e Zahab Earthquake)
Niloofar Alizadeh - Behnam Asghari Beirami - Mehdi Mokhtarzade
more
Samin Hamayesh - Version 42.2.1