0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Authors :
Amir Bidokhti
1
Shahrokh Ghaemmaghami
2
1- Department of Electrical Engineering. Sharif University of Technology. Tehran, Iran
2- Sharif University of Technology
Keywords :
deep learning،memory augmented neural networks،graph neural networks،question-answering،neural Turing machine
Abstract :
Memory is crucial for machine learning tasks on sequential data. From vanilla RNN to LSTM and memory augmented neural networks, researchers have investigated several types of memory structures. However, they suffer from limitations in the capacity or ability to keep track of long-term dependencies. This paper presents an external memory module composed of two distinct submodules that are inspired by memory in the human brain. Besides, a sleep mechanism is incorporated into this memory, which can mimic sleep's effects on improving human memory. The proposed method is fully differentiable; thus, backpropagation can be used for its training. Experiments conducted on the bAbI dataset show that the proposed method is successful in 16 out of 20 tasks, and the average error is 2.8%. The performance of the proposed method is far better than the conventional NTM, and it has the lowest prediction error in 7 out of 20 tasks among baseline systems. Besides, the proposed system is the only system that can solve tasks 16 and 17 of the bAbI dataset.
Papers List
List of archived papers
Fatty Liver Level Recognition Using Particle Swarm Optimization (PSO) Image Segmentation and Analysis
Seyed Muhammad Hossein Mousavi - Vyacheslav Lyashenko - Atiye Ilanloo - S. Younes Mirinezhad
Introducing E4MT and LMBNC: Persian pre-processing utilities
Zakieh Shakeri - Mehran Ziabary - Behrooz Vedadian - Fatemeh Azadi - Saeed Torabzadeh - Arian Atefi
Supervised Contrastive Learning for Short Text Classification in Natural Language Processing
Mitra Esmaeili - Hamed Vahdat nejad
A parallel CNN-BiGRU network for short-term load forecasting in demand-side management
Arghavan Irankhah - Sahar Rezazadeh Saatlou - Mohammad Hossein Yaghmaee - Sara Ershadi-Nasab - Mohammad Alishahi
Detecting Non-Spherical Clusters Using Modified CURE Algorithm
Arezou Safdari - Pedram Salehpour
Hybrid Flow-Rule Placement Method of Proactive and Reactive in SDNs
Mohammadreza Khoobbakht - Mohammadreza Noei - Mohammadreza Parvizimosaed
Analysis of Address Lifespans in Bitcoin and Ethereum
Amir Mohammad Karimi Mamaghan - Amin Setayesh - Behnam Bahrak
TriMAE: Fashion visual search with Triplet Masked Auto Encoder Vision Transformer
Lachin Zamani - Reza Azmi
An Analysis of Botnet Detection Using Graph Neural Network
Faezeh Alizadeh - Mohammad Khansari
Chaotic multi-population ABC algorithm based on memory and levy flight for solving dynamic job shop scheduling problems
Mohammad Ali Zarif - Javad Hamidzadeh
more
Samin Hamayesh - Version 41.7.6