0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Authors :
Amir Bidokhti
1
Shahrokh Ghaemmaghami
2
1- Department of Electrical Engineering. Sharif University of Technology. Tehran, Iran
2- Sharif University of Technology
Keywords :
deep learning،memory augmented neural networks،graph neural networks،question-answering،neural Turing machine
Abstract :
Memory is crucial for machine learning tasks on sequential data. From vanilla RNN to LSTM and memory augmented neural networks, researchers have investigated several types of memory structures. However, they suffer from limitations in the capacity or ability to keep track of long-term dependencies. This paper presents an external memory module composed of two distinct submodules that are inspired by memory in the human brain. Besides, a sleep mechanism is incorporated into this memory, which can mimic sleep's effects on improving human memory. The proposed method is fully differentiable; thus, backpropagation can be used for its training. Experiments conducted on the bAbI dataset show that the proposed method is successful in 16 out of 20 tasks, and the average error is 2.8%. The performance of the proposed method is far better than the conventional NTM, and it has the lowest prediction error in 7 out of 20 tasks among baseline systems. Besides, the proposed system is the only system that can solve tasks 16 and 17 of the bAbI dataset.
Papers List
List of archived papers
Real-Time Gender Recognition with a Deep Neural Network
Samad Azimi Abriz - Majid Meghdadi
Efficient T-Count Fault-tolerant Quantum Clifford+T Multiplexer
Negin Mashayekhi - Shekoofeh Moghimi - Mohammad Reza Reshadinezhad
Histopathology Image-Based Cancer Classification Utilizing Transfer Learning Approach
Amir Meydani - Alireza Meidani - Ali Ramezani - Maryam Shabani - Mohammad Mehdi Kazeminasab - Shahriar Shahablavasani
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
An optimal workflow scheduling method in cloud-fog computing using three-objective Harris-Hawks algorithm
Ahmadreza Montazerolghaem - Maryam Khosravi - Fatemeh Rezaee
DEW-WIN: A Dynamic Energy-aware Window-based Scheduler for Mixed-criticality Systems
Mahin Moradiyan - Yasser Sedaghat - Pouria Hosseini - Yousef Rezazadeh
Introducing Meta-Contrastive Adaptive Autoencoder to Tackle Cold-Start Challenges in Sparse Domains
Hossein Rashid - Erfan Arzhmand - Fatemeh Hosseini
Class-Aware Balanced Point Cloud Donwsampling for Efficient Large-Scale 3D Scene Understanding
Mohammad Yousefipour - Marjan Naderan - Morteza Jaderyan
T-Rank: Graph Data Analytics for Urban Traffic Modeling
Alireza Safarpour - Iman Gholampour - Amirhossain Aghazadeh Fard - Seyed Mohammad Karbasi
MIPS-Core Application Specific Instruction-Set Processor for IDEA Cryptography − Comparison between Single-Cycle and Multi-Cycle Architectures
Ahmad Ahmadi - Reza Faghih Mirzaee
more
Samin Hamayesh - Version 43.7.0