0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Authors :
Amir Bidokhti
1
Shahrokh Ghaemmaghami
2
1- Department of Electrical Engineering. Sharif University of Technology. Tehran, Iran
2- Sharif University of Technology
Keywords :
deep learning،memory augmented neural networks،graph neural networks،question-answering،neural Turing machine
Abstract :
Memory is crucial for machine learning tasks on sequential data. From vanilla RNN to LSTM and memory augmented neural networks, researchers have investigated several types of memory structures. However, they suffer from limitations in the capacity or ability to keep track of long-term dependencies. This paper presents an external memory module composed of two distinct submodules that are inspired by memory in the human brain. Besides, a sleep mechanism is incorporated into this memory, which can mimic sleep's effects on improving human memory. The proposed method is fully differentiable; thus, backpropagation can be used for its training. Experiments conducted on the bAbI dataset show that the proposed method is successful in 16 out of 20 tasks, and the average error is 2.8%. The performance of the proposed method is far better than the conventional NTM, and it has the lowest prediction error in 7 out of 20 tasks among baseline systems. Besides, the proposed system is the only system that can solve tasks 16 and 17 of the bAbI dataset.
Papers List
List of archived papers
Weakly Supervised Convolutional Neural Network for Automatic Gleason Grading of Prostate Cancer
Maryam Kamareh - Mohammad Sadegh Helfroush - Kamran Kazemi
Enhanced Atrial Fibrillation (AF) Detection via Data Augmentation with Diffusion Model
Arash Vashagh - Amirhossein Akhoondkazemi - Sayed Jalal Zahabi - Davood Shafie
Efficient Prediction of Cardiovascular Disease via Extra Tree Feature Selection
Mina Abroodi - Mohammad Reza Keyvanpour - Ghazaleh Kakavand Teimoory
SASIAF, An Scalable Accelerator For Seismic Imaging on Amazon AWS FPGAs
Mostafa Koraei - S.Omid Fatemi
Vaccine Distribution Modelling in Pandemics through Multi-Agent Systems: COVID-19 Case
Hossein Yarahmadi - Mohammad Ebrahim Shiri - Hamid Reza Navidi - Arash Sharifi - Moharram Challenger - Hassan Piriaei
Fatty Liver Level Recognition Using Particle Swarm Optimization (PSO) Image Segmentation and Analysis
Seyed Muhammad Hossein Mousavi - Vyacheslav Lyashenko - Atiye Ilanloo - S. Younes Mirinezhad
Ramp Progressive Secret Image Sharing using Ensemble of Simple Methods
Atieh Mokhtari - Mohammad Taheri
Sensitivity Reliability Analysis of Power Distribution Networks Using Fuzzy Logic
Mohammed Wadi - Wisam Elmasry - Ismail Kucuk - Hossein Shahinzadeh
Optimal PMU Placement Considering Reliability of Measurement System in Smart Grids
Mohammad Shahraeini - Shahla Khormali - Ahad Alvandi
Lempel-Ziv-based Hyper-Heuristic Solution for Longest Common Subsequence Problem
Mahdi Nasrollahi - Reza Shami Tanha - Mohsen Hooshmand
more
Samin Hamayesh - Version 42.2.1