12th International Conference on Computer and Knowledge Engineering
FAST: FPGA Acceleration of Neural Networks Training
Authors :
Alireza Borhani (1), Mohammad Hossein Goharinejad (2), Hamid Reza Zarandi (3)
1- Department of Computer Engineering, Amirkabir University of Technology
2- Department of Computer Engineering, Amirkabir University of Technology
3- Department of Computer Engineering, Amirkabir University of Technology
Keywords :
Field Programmable Gate Array, Embedded Devices, Artificial Neural Network, Machine Learning, Approximation
Abstract :
Training state-of-the-art artificial neural networks (ANNs) is computationally and memory intensive, so implementing training on resource-constrained embedded devices is challenging. To address this challenge, we propose FAST, a low-precision method for implementing and optimizing ANN training on FPGAs. FAST first tackles the difficulty of implementing the non-polynomial sigmoid activation function in hardware by presenting a solution based on PNLA methods. It then introduces the Hardware Optimized PReLU (HOPE) activation function, which is specifically devised to reduce the required resources and increase the accuracy of computations on FPGAs. We evaluated FAST against software implementations of ANNs using training tasks from the MNIST benchmark. The results show that FAST improves training speed by 8.6× and reduces the required memory size by orders of magnitude, with almost no degradation in training accuracy.