0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
FAST: FPGA Acceleration of Neural Networks Training
Authors :
Alireza Borhani
1
Mohammad Hossein Goharinejad
2
Hamid Reza Zarandi
3
1- Department of Computer Engineering, Amirkabir university of technology
2- Department of Computer Engineering, Amirkabir university of technology
3- Department of Computer Engineering, Amirkabir university of technology
Keywords :
Field Programmable Gate Array،Embedded Devices،Artificial Neural Network،Machine Learning،Approximation
Abstract :
Training state-of-the-art ANNs is computationally and memory intensive. Thus, implementing the training on embedded devices with limited resources is challenging. In order to address this challenge, we propose FAST, a low-precision method to implement and optimize ANN training on FPGA. FAST first addresses the challenge of implementing the non-polynomial sigmoid activation function by presenting a solution using PNLA methods. Then, it introduces Hardware Optimized PReLU (HOPE) activation function, which is specifically devised to reduce the required resources and increase the accuracy of computations on FPGA. We evaluated FAST against the software implementations of ANNs, using training tasks available in the MNIST benchmark. The results show that FAST improves the training speed by 8.6× and reduces the required memory size by orders of magnitude. It is worthwhile to mention that the method imposes almost no degradation in training accuracy.
Papers List
List of archived papers
Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
Fatemeh Moradi - Mehran Tarif - Mohammadhossein Homaei
Weakly Supervised Convolutional Neural Network for Automatic Gleason Grading of Prostate Cancer
Maryam Kamareh - Mohammad Sadegh Helfroush - Kamran Kazemi
Improving LoRaWAN Scalability for IoT Applications using Context Information
Hamed Mahmoudi - Behrouz ShahgholiGhahfarokhi
EEMC: Energy Efficient Multi-Clustering Using Grey Wolf Optimizer in WSNs
Maryam Ghorbanvirdi - Sayyed Majid Mazinani
A Cost-Sensitive Genetic Algorithm for Customer Segmentation in Auto Insurances
Alireza Khajenoori - Mohammad Saniee Abadeh - Mohsen Mohammadzadeh
Decentralized Federated Learning in IoT Environments: A Hierarchical Approach
Majid Mohammadpour - Seyedakbar Mostafavi
Robustness Scan of Digital Circuits Using Convolutional Neural Networks
Mobin Vaziri - Mohammad Mehdi Rahimifar - Hadi Jahanirad
Adaptive Hybrid TRCA–CORRCA algorithm for enhanced accuracy in SSVEP-based brain-computer interfaces
Sepehr Tayebeh Khabbaz - Sina Tayebeh Khabbaz - Arshia Barani - Arsalan Ganjeh - Sasan Harifi - Seyed Mohsen Mirhosseini
Graph-Cut-Based Semantic Optimization for Temporal Action Segmentation
Mohanna Ansari - Ehsan Fazl-Ersi
SASIAF, An Scalable Accelerator For Seismic Imaging on Amazon AWS FPGAs
Mostafa Koraei - S.Omid Fatemi
more
Samin Hamayesh - Version 43.7.0