0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
FAST: FPGA Acceleration of Neural Networks Training
Authors :
Alireza Borhani
1
Mohammad Hossein Goharinejad
2
Hamid Reza Zarandi
3
1- Department of Computer Engineering, Amirkabir university of technology
2- Department of Computer Engineering, Amirkabir university of technology
3- Department of Computer Engineering, Amirkabir university of technology
Keywords :
Field Programmable Gate Array،Embedded Devices،Artificial Neural Network،Machine Learning،Approximation
Abstract :
Training state-of-the-art ANNs is computationally and memory intensive. Thus, implementing the training on embedded devices with limited resources is challenging. In order to address this challenge, we propose FAST, a low-precision method to implement and optimize ANN training on FPGA. FAST first addresses the challenge of implementing the non-polynomial sigmoid activation function by presenting a solution using PNLA methods. Then, it introduces Hardware Optimized PReLU (HOPE) activation function, which is specifically devised to reduce the required resources and increase the accuracy of computations on FPGA. We evaluated FAST against the software implementations of ANNs, using training tasks available in the MNIST benchmark. The results show that FAST improves the training speed by 8.6× and reduces the required memory size by orders of magnitude. It is worthwhile to mention that the method imposes almost no degradation in training accuracy.
Papers List
List of archived papers
Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm
Zaniar Sharifi - Khabat Soltanian - Ali Amiri
Evaluation of Efficient Electrocardiomatrix-based Identification Using Deep Learning Methods
Amirhossein Safari - Narges Mokhtari - Mohsen Hooshmand - Sadegh Sadeghi - Peyman Pahlevani
Emotion Recognition In Persian Speech Using Deep Neural Networks
Ali Yazdani - Hossein Simchi - Yasser Shekofteh
Information Theoretic Learning-based Deep Embedded Clustering (ITL-DEC)
Hoda Shad - Mona Zamiri - Tahereh Bahreini - Reza Monsefi - Ghoshe Abed Hodtani
Improved TrustChain for Lightweight Devices
Seyed Salar Ghazi - Haleh Amintoosi
Classification of benign and malignant tumors in Digital Breast Tomosynthesis images using Radiomic-based methods
Farangis Sajadi moghadam - Saeid Rashidi
A routing method with the approach of reducing energy consumption in WSNs with the Jellyfish Search (JS) optimizer algorithm and unequal clustering
Ehsan Gholami - Javad Hamidzadeh
SGFL: A Federated Learning Approach for Non-IID Data Using Semi-Supervised DCGAN
Alireza Rabiee - Abolfazl Ajdarloo - Mohsen Rahmani
Optimizing MR Image Registration for Accurate Brain Volume Measurement in Children with Autism Spectrum Disorder
Shiva Sanati - Mahdi Saadatmand
Hybrid Vision Transformer for Detection of Dentigerous Cysts in Dental Radiography Images
Reza Tavasoli - Arya VarastehNezhad - Hamed Farbeh
more
Samin Hamayesh - Version 41.7.6