0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
FAST: FPGA Acceleration of Neural Networks Training
Authors :
Alireza Borhani
1
Mohammad Hossein Goharinejad
2
Hamid Reza Zarandi
3
1- Department of Computer Engineering, Amirkabir university of technology
2- Department of Computer Engineering, Amirkabir university of technology
3- Department of Computer Engineering, Amirkabir university of technology
Keywords :
Field Programmable Gate Array،Embedded Devices،Artificial Neural Network،Machine Learning،Approximation
Abstract :
Training state-of-the-art ANNs is computationally and memory intensive. Thus, implementing the training on embedded devices with limited resources is challenging. In order to address this challenge, we propose FAST, a low-precision method to implement and optimize ANN training on FPGA. FAST first addresses the challenge of implementing the non-polynomial sigmoid activation function by presenting a solution using PNLA methods. Then, it introduces Hardware Optimized PReLU (HOPE) activation function, which is specifically devised to reduce the required resources and increase the accuracy of computations on FPGA. We evaluated FAST against the software implementations of ANNs, using training tasks available in the MNIST benchmark. The results show that FAST improves the training speed by 8.6× and reduces the required memory size by orders of magnitude. It is worthwhile to mention that the method imposes almost no degradation in training accuracy.
Papers List
List of archived papers
Improving ADHD Detection with Cost-Sensitive LightGBM
Behnam Yousefimehr - Mehdi Ghatee - Ali Heydari
Divide and Conquer Approach to Long Genomic Sequence Alignment
Mahmoud Naghibzadeh - Samira Babaei - Behshid Behkmal - Mojtaba Hatami
Enhancing Cloud Security with Federated CNN-LSTM: A Novel Approach to Intrusion Detection
Reyhaneh Ilaghi - Raheleh Ilaghi - Fereshteh Rahmani - Seyyed hamid Ghafoori
Classification of Audio Streaming in Network Traffic Based on Machine Learning Methods
Mohammad Nikbakht - Mehdi Teimouri
Collaborative LLM Reasoning for Vulnerability Detection in Smart Contracts
Amirreza Samari - Parsa Hedayatnia - Seyyed Javad Bozorgzadeh Razavi - Mohammad Allahbakhsh - Haleh Amintoosi
Improve the utility of tensor cores by compacting sparse matrix technique
Mohammad.S Abazari - Mahsa Zahedi - Abdorreza Savadi
Classification of benign and malignant tumors in Digital Breast Tomosynthesis images using Radiomic-based methods
Farangis Sajadi moghadam - Saeid Rashidi
FarSick: A Persian Semantic Textual Similarity And Natural Language Inference Dataset
Zahra Ghasemi - Mohammad Ali Keyvanrad
Lightweight Local Transformer for COVID-19 Detection Using Chest CT Scans
Hojat Asgarian Dehkordi - Hossein Kashiani - Amir Abbas Hamidi Imani - Shahriar Baradaran Shokouhi
Blind image quality assessment based on Multi-resolution Local Structures
Seyed Majid Khorashadizadeh - Mehdi Sadeghi Bakhi - Fatemeh Seifishahpar - AliMohammad Latif
more
Samin Hamayesh - Version 42.7.0