0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Pruning and Mixed Precision Techniques for Accelerating Neural Network
Authors :
Mahsa Zahedi
1
Mohammad Sediq Abazari Bozhgani
2
Abdorreza Savadi
3
1- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
2- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
3- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
Keywords :
Prune،Mixed Precision،Neural Network،Machine Learning،image processing
Abstract :
This study investigates the use of pruning and mixed precision techniques to enhance neural network performance, focusing on the AlexNet model trained on the MNIST dataset. Pruning removes unnecessary components, while mixed precision optimizes memory and computation efficiency. The study applies structured pruning to create a pruned model, which achieves improved inference time compared to the baseline. Automatic mixed precision is also employed, further enhancing inference speed. Combining pruning and mixed precision in a single model yields superior performance, surpassing the individual approaches. The combined model achieves significantly faster inference time by leveraging both techniques. The research highlights the potential of combining pruning and mixed precision for faster and more efficient neural network computations, reducing network size, optimizing memory utilization, and accelerating computations. The findings provide valuable insights for integrating these techniques into the AlexNet model and lay the groundwork for future exploration in larger and more complex models. This work holds promise for developing faster and more efficient deep learning models to meet the demands of real-world applications, particularly in resource-constrained environments
Papers List
List of archived papers
Identification of Botnets and Nodes Attacking Smart Cities by Majority Voting Mechanism and Feature Selection
Maliheh Araghchi - Nazbanoo Farzaneh
A New Application of Machine Learning Based Methods for Disk Space Variation Fault Diagnosis in Transformer Windings
Reza Behkam - Amir Lotfi - Gevork B. Gharehpetian
Optimizing the controller placement problem in SDN with uncertain parameters with robust optimization
Mohammad Kazemi - AhmadReza Montazerolghaem
Predicting cascading failure with machine learning methods in the interdependent networks
Mohamad Hossein Maghsoodi - Mohamad Khansari
Hybrid navigation based on GPS data and SIFT-based place recognition using Biologically-inspired SLAM
Sahar Salimpour Kasebi - Hadi Seyedarabi - Javad Musevi Niya
A Vision-Based Method for Human Activity Recognition Using Local Binary Pattern
Babak Goodarzi - Reza Javidan - Mohammad Sadegh Rezaei
Optimal PMU Placement Considering Reliability of Measurement System in Smart Grids
Mohammad Shahraeini - Shahla Khormali - Ahad Alvandi
Classification of Audio Streaming in Network Traffic Based on Machine Learning Methods
Mohammad Nikbakht - Mehdi Teimouri
Energy Efficient Power Allocation in MIMO-NOMA Systems with ZF Receiver Beamforming in Multiple Clusters
Mahdi Nangir - Abdolrasoul Sakhaei Gharagezlou - Nima Imani
CSI-Based Human Activity Recognition using Convolutional Neural Networks
Parisa Fard Moshiri - Mohammad Nabati - Reza Shahbazian - Seyed Ali Ghorashi
more
Samin Hamayesh - Version 41.5.3