0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Pruning and Mixed Precision Techniques for Accelerating Neural Network
Authors :
Mahsa Zahedi
1
Mohammad Sediq Abazari Bozhgani
2
Abdorreza Savadi
3
1- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
2- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
3- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
Keywords :
Prune،Mixed Precision،Neural Network،Machine Learning،image processing
Abstract :
This study investigates the use of pruning and mixed precision techniques to enhance neural network performance, focusing on the AlexNet model trained on the MNIST dataset. Pruning removes unnecessary components, while mixed precision optimizes memory and computation efficiency. The study applies structured pruning to create a pruned model, which achieves improved inference time compared to the baseline. Automatic mixed precision is also employed, further enhancing inference speed. Combining pruning and mixed precision in a single model yields superior performance, surpassing the individual approaches. The combined model achieves significantly faster inference time by leveraging both techniques. The research highlights the potential of combining pruning and mixed precision for faster and more efficient neural network computations, reducing network size, optimizing memory utilization, and accelerating computations. The findings provide valuable insights for integrating these techniques into the AlexNet model and lay the groundwork for future exploration in larger and more complex models. This work holds promise for developing faster and more efficient deep learning models to meet the demands of real-world applications, particularly in resource-constrained environments
Papers List
List of archived papers
IranITJobs2021: a Dataset for Analyzing Iranian Online IT Job Advertisements Collected Using a New Crowdsourcing Process
Fakhroddin Noorbehbahani - Nikta Akbarpour - Mohammad Reza Saeidi
Dynamic Knowledge Enhanced Neural Fashion Trend Forecasting with Quantile Loss
Fatemeh Rooholamini - Reza Azmi - Mobina Khademhossein - Maral Zarvani
Pruning and Mixed Precision Techniques for Accelerating Neural Network
Mahsa Zahedi - Mohammad Sediq Abazari Bozhgani - Abdorreza Savadi
Sotfware defined content popularity estimation for wireless D2D caching networks
Maede Rezaei - AhmadReza Montazerolghaem
An Overview of Regression Methods in Early Prediction of Movie Ratings
Houmaan Chamani - Zhivar Sourati Hassanzadeh - Behnam Bahrak
Synthetic Trajectory Sharing Indoors under Privacy Constraints
Mahdi Soltanpour - Vahideh Moghtadaiee - Mina Alishahi
Taguchi Design of Experiments Application in Robust sEMG Based Force Estimation
Mohsen Ghanaei - Hadi Kalani - Alireza Akbarzadeh
Islamic Geometric algorithms: A survey
Elham Akbari - Azam Bastanfard
Bridging Knowledge and Language Models in Healthcare: A RAG Survey
Seyedali Hasanzadeh - Fahimeh Ghasemian - Elham Shabaninia
A New Hypercube Variant: Pruned Shuffle Connected Cube
Reza Latifi - Mahmoud Naghibzadeh
more
Samin Hamayesh - Version 43.7.0