0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Pruning and Mixed Precision Techniques for Accelerating Neural Network
Authors :
Mahsa Zahedi
1
Mohammad Sediq Abazari Bozhgani
2
Abdorreza Savadi
3
1- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
2- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
3- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
Keywords :
Prune،Mixed Precision،Neural Network،Machine Learning،image processing
Abstract :
This study investigates the use of pruning and mixed precision techniques to enhance neural network performance, focusing on the AlexNet model trained on the MNIST dataset. Pruning removes unnecessary components, while mixed precision optimizes memory and computation efficiency. The study applies structured pruning to create a pruned model, which achieves improved inference time compared to the baseline. Automatic mixed precision is also employed, further enhancing inference speed. Combining pruning and mixed precision in a single model yields superior performance, surpassing the individual approaches. The combined model achieves significantly faster inference time by leveraging both techniques. The research highlights the potential of combining pruning and mixed precision for faster and more efficient neural network computations, reducing network size, optimizing memory utilization, and accelerating computations. The findings provide valuable insights for integrating these techniques into the AlexNet model and lay the groundwork for future exploration in larger and more complex models. This work holds promise for developing faster and more efficient deep learning models to meet the demands of real-world applications, particularly in resource-constrained environments
Papers List
List of archived papers
Multi Model CNN Based Gas Meter Characters Recognition
Sanaz Tarhib - Jafar Tanha - Soodabeh Imanzadeh - Sahar Hassanzadeh Mostafaei
Automatic Detection and Risk Assessment of Session Management Vulnerabilities in Web Applications
Nasrin Garmabi - Mohammad Ali Hadavi
Optimizing MR Image Registration for Accurate Brain Volume Measurement in Children with Autism Spectrum Disorder
Shiva Sanati - Mahdi Saadatmand
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
An Efficient Planning Method for Autonomous Navigation of a Wheeled-Robot based on Deep Reinforcement Learning
Ali Salimi Sadr - Mahdi Shahbazi Khojasteh - Hamed Malek - Armin Salimi-Badr
Stock market prediction using multi-objective optimization
Mahshid Zolfaghari - Hamid Fadishei - Mohsen Tajgardan - Reza Khoshkangini
Vision-Based Obstacle Avoidance in Drone Navigation using Deep Reinforcement Learning
Pooyan Rahmanzadeh Gervi - Ahad Harati - Sayed Kamaledin Ghiasi-Shirazi
Investigation of topological characteristics of Iranian railway network: A network science approach
Sina Firuzbakht - Mohammad Khansari
Multi-Fusion Ensemble CNN for Drug–Target Binding Affinity Prediction Using Transformer-Based Molecular and Protein Representations
Betsabeh Tanoori
Trust Management Enhancement for the Internet of Things: a Smart Contract Approach
Amin Rouzbahani - Fattaneh Taghiyareh
more
Samin Hamayesh - Version 43.7.0