0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Pruning and Mixed Precision Techniques for Accelerating Neural Network
Authors :
Mahsa Zahedi
1
Mohammad Sediq Abazari Bozhgani
2
Abdorreza Savadi
3
1- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
2- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
3- Department of Computer Engineering Ferdowsi University of Mashhad Mashhad, Iran
Keywords :
Prune،Mixed Precision،Neural Network،Machine Learning،image processing
Abstract :
This study investigates the use of pruning and mixed precision techniques to enhance neural network performance, focusing on the AlexNet model trained on the MNIST dataset. Pruning removes unnecessary components, while mixed precision optimizes memory and computation efficiency. The study applies structured pruning to create a pruned model, which achieves improved inference time compared to the baseline. Automatic mixed precision is also employed, further enhancing inference speed. Combining pruning and mixed precision in a single model yields superior performance, surpassing the individual approaches. The combined model achieves significantly faster inference time by leveraging both techniques. The research highlights the potential of combining pruning and mixed precision for faster and more efficient neural network computations, reducing network size, optimizing memory utilization, and accelerating computations. The findings provide valuable insights for integrating these techniques into the AlexNet model and lay the groundwork for future exploration in larger and more complex models. This work holds promise for developing faster and more efficient deep learning models to meet the demands of real-world applications, particularly in resource-constrained environments
Papers List
List of archived papers
A Survey on Semi-Automated and Automated Approaches for Video Annotation
Samin Zare - Mehran Yazdi
A New Time Series Approach in Churn Prediction with Discriminatory Intervals
Hedieh Ahmadi - Seyed Mohammad Hossein Hasheminejad
Efficient Object Detection using Deep Reinforcement Learning and Capsule Networks
Sobhan Siamak - Eghbal Mansoori
Automatic Infrared-Based Volume and Mass Estimation System for Agricultural Products
Seyed Muhammad Hossein Mousavi - S. Muhammad Hassan Mosavi
Deep Learning Based High-Resolution Edge Detection for Microwave Imaging using a Variational Autoencoder
Seyed Reza Razavi Pour - Leila Ahmadi - Amir Ahmad Shishegar
Automatic Generation of XACML Code using Model-Driven Approach
Athareh Fatemian - Bahman Zamani - Marzieh Masoumi - Mehran Kamranpour - Behrouz Tork Ladani - Shekoufeh Kolahdouz Rahimi
SGFL: A Federated Learning Approach for Non-IID Data Using Semi-Supervised DCGAN
Alireza Rabiee - Abolfazl Ajdarloo - Mohsen Rahmani
InfOnto: An ontology for fashion influencer marketing based on Instagram
Somaye Sultani - Mohsen Kahani
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Amir Bidokhti - Shahrokh Ghaemmaghami
Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer
Javad Mirzapour Kaleybar - Hooman Saadat - Hooman Khaloo
more
Samin Hamayesh - Version 42.2.1