13th International Conference on Computer and Knowledge Engineering
Pruning and Mixed Precision Techniques for Accelerating Neural Network
Authors :
Mahsa Zahedi (1), Mohammad Sediq Abazari Bozhgani (2), Abdorreza Savadi (3)
1- Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
2- Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
3- Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
Keywords :
Prune, Mixed Precision, Neural Network, Machine Learning, Image Processing
Abstract :
This study investigates pruning and mixed precision as techniques for accelerating neural network inference, using the AlexNet model trained on the MNIST dataset. Pruning removes unnecessary components of the network, while mixed precision improves memory and computation efficiency. Structured pruning is applied to produce a pruned model that achieves a shorter inference time than the baseline. Automatic mixed precision is also employed and likewise improves inference speed. Combining pruning and mixed precision in a single model yields the fastest inference, surpassing either technique applied alone. The results highlight the potential of the combined approach to reduce network size, optimize memory utilization, and accelerate computation. The findings offer guidance for integrating these techniques into the AlexNet model and lay the groundwork for future work on larger and more complex models. This work holds promise for developing faster and more efficient deep learning models that meet the demands of real-world applications, particularly in resource-constrained environments.
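The two techniques named in the abstract can be illustrated with a minimal sketch on a toy linear layer. This is not the authors' implementation: the abstract does not state the pruning criterion or the framework used, so the L1-norm row ranking, the `keep_ratio` parameter, and the helper names `to_half`, `prune_rows`, and `linear_mixed` are all assumptions made for illustration. Half precision is simulated with the standard library rather than a deep learning framework.

```python
import struct


def to_half(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def prune_rows(weights, keep_ratio):
    """Structured pruning (assumed criterion): drop whole output rows
    with the smallest L1 norm, keeping the original row order."""
    n_keep = max(1, round(len(weights) * keep_ratio))
    norms = [sum(abs(w) for w in row) for row in weights]
    ranked = sorted(range(len(weights)), key=lambda i: norms[i], reverse=True)
    keep = sorted(ranked[:n_keep])
    return [weights[i] for i in keep], keep


def linear_mixed(weights, x):
    """Mixed precision: half-precision operands, higher-precision accumulator."""
    out = []
    for row in weights:
        acc = 0.0  # the running sum is not rounded to half precision
        for w, v in zip(row, x):
            acc += to_half(w) * to_half(v)
        out.append(acc)
    return out


# Toy 4x3 weight matrix (hypothetical values): rows 1 and 3 are near zero.
weights = [
    [0.5, -0.25, 1.0],
    [0.01, 0.02, -0.01],
    [-1.5, 0.75, 0.5],
    [0.001, 0.0, 0.002],
]
x = [1.0, 2.0, 3.0]

pruned, kept = prune_rows(weights, keep_ratio=0.5)
y = linear_mixed(pruned, x)
print(kept, y)
```

In this sketch the pruned layer has half as many rows, so it performs half as many multiply-accumulate operations, and the half-precision operands need half the storage of single precision. Whether such savings translate into the inference speedups reported in the paper depends on the hardware and framework.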