13th International Conference on Computer and Knowledge Engineering
Compressing Deep Neural Networks Using Explainable AI
Authors :
Kimia Soroush (1), Mohsen Raji (2), Behnam Ghavami (3)
1- Shiraz University
2- Shiraz University
3- Shahid Bahonar University of Kerman
Keywords :
Deep Neural Networks, Compression, Explainable-AI
Abstract :
Deep neural networks (DNNs) have demonstrated remarkable performance in many tasks, but this performance often comes at a high computational cost and memory usage. Compression techniques, such as pruning and quantization, are applied to reduce the memory footprint of DNNs and make it possible to deploy them on resource-constrained edge devices. Recently, explainable artificial intelligence (XAI) methods have been introduced to understand and explain AI models. XAI can be used to examine the inner workings of DNNs, such as the importance of different neurons and features to overall performance. In this paper, a novel DNN compression approach using XAI is proposed to efficiently reduce DNN model size with negligible accuracy loss. In the proposed approach, the importance scores of DNN parameters (i.e., weights) are computed using a gradient-based XAI technique called Layer-wise Relevance Propagation (LRP). The scores are then used to compress the DNN in two steps: 1) parameters with negative or zero importance scores are pruned and removed from the model; 2) mixed-precision quantization is applied, so that weights with higher scores are quantized with more bits and weights with lower scores with fewer bits. Experimental results show that the proposed compression approach reduces the model size by 64% while improving accuracy by 42% compared to the state-of-the-art XAI-based compression method.
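The two compression steps described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes the LRP importance scores have already been computed and are passed in as an array, and the names `compress` and `quantize_uniform`, the median split between "higher" and "lower" scores, and the 8-bit/4-bit widths are all hypothetical choices made only for illustration.

```python
import numpy as np


def quantize_uniform(values, bits):
    """Symmetric uniform quantization to `bits` bits (bits >= 2).

    Returns the dequantized float values so the rounding error is visible.
    """
    if values.size == 0:
        return values
    max_abs = np.max(np.abs(values))
    if max_abs == 0:
        return values
    levels = 2 ** (bits - 1) - 1          # e.g. 127 for 8 bits, 7 for 4 bits
    scale = max_abs / levels
    return np.round(values / scale) * scale


def compress(weights, scores, high_bits=8, low_bits=4):
    """Prune weights with score <= 0, then quantize the rest by score.

    `scores` stands in for precomputed LRP importance scores (assumption).
    The median threshold and the bit widths are illustrative assumptions,
    not values taken from the paper.
    """
    weights = np.asarray(weights, dtype=np.float64)
    scores = np.asarray(scores, dtype=np.float64)

    # Step 1: pruning. Weights with negative or zero importance become 0.
    keep = scores > 0
    compressed = np.zeros_like(weights)
    bits = np.zeros(weights.shape, dtype=np.int64)
    if not keep.any():
        return compressed, bits

    # Step 2: mixed-precision quantization. Higher scores get more bits.
    threshold = np.median(scores[keep])   # hypothetical split rule
    high = keep & (scores >= threshold)
    low = keep & (scores < threshold)
    compressed[high] = quantize_uniform(weights[high], high_bits)
    compressed[low] = quantize_uniform(weights[low], low_bits)
    bits[high] = high_bits
    bits[low] = low_bits
    return compressed, bits


# Toy usage with made-up weights and scores.
weights = [0.5, -1.0, 0.25, 2.0, -0.75]
scores = [0.9, -0.2, 0.0, 0.4, 0.1]
compressed, bits = compress(weights, scores)
print(compressed)
print(bits)
```

In this toy run the second and third weights are pruned because their scores are not positive, and the remaining weights are stored at 8 or 4 bits depending on which side of the median score they fall. A real pipeline would compute the scores per layer with an LRP implementation and would likely choose bit widths with a finer-grained rule.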