0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
Compressing Deep Neural Networks Using Explainable AI
Authors :
Kimia Soroush
1
Mohsen Raji
2
Behnam Ghavami
3
1- Shiraz university
2- Shiraz university
3- Shahid Bahonar University of Kerman
Keywords :
Deep Neural Networks،Compression،Explainable-AI
Abstract :
Abstract— Deep neural networks (DNNs) have demonstrated remarkable performance in many tasks but it often comes at a high computational cost and memory usage. Compression techniques, such as pruning and quantization, are applied to reduce the memory footprint of DNNs and make it possible to accommodate them on resource-constrained edge devices. Recently, explainable artificial intelligence (XAI) methods have been introduced with the purpose of understanding and explaining AI methods. XAI can be utilized to get to know the inner functioning of DNNs, such as the importance of different neurons and features in the overall performance of DNNs. In this paper, a novel DNN compression approach using XAI is proposed to efficiently reduce the DNN model size with negligible accuracy loss. In the proposed approach, the importance score of DNN parameters (i.e. weights) are computed using a gradient-based XAI technique called Layer-wise Relevance Propagation (LRP). Then, the scores are used to compress the DNN as follows: 1) the parameters with the negative or zero importance scores are pruned and removed from the model, 2) mixed-precision quantization is applied to quantize the weights with higher/lower score with higher/lower number of bits. The experimental results show that, the proposed compression approach reduces the model size by 64% while the accuracy is improved by 42% compared to the state-of-the-art XAI-based compression method.
Papers List
List of archived papers
Evolutionary Approach to GAN Hyperparameter Tuning: Minimizing Discriminator and Generator Loss Functions
Sajad Haghzad Klidbary - Anahita Babaei - Ramin Ghorbani
A Language-Independent Approach to Classification of Textual File Fragments: Case Study of Persian, English, and Chinese Languages
Fatemeh Mansouri Hanis - Hamidreza Khoshvaghti - Mehdi Teimouri - Hadi Veisi
Atlas-based segmentation of cardiac chambers in systolic and diastolic phases of echocardiographic images
Elham Fathipour - Mahdi Saadatmand
Improved TrustChain for Lightweight Devices
Seyed Salar Ghazi - Haleh Amintoosi
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Introducing E4MT and LMBNC: Persian pre-processing utilities
Zakieh Shakeri - Mehran Ziabary - Behrooz Vedadian - Fatemeh Azadi - Saeed Torabzadeh - Arian Atefi
Swin-RSCBNet: A Transformer-Based Network for Skin Cancer Segmentation with Multi-Scale and Attention Modules
Benyamin Mirab Golkhatmi - Mostafa Heydari - Mahboobeh Houshmand - Seyyed Abed Hosseini
The process of multi class fake news dataset generation
Sajjad Rezaei - Mohsen Kahani - Behshid Behkamal
Diagnosis of Depression Based on New Features Extractive from the Frequency Space of the EEG
Melika Changizi - Saeid Rashidi
Deep Learning Feature Extraction for COVID-19 Detection Algorithm using Computerized Tomography Scan
Maisarah Mohd Sufian - Ervin Gubin Moung - Chong Joon Hou - Ali Farzamnia
more
Samin Hamayesh - Version 43.7.0