13th International Conference on Computer and Knowledge Engineering
Compressing Deep Neural Networks Using Explainable AI
Authors: Kimia Soroush (1), Mohsen Raji (2), Behnam Ghavami (3)
1- Shiraz University
2- Shiraz University
3- Shahid Bahonar University of Kerman
Keywords: Deep Neural Networks, Compression, Explainable-AI
Abstract:
Deep neural networks (DNNs) have demonstrated remarkable performance in many tasks, but this performance often comes at a high computational cost and memory usage. Compression techniques such as pruning and quantization are applied to reduce the memory footprint of DNNs so that they can be deployed on resource-constrained edge devices. Recently, explainable artificial intelligence (XAI) methods have been introduced to understand and explain AI models. XAI can be used to examine the inner workings of DNNs, such as the importance of different neurons and features to overall performance. In this paper, a novel DNN compression approach using XAI is proposed to efficiently reduce the DNN model size with negligible accuracy loss. In the proposed approach, the importance scores of the DNN parameters (i.e., weights) are computed using a gradient-based XAI technique called Layer-wise Relevance Propagation (LRP). The scores are then used to compress the DNN in two steps: 1) parameters with negative or zero importance scores are pruned and removed from the model, and 2) mixed-precision quantization is applied so that weights with higher scores are quantized with more bits and weights with lower scores with fewer bits. The experimental results show that the proposed compression approach reduces the model size by 64% while improving accuracy by 42% compared to the state-of-the-art XAI-based compression method.
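The two compression steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the per-weight importance scores have already been computed (for example by LRP), and the function names `quantize_uniform` and `compress`, the 8-bit/4-bit widths, the median split between "higher" and "lower" scores, and the symmetric uniform quantizer are all hypothetical choices made only for illustration.

```python
from statistics import median


def quantize_uniform(values, bits):
    """Symmetric uniform quantization of a list of floats to `bits` bits.

    Assumed quantizer for illustration; the abstract does not specify one.
    """
    if not values:
        return []
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return [0.0 for _ in values]
    levels = 2 ** (bits - 1) - 1  # 127 for 8 bits, 7 for 4 bits
    scale = max_abs / levels
    return [round(v / scale) * scale for v in values]


def compress(weights, scores, high_bits=8, low_bits=4):
    """Prune weights whose score is <= 0, then quantize the rest in two groups.

    Returns (compressed_weights, bit_widths). A bit width of 0 marks a
    pruned weight. Bit widths and the median threshold are assumptions.
    """
    if len(weights) != len(scores):
        raise ValueError("weights and scores must have the same length")

    # Step 1: keep only weights with a strictly positive importance score.
    kept = [i for i, s in enumerate(scores) if s > 0]
    out = [0.0] * len(weights)
    bits = [0] * len(weights)
    if not kept:
        return out, bits

    # Step 2: split surviving weights at the median score and give the
    # more important half the larger bit width.
    threshold = median(scores[i] for i in kept)
    high = [i for i in kept if scores[i] >= threshold]
    low = [i for i in kept if scores[i] < threshold]
    for group, width in ((high, high_bits), (low, low_bits)):
        quantized = quantize_uniform([weights[i] for i in group], width)
        for i, v in zip(group, quantized):
            out[i] = v
            bits[i] = width
    return out, bits


if __name__ == "__main__":
    w, b = compress([0.5, -0.3, 0.8, 0.1], [0.9, -0.2, 0.0, 0.4])
    print(w, b)
```

In this sketch a pruned weight is stored as 0.0 with a bit width of 0; an actual deployment would also need a sparse storage format to turn the pruning into real memory savings.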