0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
Improve the utility of tensor cores by compacting sparse matrix technique
Authors :
Mohammad.S Abazari
1
Mahsa Zahedi
2
Abdorreza Savadi
3
1- Ferdowsi university of mashhad
2- Ferdowsi university of mashhad
3- Ferdowsi university of mashhad
Keywords :
Tensor Cores،Neural Networks،Convolution Operations،Graphics Processing Unit
Abstract :
Neural networks have demanding computational requirements, particularly in matrix multiplication operations. To address this challenge, we propose a model that combines network pruning and matrix compression techniques. Our approach leverages NVIDIA's tensor cores, which excel at efficient matrix operations. We compress the network weights based on the tensor core structure and perform convolutions using the compressed weight matrix on the tensor cores. Our model incorporates neural network pruning, mixed-precision training, and compression of network weight tensors using the im2col algorithm and CSR format. We also utilize tensor kernels with a block size of 16x16 for multiplication. We evaluate the performance of various models, including pruned, AMP-optimized, combined pruning and AMP techniques, and our proposed model. Our evaluation reveals a significant improvement in performance compared to a simple baseline model. Through an extensive analysis of related works, we establish foundational concepts, present our proposed model, and share the obtained results.
Papers List
List of archived papers
Density Estimation Helps Adversarial Robustness
Afsaneh Hasanebrahimi - Bahareh Kaviani Baghbaderani - Reshad Hosseini - Ahmad Kalhor
Efficient Object Detection using Deep Reinforcement Learning and Capsule Networks
Sobhan Siamak - Eghbal Mansoori
Simulating Human Visual Cortex and Recall System with Convolutional Neural Networks
Sina Saadati - Abdolah Sepahvand
REMA: Reinforced Exponential Moving Average for Real-Time Anomaly Detection in Sensor Data
Mohammad Hossein Jafari Naeimi - Ali Norouzi - Athena Abdi
Vision-Based Obstacle Avoidance in Drone Navigation using Deep Reinforcement Learning
Pooyan Rahmanzadeh Gervi - Ahad Harati - Sayed Kamaledin Ghiasi-Shirazi
A Smart Electrochemical Biosensor for Arsenic Detection in Water
Keyvan Asefpour Vakilian
Enhancing EEG-based BCI Performances by Reducing Covariate Shift via Adaptive Multi-Domain Feature Extraction
Moein Radman - Reza Arghand - Nader Nariman-Zadeh - Ali Chaibakhsh
Adaptive Pattern Reconstruction Using Linear Regression for Improved TPS Anomaly Detection
Ali Azarsina - Alireza Safarzadeh - MohammadReza Jamali - Abdolhossein Vahabie
Hybrid Vision Transformer for Detection of Dentigerous Cysts in Dental Radiography Images
Reza Tavasoli - Arya VarastehNezhad - Hamed Farbeh
An intelligent linguistic error detection approach to automated diagnosis of Dyslexia disorder in Persian speaking children
Fatemeh Asghari - Mahsa Khorasani - Mohsen Kahani - Seyed Amir Amin Yazdi - Mahdi Arkhodi Ghalenoei
more
Samin Hamayesh - Version 43.7.0