14th International Conference on Computer and Knowledge Engineering
Improve the utility of tensor cores by compacting sparse matrix technique
Authors :
Mohammad.S Abazari 1, Mahsa Zahedi 2, Abdorreza Savadi 3
1- Ferdowsi University of Mashhad
2- Ferdowsi University of Mashhad
3- Ferdowsi University of Mashhad
Keywords :
Tensor Cores, Neural Networks, Convolution Operations, Graphics Processing Unit
Abstract :
Neural networks are computationally demanding, particularly in their matrix multiplication operations. To address this, we propose a model that combines network pruning with matrix compression. Our approach targets NVIDIA's tensor cores, which are designed for efficient matrix operations: we compress the network weights to match the tensor core structure and perform convolutions on the tensor cores using the compressed weight matrix. The model incorporates neural network pruning, mixed-precision training, and compression of the network weight tensors using the im2col algorithm and the CSR (compressed sparse row) format. Multiplication uses tensor kernels with a block size of 16x16. We evaluate several configurations: a pruned model, an AMP-optimized (automatic mixed precision) model, a model combining pruning and AMP, and our proposed model. The evaluation shows a significant performance improvement over a simple baseline model. We also review related work, establish the foundational concepts, present the proposed model, and report the results.
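The pipeline named in the abstract (im2col unfolding, CSR storage of pruned weights, then a matrix product) can be illustrated with a minimal CPU-side sketch. This is not the authors' implementation: the function names `im2col`, `to_csr`, and `csr_matmul` are hypothetical, the code assumes a single-channel input with stride 1 and no padding, and it does not show the 16x16 tensor core tiling or mixed precision.

```python
def im2col(image, kh, kw):
    """Unfold a 2-D image into a (kh*kw) x num_patches matrix (stride 1, no padding)."""
    h, w = len(image), len(image[0])
    patches = []
    for i in range(h - kh + 1):
        for j in range(w - kw + 1):
            # Flatten the kh x kw window whose top-left corner is (i, j).
            patches.append([image[i + di][j + dj] for di in range(kh) for dj in range(kw)])
    # Transpose so that each patch becomes one column.
    return [list(row) for row in zip(*patches)]


def to_csr(dense):
    """Convert a dense matrix (list of rows) to CSR: (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for c, v in enumerate(row):
            if v != 0:
                values.append(v)
                col_idx.append(c)
        row_ptr.append(len(values))  # end offset of this row in `values`
    return values, col_idx, row_ptr


def csr_matmul(csr, rhs):
    """Multiply a CSR matrix by a dense matrix, touching only stored nonzeros."""
    values, col_idx, row_ptr = csr
    n_cols = len(rhs[0])
    out = []
    for r in range(len(row_ptr) - 1):
        acc = [0] * n_cols
        for k in range(row_ptr[r], row_ptr[r + 1]):
            v, c = values[k], col_idx[k]
            for j in range(n_cols):
                acc[j] += v * rhs[c][j]
        out.append(acc)
    return out


# Illustrative data (assumed, not from the paper): a 3x3 input and one
# pruned 2x2 filter [[1, 0], [0, -1]] flattened into a weight-matrix row.
image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
weights = [[1, 0, 0, -1]]

cols = im2col(image, 2, 2)        # 4 x 4 matrix, one column per patch
csr_weights = to_csr(weights)     # only the two nonzero weights are stored
result = csr_matmul(csr_weights, cols)
print(csr_weights)
print(result)
```

Pruning zeroes out weights, so the CSR form stores only the survivors; im2col turns the convolution into a single matrix product that a GPU kernel could then process in fixed-size tiles.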