0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
Distilled BERT Model In Natural Language Processing
Authors :
Yazdan Zandiye Vakili
1
Avisa Fallah
2
Hedieh Sajedi
3
1- School of Mathematics, Statistics and Computer Science, University of Tehran, Tehran, Iran
2- School of Mathematics, Statistics and Computer Science, University of Tehran, Tehran, Iran
3- School of Mathematics, Statistics and Computer Science, University of Tehran, Tehran, Iran
Keywords :
NLP،Machine Learning،Distillation،BERT،Transformers
Abstract :
This paper reviews the evolution of Natural Language Processing (NLP) models, focusing on the distillation techniques used to create efficient and compact versions of large models. Traditional NLP models laid the foundation but had limitations in scalability and contextual understanding. Transformer models like BERT revolutionized NLP but required significant computational resources. This review examines TinyBERT, DistilBERT, MobileBERT, and MiniLM, which balance size and performance through knowledge distillation. These distilled models maintain high performance while being suitable for deployment on resource-constrained devices, making advanced NLP capabilities accessible in real-world applications.
Papers List
List of archived papers
MCRS-SAE : multi criteria recommender system based on sparse autoencoder
Amir reza Kalantarnezhad - Javad Hamidzadeh
Two-step thermal-aware routing algorithm in 3D NoC
Majid Nezarat - Masoume Momeni
A Novel Density-Based KNN in Pattern Recognition
Sajad Haghzad Klidbary - Abazar Arabameri
Crack Segmentation in Civil Structure Images Using a Deep Learning Based Multi-Classifier System
Mohammadreza Asadi - Seyedeh Sogand Hashemi - Mohammad Taghi Sadeghi
Investigating the Behavior of Generation Z Customers in Online Banking Services (Case Study of a Bank of Iran)
Elham Mahmoudabadi - Esmaeil Mollaahmadi
Human vs NotebookLM for Educational Podcasts: A Controlled Experiment on Two General Topics
Ali Banihashemi - Amirali Shahriary - Yadollah Yaghoobzadeh
UAV-based Firefighting by Multi-agent Reinforcement Learning
Reza Shami Tanha - Mohsen Hooshmand - Mohsen Afsharchi
Optimizing Magnetic Sensory Configuration for Gesture Recognition in Bionic Hands
Mehdi Alimohammadi - Arman Abasian - Mohammad Reza Akbarzadeh Totonchi
Impact of Oversampling Methods on Imbalanced Dataset for Software Fault Prediction
Alireza Abiri - Alireza Tajary - Mansoor Fateh
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
more
Samin Hamayesh - Version 42.7.0