0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
SAT Based Analogy Evaluation Framework For Persian Word Embeddings
Authors :
Seyed Ehsan Mahmoudi
1
Mehrnoush Shamsfard
2
1- Computer Science and Engineering Department, Shahid Beheshti University, Tehran, Iran
2- Faculty of Computer Science and Engineering, Shahid Beheshti University
Keywords :
Analogy test،Semantic Similarity،Embedding Evaluation،Persian Langugae Processing
Abstract :
In recent years there has been a special interest in word embeddings as an approach to convert words to vectors. It has been a focal point to understand how much of the semantics of the words has been transferred into embedding vectors. Intrinsic evaluation of word embeddings is cheaper than evaluating them extrinsically and it is usually costly to evaluate the downstream application end-to-end in order to determine the quality of the used embedding model. Generally the word embeddings are evaluated through a number of tests, including analogy test. In this paper we propose a test framework for Persian embedding models. Persian is a low resource language and there is no rich semantic benchmark to evaluate word embedding models for this language. In this paper we introduce an evaluation framework including a hand crafted Persian SAT based analogy dataset, a colliquial test set (specific to Persian) and a benchmark to study the impact of various parameters on the semantic evaluation task.
Papers List
List of archived papers
A scalable blockchain-based educational network for data storage and assessment
Maryam Fattahi Vanani - Hamidreza Shayegh Borujeni - Ali Nourollah
An Improved and Accurate Measure for Mining Correlated High-utility Itemsets
Amir Masoud Heidari Orojloo - Morteza Keshtkaran
DevRanker: An Effective Approach to Rank Developers for Bug Report Assignment
Mohammad Reza Kardoost - Mohammad Reza Moosavi - Reza Akbari
Deep Learning-Based Malaysian Sign Language (MSL) Recognition: Exploring the Impact of Color Spaces
Ervin Gubin Moung - Precilla Fiona Suwek - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Wei Leong Khong
Optimizing Text-Based Protocol Clustering in Reverse Engineering with Auto-Encoders and Fine-Tuned Parameters
Shiva Mahmoudzadeh - Mohaddese Nemati - Mehdi Teimouri
MultiPath ViT OCR: A Lightweight Visual Transformer-based License Plate Optical Character Recognition
Alireza Azadbakht - Saeed Reza Kheradpisheh - Hadi Farahani
Cross-project Defect Prediction with An Enhanced Transfer Boosting Algorithm
Nazgol Nikravesh - Mohammad Reza Keyvanpour
Improving LoRaWAN Scalability for IoT Applications using Context Information
Hamed Mahmoudi - Behrouz ShahgholiGhahfarokhi
Decentralized Federated Learning in IoT Environments: A Hierarchical Approach
Majid Mohammadpour - Seyedakbar Mostafavi
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
more
Samin Hamayesh - Version 41.5.3