0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
PersianILP: Construction and Evaluation of a Standard Persian Dataset for Inductive Link Prediction
Authors :
Mohammad Rahimi
1
Afsaneh Fatemi
2
Ahmad Baraani
3
1- Dept. of Software Engineering
2- Dept. of Software Engineering
3- Dept. of Software Engineering
Keywords :
Inductive Link Prediction،Knowledge Graph Completion،PersianILP Dataset
Abstract :
Link prediction in knowledge graphs is a key task aimed at addressing the challenge of graph sparsity. In inductive link prediction, a model is trained on one graph and evaluated on another containing unseen entities. While twelve inductive datasets have been introduced for English to benchmark models in this domain, no such dataset exists for Persian. This study introduces PersianILP, the first Persian dataset designed for inductive link prediction. PersianILP is constructed through a purposeful combination of real-world data extracted from the FarsiBase knowledge graph and synthetic data generated using the DeepSeek language model. To evaluate PersianILP, key criteria such as structural and semantic diversity, statistical alignment between synthetic and real data, and adherence to inductive evaluation principles were considered. The dataset is compared with twelve benchmark datasets, including WN18RR, FB237, and NELL995. PersianILP contains 16,306 semantic triples, 10,693 entities, and 432 unique relations, exhibiting a highly sparse structure with a sparsity rate of 0.99. Evaluation using a baseline inductive link prediction model confirms the dataset’s high quality and effectiveness. Statistical analyses further demonstrate that PersianILP meets all essential requirements for research in inductive link prediction and can serve as a standard resource for studies in Persian language processing, semantic web, and recommender systems.
Papers List
List of archived papers
Two-step thermal-aware routing algorithm in 3D NoC
Majid Nezarat - Masoume Momeni
An Attention-Based Model for Clinical Time Series Prediction: Enhancing ICU Readmission Prediction
Hananeh Sadat Madinei - Mohammad Reza Keyvanpour - Seyed Vahab Shojaedini
Token-Based Access Control for Inter-organization Collaboration in Hyperldger Fabric
Parsa Hedayatnia - Mohammad Ata Jalilian - Mohammad Allahbakhsh - Haleh Amintoosi
Forecasting El Niño Six Months in Advance Utilizing Augmented Convolutional Neural Network
Mohammad Naisipour - Iraj Saeedpanah - Arash Adib - Mohammad Hossein Neisi Pour
African Vultures Optimization Algorithm for Optimal Damping Controllers Design in the Electrical Power Grid System
Aliyu Sabo - Theophilus Ebuka Odoh - Samuel Habu - Hossein Shahinzadeh - Farshad Ebrahimi
GAP: Fault tolerance Improvement of Convolutional Neural Networks through GAN-aided Pruning
Pouya Hosseinzadeh - Yasser Sedaghat - Ahad Harati
Overview of Electric Vehicles Charging Stations in Smart Grids
Mohammed Wadi - Wisam Elmasry - Mohammed Jouda - Hossein Shahinzadeh - Gevork B. Gharehpetian
Lossless Watermarking in Encrypted Triangular Mesh Models Based on Optimized Vertex Estimation and Error Histogram Shifting
Alireza Ghaemi - Habibollah Danyali - Kamran Kazemi - Zahra Qodrati - Amirhossein Ghaemi - Seyedeh Masoumeh Taji
AL-YOLO: Accurate and Lightweight Vehicle and Pedestrian Detector in Foggy Weather
Behdad Sadeghian Pour - Hamidreza Mohammadi Jozani - Shahriar Baradaran Shokouhi
A Vision-Based Method for Human Activity Recognition Using Local Binary Pattern
Babak Goodarzi - Reza Javidan - Mohammad Sadegh Rezaei
more
Samin Hamayesh - Version 43.7.0