11th International Conference on Computer and Knowledge Engineering
A Weighted TF-IDF-based Approach for Authorship Attribution
Authors :
Ali Abedzadeh (1), Reza Ramezani (2), Afsaneh Fatemi (3)
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
Keywords :
Authorship Attribution, Author Identification, Information Retrieval, Term Frequency, TF-IDF
Abstract :
Authorship Attribution (AA) is the task of automatically assigning a disputed text to one author chosen from a list of candidate authors. To this end, a model is trained on a dataset of textual documents with known authors, so the problem can be treated as multi-class, single-label classification. In this paper, we approach the task differently by extending information retrieval techniques to train an AA model. The proposed solution weights the AARR technique, presented in our previous study, to relax the value of term frequency. Its efficiency has been evaluated through several experiments on six datasets. The results show that the proposed solution improves accuracy on the IMDB, Gutenberg books, Poetry, Blogs, PAN2011, and Twitter datasets by 33%, 31%, 31%, 19%, 6%, and 1%, respectively, with an average improvement of 19.94% over all datasets. The best accuracy on these datasets is 88%, 82%, 67%, 90%, 65%, and 81%, respectively. In addition, by employing a dictionary-based indexing technique, the proposed solution runs 21.44 times faster than the baseline system.
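The abstract does not spell out the AARR weighting itself, so the following is only a minimal sketch of the general idea it describes: treating attribution as retrieval over per-author term profiles stored in a dictionary-based index. Everything here is an assumption for illustration, not the authors' method. The function names (`tokenize`, `build_index`, `attribute`), the whitespace tokenizer, the sublinear term frequency `1 + log(count)` used as one common way to dampen raw counts, and the smoothed IDF are all hypothetical choices.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokens; a real system would use richer features.
    return text.lower().split()


def build_index(docs):
    """Build a dictionary-based index from (author, text) pairs.

    Returns (index, norms): index maps term -> {author: weight},
    norms maps author -> Euclidean norm of that author's weight vector.
    """
    # Pool all training text of an author into one term-count profile.
    tf = defaultdict(Counter)
    for author, text in docs:
        tf[author].update(tokenize(text))

    # Document frequency counted over author profiles.
    n_authors = len(tf)
    df = Counter()
    for counts in tf.values():
        df.update(counts.keys())

    index = defaultdict(dict)
    for author, counts in tf.items():
        for term, count in counts.items():
            # Assumption: smoothed IDF and log-dampened TF, not the paper's weighting.
            idf = math.log((1 + n_authors) / (1 + df[term])) + 1.0
            index[term][author] = (1.0 + math.log(count)) * idf

    norms = {
        author: math.sqrt(sum(index[term][author] ** 2 for term in counts))
        for author, counts in tf.items()
    }
    return index, norms


def attribute(text, index, norms):
    """Return the best-scoring author for a disputed text, or None if no term matches."""
    scores = Counter()
    for term, count in Counter(tokenize(text)).items():
        query_weight = 1.0 + math.log(count)
        # Only authors that actually used the term are visited.
        for author, weight in index.get(term, {}).items():
            scores[author] += query_weight * weight / norms[author]
    return scores.most_common(1)[0][0] if scores else None


if __name__ == "__main__":
    training = [
        ("austen", "it is a truth universally acknowledged that a single man"),
        ("melville", "call me ishmael some years ago never mind how long precisely the whale"),
    ]
    index, norms = build_index(training)
    print(attribute("the whale and ishmael", index, norms))
```

Because the index is keyed by term, scoring a disputed text touches only the authors who used each query term rather than every author profile. That is the usual reason a dictionary-based index reduces computation time, although the speedup reported in the abstract refers to the authors' own implementation, not to this sketch.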