11th International Conference on Computer and Knowledge Engineering
A Weighted TF-IDF-based Approach for Authorship Attribution
Authors :
Ali Abedzadeh (1), Reza Ramezani (2), Afsaneh Fatemi (3)
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
Keywords :
Authorship Attribution, Author Identification, Information Retrieval, Term Frequency, TF-IDF
Abstract :
Authorship Attribution (AA) is the task of automatically assigning a disputed text to one author chosen from a list of candidate authors. To this end, a model is trained on a dataset of textual documents with known authors, so the problem can be treated as multi-class, single-label classification. In this paper, we approach the task differently by extending information retrieval techniques to train an AA model. The proposed solution weights the AARR technique, presented in our previous study, to relax the value of term frequency. Its efficiency has been evaluated through several experiments on six datasets. The results show that the proposed solution improves accuracy on the IMDB, Gutenberg books, Poetry, Blogs, PAN2011, and Twitter datasets by 33%, 31%, 31%, 19%, 6%, and 1%, respectively, with an average improvement of 19.94% over all datasets. The best accuracy obtained on these datasets is 88%, 82%, 67%, 90%, 65%, and 81%, respectively. In addition, compared to the baseline system, the computation time of the proposed solution is significantly lower (a 21.44x speedup), owing to a dictionary-based indexing technique.
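The abstract does not give the details of AARR, the weighting scheme, or the index layout, so the following is only a minimal sketch of the general idea: authorship attribution treated as retrieval over TF-IDF author profiles stored in a dictionary-based inverted index. The names `build_index` and `attribute`, the exponent `alpha` used to dampen raw term frequency, the smoothed IDF formula, and the whitespace tokenizer are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the paper's features may differ.
    return text.lower().split()


def build_index(docs_by_author, alpha=0.5):
    """Build a dictionary-based inverted index of weighted TF-IDF author profiles.

    docs_by_author maps an author name to a list of that author's texts.
    alpha < 1 dampens ("relaxes") raw term frequency; this is an assumed
    weighting, not the scheme defined in the paper.
    """
    # One term-frequency profile per author (all of the author's texts merged).
    tf = {
        author: Counter(tok for doc in docs for tok in tokenize(doc))
        for author, docs in docs_by_author.items()
    }
    n_authors = len(tf)
    # Document frequency: number of author profiles containing each term.
    df = Counter(term for counts in tf.values() for term in counts)
    # Smoothed IDF (assumed form); always positive.
    idf = {t: math.log((1 + n_authors) / (1 + d)) + 1.0 for t, d in df.items()}

    index = defaultdict(dict)   # term -> {author: weight}
    sq_norms = defaultdict(float)
    for author, counts in tf.items():
        for term, count in counts.items():
            weight = (count ** alpha) * idf[term]
            index[term][author] = weight
            sq_norms[author] += weight * weight
    norms = {author: math.sqrt(v) for author, v in sq_norms.items()}
    return dict(index), idf, norms


def attribute(text, index, idf, norms, alpha=0.5):
    """Return the candidate author whose profile best matches the text.

    Only index entries for terms present in the query are visited, which is
    where a dictionary-based index saves time over scanning every profile.
    Returns None when the text shares no term with any profile.
    """
    scores = defaultdict(float)
    for term, count in Counter(tokenize(text)).items():
        postings = index.get(term)
        if postings is None:
            continue
        query_weight = (count ** alpha) * idf[term]
        for author, weight in postings.items():
            scores[author] += query_weight * weight
    if not scores:
        return None
    # Normalize by profile length so long profiles are not favored automatically.
    return max(scores, key=lambda author: scores[author] / norms[author])
```

A disputed text would be attributed by calling `build_index` once on the known-author documents and then `attribute` for each query. Setting `alpha=1.0` recovers plain TF-IDF, which makes it easy to compare the dampened and undampened weightings on the same data.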