11th International Conference on Computer and Knowledge Engineering
A Weighted TF-IDF-based Approach for Authorship Attribution
Authors :
Ali Abedzadeh (1), Reza Ramezani (2), Afsaneh Fatemi (3)
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
Keywords :
Authorship Attribution, Author Identification, Information Retrieval, Term Frequency, TF-IDF
Abstract :
Authorship Attribution (AA) is the task of automatically assigning a disputed text to one author chosen from a list of candidate authors. To this end, a model is trained on a dataset of textual documents with known authors, so the task can be considered multi-class, single-label classification. In this paper, we approach the task differently by extending information retrieval techniques to train an AA model. The proposed solution weights the AARR technique, presented in our previous study, to relax the value of term frequency. We evaluated its efficiency in several experiments on six datasets. The results show that the proposed solution improves accuracy on the IMDB, Gutenberg books, Poetry, Blogs, PAN2011, and Twitter datasets by 33%, 31%, 31%, 19%, 6%, and 1%, respectively, for an average improvement of 19.94% over all datasets. The best accuracy on these datasets is 88%, 82%, 67%, 90%, 65%, and 81%, respectively. In addition, by employing a dictionary-based indexing technique, the proposed solution runs significantly faster than the baseline system (21.44X).
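The retrieval view of AA described in the abstract can be illustrated with a minimal sketch. This is not the AARR technique or the weighting proposed in the paper, whose details the abstract does not give. It assumes a generic TF-IDF scheme in which term frequency is damped by an exponent `alpha` (a hypothetical stand-in for "relaxing" term frequency), each author's known texts are merged into one profile, and a plain dictionary serves as the inverted index. All function and parameter names are illustrative.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Naive whitespace tokenizer; a real system would use something richer.
    return text.lower().split()


def build_index(docs_by_author, alpha=0.5):
    """Build a dictionary-based inverted index: term -> {author: weight}.

    alpha < 1 damps raw term frequency (an assumed form of relaxation).
    """
    # Merge each author's known texts into a single term-count profile.
    profiles = {
        author: Counter(tok for doc in docs for tok in tokenize(doc))
        for author, docs in docs_by_author.items()
    }
    n_authors = len(profiles)
    # Document frequency: number of author profiles containing each term.
    df = Counter(term for counts in profiles.values() for term in counts)
    # Smoothed IDF, always positive.
    idf = {t: math.log((1 + n_authors) / (1 + k)) + 1.0 for t, k in df.items()}

    index = defaultdict(dict)
    sq_norms = defaultdict(float)
    for author, counts in profiles.items():
        for term, tf in counts.items():
            weight = (tf ** alpha) * idf[term]
            index[term][author] = weight
            sq_norms[author] += weight * weight
    norms = {a: math.sqrt(v) for a, v in sq_norms.items()}
    return index, idf, norms


def attribute(text, index, idf, norms, alpha=0.5):
    """Return the best-matching author for a disputed text, or None."""
    scores = defaultdict(float)
    for term, tf in Counter(tokenize(text)).items():
        postings = index.get(term)
        if postings is None:
            continue  # term never seen in any known text
        query_weight = (tf ** alpha) * idf[term]
        # Only authors who actually used the term are visited.
        for author, weight in postings.items():
            scores[author] += query_weight * weight
    if not scores:
        return None
    # Normalize by profile length so prolific authors are not favored.
    return max(scores, key=lambda a: scores[a] / norms[a])
```

With a toy corpus such as `{"alice": ["the sea the waves the shore"], "bob": ["stock market prices rise"]}`, calling `build_index` once and then `attribute("market prices", index, idf, norms)` looks up only the index entries for the query terms, which is the general reason an inverted index can reduce computation time compared with scoring every document in full.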