0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
FarCQA: A Farsi Community Dataset for Question Classification and Answer Selection
Authors :
Saba Emami
1
Maedeh Mosharraf
2
1- Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran
2- Faculty of Computer Science and Engineering Shahid Beheshti University Tehran, Iran
Keywords :
Question answering systems (QAS)،dataset،FarCQA،Persian،question classification
Abstract :
Question Answering Systems (QASs) have become increasingly important due to the need for accurate and concise answers that traditional search engines often struggle to provide. However, the development of QASs for the Persian language has been limited due to its complexity, fewer available resources, and tools compared to other languages. One crucial component of a QAS is question classification, which plays an effective role in retrieving correct answers. In this paper, we introduce FarCQA, the first open domain Persian community dataset for question classification and answer selection tasks, collected from an online forum. This dataset is tagged with 9 types of questions and includes both formal and informal language. In addition, we propose question classification and answer selection models using transformer based models and combining word embedding and deep learning techniques. Our approach demonstrates a notable accuracy on the test set, surpassing state-of-the-art methods.
Papers List
List of archived papers
Dynamic Knowledge Enhanced Neural Fashion Trend Forecasting with Quantile Loss
Fatemeh Rooholamini - Reza Azmi - Mobina Khademhossein - Maral Zarvani
UAV-based Firefighting by Multi-agent Reinforcement Learning
Reza Shami Tanha - Mohsen Hooshmand - Mohsen Afsharchi
A scalable blockchain-based educational network for data storage and assessment
Maryam Fattahi Vanani - Hamidreza Shayegh Borujeni - Ali Nourollah
An Efficient Approach for Breast Abnormality Detection through High-Level Features of Thermography Images
Farhad Abedinzadeh Torghabeh - Yeganeh Modaresnia - Seyyed Abed Hosseini
Robat-e-Beheshti: A Persian Wake Word Detection Dataset for Robotic Purposes
Parisa Ahmadzadeh Raji - Yasser Shekofteh
Cross-project Defect Prediction with An Enhanced Transfer Boosting Algorithm
Nazgol Nikravesh - Mohammad Reza Keyvanpour
Attention Transfer in Self-Regulated Networks for Recognizing Human Actions from Still Images
Masoumeh Chapariniya - Sara Vesali Barazande - Seyed Sajad Ashrafi - Shahriar B.Shokouhi
HiCAP: Hierarchical Clustering-based Attention Pooling for Graph Representation Learning
Parsa Haddadian - Rooholah Abedian - Ali Moeini
An overview of Business Intelligence research in healthcare organizations using a topic modeling approach
Mohammad Mehraeen - Laya Mahmoudi - Mohammad Hossein Sharifi
Optimizing MR Image Registration for Accurate Brain Volume Measurement in Children with Autism Spectrum Disorder
Shiva Sanati - Mahdi Saadatmand
more
Samin Hamayesh - Version 41.5.3