0% Complete
Home
/
13th International Conference on Computer and Knowledge Engineering
FarCQA: A Farsi Community Dataset for Question Classification and Answer Selection
Authors :
Saba Emami
1
Maedeh Mosharraf
2
1- Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran
2- Faculty of Computer Science and Engineering Shahid Beheshti University Tehran, Iran
Keywords :
Question answering systems (QAS)،dataset،FarCQA،Persian،question classification
Abstract :
Question Answering Systems (QASs) have become increasingly important due to the need for accurate and concise answers that traditional search engines often struggle to provide. However, the development of QASs for the Persian language has been limited due to its complexity, fewer available resources, and tools compared to other languages. One crucial component of a QAS is question classification, which plays an effective role in retrieving correct answers. In this paper, we introduce FarCQA, the first open domain Persian community dataset for question classification and answer selection tasks, collected from an online forum. This dataset is tagged with 9 types of questions and includes both formal and informal language. In addition, we propose question classification and answer selection models using transformer based models and combining word embedding and deep learning techniques. Our approach demonstrates a notable accuracy on the test set, surpassing state-of-the-art methods.
Papers List
List of archived papers
GroupRec: Group Recommendation by Numerical Characteristics of Groups in Telegram
Davod Karimpour - Mohammad Ali Zare Chahooki - Ali Hashemi
Reversible Data Insertion in Encryption Domain Based on Reduced Quad Difference Expansion
Alireza Ghaemi - Mohammad Zare Ehteshami - Amirhossein Ghaemi
An Overview of Regression Methods in Early Prediction of Movie Ratings
Houmaan Chamani - Zhivar Sourati Hassanzadeh - Behnam Bahrak
A Comparative Analysis of Clinical Note Categories for Mortality Prediction in ICU Patients
Maryam Karrabi - Mohsen Kahani - Mina Afzali - Nadieh Armin
Improvement of CluStream Algorithm Using Sliding Window for the Clustering of Data Streams
Sahar Ahsani - Morteza Yousef Sanati - Muharram Mansoorizadeh
Span-prediction of Unknown Values for Long-sequence Dialogue State Tracking
Marzieh Naghdi Dorabati - Reza Ramezani - Mohammad Ali Nematbakhsh
Capsule Routing over Stacked GCN-GAT Embeddings with Negative Sampling for Graph Link Prediction
Fatemeh Safari Sarvandi - Sayeh Mirzaei - Rooholah Abedian
Effect of Tissue Excitation in Breast Cancer Detection from Ultrasound RF Time Series: Phantom studies
Elaheh Norouzi Ghehi - Ali Fallah - Saeid Rashidi - Maryam Mehdizadeh Dastjerdi
A Novel Deformable Registration Method for Cerebral Magnetic Resonance Images
Bahareh Asadpour Dasht Bayaz - Mahdi Saadatmand - Fabrice Wallois
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
more
Samin Hamayesh - Version 43.7.0