0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
PeQa: a Massive Persian Quenstion-Answering and Chatbot Dataset
Authors :
Fatemeh Zahra Arshia
1
Mohammad Ali Keyvanrad
2
Saeedeh Sadat Sadidpour
3
Sayyid Mohammad Reza Mohammadi
4
1- Faculty of Electrical & Computer Engineering Malek-Ashtar University of Technology Tehran, Iran
2- Faculty of Electrical & Computer Engineering Malek-Ashtar University of Technology Tehran, Iran
3- Faculty of Electrical & Computer Engineering Malek-Ashtar University of Technology Tehran, Iran
4- Faculty of Electrical & Computer Engineering Malek-Ashtar University of Technology Tehran, Iran
Keywords :
Question-Answering System،Tweeter Dataset،Persian QA،Chatbot
Abstract :
TA question-answering (QA) system is an application able to communicate with humans using natural language processing. Modelling a dialogue between humans and machines is considered one of the most important tasks of Artificial Intelligence (AI). Creating a Chatbot with a good performance in modelling human-machine conversations is still one of the unsolved challenges in this field. Although Chatbots have many applications, in general, they should understand users’ meaning through their words and provide them with relevant answers. In the past, Chatbot architectures mainly relied on rules or statistical methods. With the advent of deep learning methods, trainable neural networks soon replaced the traditional models. These sorts of deep models are highly affected by the dataset that would be fed into them, and there is no big enough one available in the Persian language! We present a huge dataset of 14 million Persian tweets from tweeter that is meticulously processed to create a rich collection of 420,000 pairs of question-answer data. We also present modelling results on Transformers, including Sensibleness and Specificity Average (SSA) and the BLEU metric. We will release our dataset, modelling code, and models publicly.
Papers List
List of archived papers
Deep Learning Feature Extraction for COVID-19 Detection Algorithm using Computerized Tomography Scan
Maisarah Mohd Sufian - Ervin Gubin Moung - Chong Joon Hou - Ali Farzamnia
WBT-GAN:Wavelet based Generative Adversarial Network for Texture Synthesis
Sara Saberi moghadam - Reza Azmi - Maral Zarvani
New Design of Efficient Reversible Quantum Saturation Adder
Negin Mashayekhi - Mohammad Reza Reshadinezhad - Shekoofeh Moghimi
Islamic Geometric algorithms: A survey
Elham Akbari - Azam Bastanfard
Attention-Boosted Ensemble of Pre-trained Convolutional Neural Networks for Accurate Diabetic Retinopathy Detection
Benyamin Mirab Golkhatmi - Mohammad Hossein Moattar
Classification of COVID-19 and Nodule in CT Images using Deep Convolutional Neural Network
Amirhossein Ghaemi - Seyyed Amir Mousavi mobarakeh - Habibollah Danyali - Kamran Kazemi
Information Theoretic Learning-based Deep Embedded Clustering (ITL-DEC)
Hoda Shad - Mona Zamiri - Tahereh Bahreini - Reza Monsefi - Ghoshe Abed Hodtani
EfficientNetB0’s Hybrid Approach for Brain Tumor Classification from MRI Images Using Deep Learning and Bagging Trees
Yeganeh Modaresnia - Farhad Abedinzadeh Torghabeh - Seyyed Abed Hosseini
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Identification of Botnets and Nodes Attacking Smart Cities by Majority Voting Mechanism and Feature Selection
Maliheh Araghchi - Nazbanoo Farzaneh
more
Samin Hamayesh - Version 42.2.1