0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Span-prediction of Unknown Values for Long-sequence Dialogue State Tracking
Authors :
Marzieh Naghdi Dorabati
1
Reza Ramezani
2
Mohammad Ali Nematbakhsh
3
1- Department of Computer Engineering, University of Isfahan
2- Dept. of Computer Engineering, University of Isfahan
3- Dept. of Computer Engineering, University of Isfahan
Keywords :
Task-oriented Dialogue Systems،Dialogue State Tracking،Unknown Values،Diversity of User Utterances،Long Sequences
Abstract :
Abstract—Dialogue state tracking is one of the main components in task-oriented dialogue systems whose duty is tracking the user goal during the conversation. Due to the diversity in natural languages and existing different utterances, the user requests may include unknown values at different turns in these systems. However, predicting true values of the user requests is necessary for completing the intended task. In existing studies, these values are determined using span-based methods to predict a span in utterances or previous dialogues. However, the slots are not correctly filled when values are multi-word. In addition, in some scenarios, the slot values in a given turn may depend on previously dialogue states. However, due to the limitation of the input length of language models, it is impossible to access all the previous dialogue states. In this study, a new approach is proposed that uses a span-tokenizer and adds the Bi-LSTM layer on top of the BERT model to predict the exact span of multi-word values. This approach uses parameters like user utterance, important dialogue histories, and all dialogue states as input to decrease the length of the sequences. The results show that this strategy has led to 1.80% improvement in the joint-goal accuracy and 0.15% improvement in the slot accuracy metrics over the MultiWOZ 2.1 dataset compared to the SAVN model.
Papers List
List of archived papers
Leveraging a structure-based and learning-based predictor using various feature groups in bioinformatics (case study: protein-peptide region residue-level interaction)
Shima Shafiee - Abdolhossein Fathi
Reversible Data Insertion in Encryption Domain Based on Reduced Quad Difference Expansion
Alireza Ghaemi - Mohammad Zare Ehteshami - Amirhossein Ghaemi
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
Disturbance Rejection in Quadruple-Tank System by Proposing New Method in Reinforcement Learning
Alireza Nezamzadeh - Mohammadreza Esmaeilidehkordi
Cardiology Disease Diagnosis by Analyzing Histological Microscopic Images Using Deep Learning
Maria Salehpanah - Jafar Tanha - Zahra Jafari - SeyedEhsan Roshan - Sajad Rezaei
SASIAF, An Scalable Accelerator For Seismic Imaging on Amazon AWS FPGAs
Mostafa Koraei - S.Omid Fatemi
Prediction of West Texas Intermediate Crude-oil Price Using Hybrid Attention-based Deep Neural Networks: A Comparative Study
Alireza Jahandoost - Mahboobeh Houshmand - Seyyed Abed Hosseini
Predicting the Recovery Rate of COVID-19 Using a Novel Hybrid Method
Fatemeh Ahouz - Ebrahim Sayahi
Instance Selection from Skewed Class Distributions by Using the multi-objective optimizer
Mona Moradi - Javad Hamidzadeh
Evaluating the Impact of Traveling on COVID-19 Prevalence and Predicting the New Confirmed Cases According to the Travel Rate Using Machine Learning: A Case Study in Iran
Anita Ghandehari - Soheil Shirvani - Hadi Moradi
more
Samin Hamayesh - Version 42.2.1