12th International Conference on Computer and Knowledge Engineering
Span-prediction of Unknown Values for Long-sequence Dialogue State Tracking
Authors :
Marzieh Naghdi Dorabati 1
Reza Ramezani 2
Mohammad Ali Nematbakhsh 3
1- Department of Computer Engineering, University of Isfahan
2- Department of Computer Engineering, University of Isfahan
3- Department of Computer Engineering, University of Isfahan
Keywords :
Task-oriented Dialogue Systems, Dialogue State Tracking, Unknown Values, Diversity of User Utterances, Long Sequences
Abstract :
Dialogue state tracking is a core component of task-oriented dialogue systems; its job is to track the user's goal throughout the conversation. Because natural language is diverse and the same request can be phrased in many ways, user requests may contain unknown values at any turn. Predicting the true values of these requests is nevertheless necessary to complete the intended task. Existing studies determine these values with span-based methods that predict a span in the utterances or in previous dialogues. However, slots are not filled correctly when the values are multi-word. In addition, in some scenarios the slot values in a given turn depend on previous dialogue states, yet the input-length limit of language models makes it impossible to access all previous dialogue states. This study proposes a new approach that uses a span-tokenizer and adds a Bi-LSTM layer on top of the BERT model to predict the exact span of multi-word values. To reduce sequence length, the approach takes the user utterance, the important dialogue histories, and all dialogue states as input. The results show a 1.80% improvement in joint-goal accuracy and a 0.15% improvement in slot accuracy on the MultiWOZ 2.1 dataset compared to the SAVN model.
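The two metrics named in the abstract can be illustrated with a minimal sketch. It assumes the common definitions used for MultiWOZ-style evaluation: joint-goal accuracy counts a turn as correct only if every slot matches the gold state, while slot accuracy scores each (turn, slot) pair individually. The function names, the slot list, and the example states below are hypothetical and are not taken from the paper or from any evaluation script.

```python
from typing import Dict, List

# A dialogue state maps slot names to values; a missing slot is treated as "none".
State = Dict[str, str]


def joint_goal_accuracy(preds: List[State], golds: List[State], slots: List[str]) -> float:
    """Fraction of turns whose predicted state matches the gold state on every slot."""
    correct_turns = sum(
        all(p.get(s, "none") == g.get(s, "none") for s in slots)
        for p, g in zip(preds, golds)
    )
    return correct_turns / len(golds)


def slot_accuracy(preds: List[State], golds: List[State], slots: List[str]) -> float:
    """Fraction of individual (turn, slot) pairs predicted correctly."""
    matches = sum(
        p.get(s, "none") == g.get(s, "none")
        for p, g in zip(preds, golds)
        for s in slots
    )
    return matches / (len(golds) * len(slots))


# Hypothetical two-turn dialogue with a multi-word value.
slots = ["hotel-name", "hotel-area"]
golds = [
    {"hotel-name": "acorn guest house"},
    {"hotel-name": "acorn guest house", "hotel-area": "north"},
]
preds = [
    {"hotel-name": "acorn guest house"},
    {"hotel-name": "acorn", "hotel-area": "north"},  # truncated multi-word span
]

jga = joint_goal_accuracy(preds, golds, slots)  # 0.5: the second turn has one wrong slot
sa = slot_accuracy(preds, golds, slots)         # 0.75: 3 of 4 (turn, slot) pairs match
```

In this toy example, a single truncated multi-word value ("acorn" instead of "acorn guest house") makes the whole second turn wrong under joint-goal accuracy but costs only one of four pairs under slot accuracy, which is one reason improvements in the two metrics can differ in size.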