0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Span-prediction of Unknown Values for Long-sequence Dialogue State Tracking
Authors :
Marzieh Naghdi Dorabati
1
Reza Ramezani
2
Mohammad Ali Nematbakhsh
3
1- Department of Computer Engineering, University of Isfahan
2- Dept. of Computer Engineering, University of Isfahan
3- Dept. of Computer Engineering, University of Isfahan
Keywords :
Task-oriented Dialogue Systems،Dialogue State Tracking،Unknown Values،Diversity of User Utterances،Long Sequences
Abstract :
Abstract—Dialogue state tracking is one of the main components in task-oriented dialogue systems whose duty is tracking the user goal during the conversation. Due to the diversity in natural languages and existing different utterances, the user requests may include unknown values at different turns in these systems. However, predicting true values of the user requests is necessary for completing the intended task. In existing studies, these values are determined using span-based methods to predict a span in utterances or previous dialogues. However, the slots are not correctly filled when values are multi-word. In addition, in some scenarios, the slot values in a given turn may depend on previously dialogue states. However, due to the limitation of the input length of language models, it is impossible to access all the previous dialogue states. In this study, a new approach is proposed that uses a span-tokenizer and adds the Bi-LSTM layer on top of the BERT model to predict the exact span of multi-word values. This approach uses parameters like user utterance, important dialogue histories, and all dialogue states as input to decrease the length of the sequences. The results show that this strategy has led to 1.80% improvement in the joint-goal accuracy and 0.15% improvement in the slot accuracy metrics over the MultiWOZ 2.1 dataset compared to the SAVN model.
Papers List
List of archived papers
Energy Efficient Power Allocation in MIMO-NOMA Systems with ZF Receiver Beamforming in Multiple Clusters
Mahdi Nangir - Abdolrasoul Sakhaei Gharagezlou - Nima Imani
Adaptive Active Queue Management for Time Slot Channel Hopping in Industrial Internet of Things
Mehdi Zirak - Yasser Sedaghat - Mohammad Hossein Yaghmaee Moghaddam
Adaptive Multi-Scale Attentional Network for Semantic Segmentation of Remote Sensing Images
Melika Zare - Sattar Hashemi
An overview of Business Intelligence research in healthcare organizations using a topic modeling approach
Mohammad Mehraeen - Laya Mahmoudi - Mohammad Hossein Sharifi
R2-BAC: A Novel Blockchain and IoT-Based Access Control Model for Supply Chain Management
Sadegh Sohani - Farnaz Kamranfar - Haleh Amintoosi - Mohammad Allahbakhsh
An Automated Visual Defect Segmentation for Flat Steel Surface Using Deep Neural Networks
Dorna Nourbakhsh Sabet - Mohammad Reza Zarifi - Javad Khoramdel - Yasamin Borhani - Esmaeil Najafi
Virus-Antiviral Prediction Using Machine and Deep Learning Methods
Shayan Majidifar - Fatemeh Nasiri - Mohsen Hooshmand
A scalable blockchain-based educational network for data storage and assessment
Maryam Fattahi Vanani - Hamidreza Shayegh Borujeni - Ali Nourollah
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks
Mehrdad Mohammadian - Neda Maleki - Tobias Olsson - Fredrik Ahlgren
Iris Detection and Segmentation Using Deep Learning
Ali Khaki - Ali Aghagolzadeh - Bagher Rahimpour Cami
more
Samin Hamayesh - Version 42.2.1