0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Beyond Appearance: Transformer-based Person Identification from Conversational Dynamics
Authors :
Masoumeh Chapariniya
1
Teodora Vukovic
2
Sarah Ebling
3
Volker Dellwo
4
1- University of Zurich
2- university of zurich
3- university of zurich
4- university of zurich
Keywords :
Person identification،conversational gestures،vision transformers،spatial-temporal modeling،keypoint dynamics
Abstract :
This paper investigates the performance of transformer-based architectures for person identification in natural, face-to-face conversation scenario. We implement and evaluate a two-stream framework that separately models spatial configurations and temporal motion patterns of 133 COCO WholeBody keypoints, extracted from a subset of the CANDOR conversational corpus. Our experiments compare pre-trained and from-scratch training, investigate the use of velocity features, and introduce a multi-scale temporal transformer for hierarchical motion modeling. Results demonstrate that domain-specific training significantly outperforms transfer learning, and that spatial configurations carry more discriminative information than temporal dynamics. The spatial transformer achieves 95.74% accuracy, while the multi-scale temporal transformer achieves 93.90%. Feature-level fusion pushes performance to 98.03%, confirming that postural and dynamic information are complementary. These findings highlight the potential of transformer architectures for person identification in natural interactions and provide insights for future multimodal and cross-cultural studies
Papers List
List of archived papers
Balanced Learning with Optimized Extra Trees Classifier for Reliable Lithology Identification in Imbalanced Well Log Data
Ali Daneshpour - Behnam Yousefimehr - Mehdi Ghatee
Non-Functional Requirement Extracting Methods for AI-based Systems: A Survey
Reza Damirchi - Amineh Amini
Semantic Segmentation Using Region Proposals and Weakly-Supervised Learning
Maryam Taghizadeh - Abdolah Chalechale
Divide and Conquer Approach to Long Genomic Sequence Alignment
Mahmoud Naghibzadeh - Samira Babaei - Behshid Behkmal - Mojtaba Hatami
Towards Efficient Video Object Detection on Embedded Devices
Mohammad Hajizadeh - Adel Rahmani - Mohammad Sabokrou
Adaptive-A-GCRNN: Enhancing Real-time Multi-band Spectrum Prediction through Attention-based Spatial-Temporal Modeling
Seyed majid Hosseini - Seyedeh Mozhgan Rahmatinia - Seyed Amin Hosseini Seno - Hadi Sadoghi yazdi
Classification of COVID-19 and Nodule in CT Images using Deep Convolutional Neural Network
Amirhossein Ghaemi - Seyyed Amir Mousavi mobarakeh - Habibollah Danyali - Kamran Kazemi
Improvement of Credit Scoring by LSTM Autoencoder Model
Milad Sattari Maleki - Seyedeh Niusha Motevallian - Faezehsadat Hosseini - Mohammad Sabokrou - Hamidreza Soltanalizadeh Maleki
Improving performance of multi-label classification using ensemble of feature selection and outlier detection
Mohammad Ali Zarif - Javad Hamidzadeh
Optimizing the controller placement problem in SDN with uncertain parameters with robust optimization
Mohammad Kazemi - AhmadReza Montazerolghaem
more
Samin Hamayesh - Version 43.7.0