0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Graph-Cut-Based Semantic Optimization for Temporal Action Segmentation
Authors :
Mohanna Ansari
1
Ehsan Fazl-Ersi
2
1- Department of Computer Engineering, Ferdowsi University of Mashhad, Iran
2- Department of Computer Engineering, Ferdowsi University of Mashhad, Iran
Keywords :
Temporal action segmentation،Energy minimization،Graph-cut،Smooth Action Transition
Abstract :
Temporal action segmentation in untrimmed videos is critical for understanding human activities in applications such as robotics, surveillance, and human-computer interaction. While existing methods based on temporal convolutional networks (TCNs) and transformers effectively capture temporal dependencies and refine features, they often lack explicit mechanisms to enforce semantic consistency between action labels, leading to fragmented predictions. To address this limitation, we propose a novel framework that formulates temporal action segmentation as an energy minimization problem combining data fidelity and smoothness costs. Data costs are derived from a diffusion-based generative model (DiffAct) to capture action probabilities, while smoothness costs enforce semantic coherence by modeling valid transitions between action labels. We leverage graph-cut optimization to efficiently minimize the energy function. Experiments on the GTEA dataset demonstrate that our method, GBSO, achieves superior segmentation accuracy and temporal consistency compared to state-of-the-art approaches, improving boundary alignment and ensuring smoother semantic transitions. These results highlight the effectiveness of integrating semantic smoothness constraints into data-driven action segmentation frameworks.
Papers List
List of archived papers
Predicting the Recovery Rate of COVID-19 Using a Novel Hybrid Method
Fatemeh Ahouz - Ebrahim Sayahi
Traffic Sign Recognition Using Local Vision Transformer
Ali Farzipour - Omid Nejati Manzari - Shahriar B. Shokouhi
Weakly Supervised Learning in a Group of Learners with Communication
Ali Ganjbakhsh - Ahad Harati
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
Semantic Segmentation Using Region Proposals and Weakly-Supervised Learning
Maryam Taghizadeh - Abdolah Chalechale
Supervised Contrastive Learning for Short Text Classification in Natural Language Processing
Mitra Esmaeili - Hamed Vahdat nejad
The process of multi class fake news dataset generation
Sajjad Rezaei - Mohsen Kahani - Behshid Behkamal
Vaccine Distribution Modelling in Pandemics through Multi-Agent Systems: COVID-19 Case
Hossein Yarahmadi - Mohammad Ebrahim Shiri - Hamid Reza Navidi - Arash Sharifi - Moharram Challenger - Hassan Piriaei
Optimizing Question-Answering Framework Through Integration of Text Summarization Model and Third-Generation Generative Pre-Trained Transformer
Ervin Gubin Moung - Toh Sin Tong - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Farashazillah Yahya
Histopathology Image-Based Cancer Classification Utilizing Transfer Learning Approach
Amir Meydani - Alireza Meidani - Ali Ramezani - Maryam Shabani - Mohammad Mehdi Kazeminasab - Shahriar Shahablavasani
more
Samin Hamayesh - Version 43.7.0