0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Graph-Cut-Based Semantic Optimization for Temporal Action Segmentation
Authors :
Mohanna Ansari
1
Ehsan Fazl-Ersi
2
1- Department of Computer Engineering, Ferdowsi University of Mashhad, Iran
2- Department of Computer Engineering, Ferdowsi University of Mashhad, Iran
Keywords :
Temporal action segmentation،Energy minimization،Graph-cut،Smooth Action Transition
Abstract :
Temporal action segmentation in untrimmed videos is critical for understanding human activities in applications such as robotics, surveillance, and human-computer interaction. While existing methods based on temporal convolutional networks (TCNs) and transformers effectively capture temporal dependencies and refine features, they often lack explicit mechanisms to enforce semantic consistency between action labels, leading to fragmented predictions. To address this limitation, we propose a novel framework that formulates temporal action segmentation as an energy minimization problem combining data fidelity and smoothness costs. Data costs are derived from a diffusion-based generative model (DiffAct) to capture action probabilities, while smoothness costs enforce semantic coherence by modeling valid transitions between action labels. We leverage graph-cut optimization to efficiently minimize the energy function. Experiments on the GTEA dataset demonstrate that our method, GBSO, achieves superior segmentation accuracy and temporal consistency compared to state-of-the-art approaches, improving boundary alignment and ensuring smoother semantic transitions. These results highlight the effectiveness of integrating semantic smoothness constraints into data-driven action segmentation frameworks.
Papers List
List of archived papers
Soccer Video Event Detection Using Metric Learning
Ali Karimi - Ramin Toosi - Mohammad Ali Akhaee
Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
Fatemeh Moradi - Mehran Tarif - Mohammadhossein Homaei
Autonomous Drone Navigation Using Synchronized Camera and IMU Data with CNN
Reza Javanmard Alitappeh - Narges Hamzeh Mermeti - Fatemeh Barzegar - Fatemeh Ebrahimi - Nima Mahmoudi - Jalal Alipour Langouri
Ensemble-Based Fraud Detection: A Robust Approach Evaluated on IEEE-CIS
Fatemeh Moradi - Mehran Tarif - Mohammadhossein Homaei
Extracting structural clusters from NMF feature matrix using Cosine Similarity-Based Weighted Voting
Mehdi Rahimi - Keyhan Khamforoosh - Vafa Maihami
A Self-Configurable Model for Cloud Resource Allocation
Ali Bazghandi
Information Theoretic Learning-based Deep Embedded Clustering (ITL-DEC)
Hoda Shad - Mona Zamiri - Tahereh Bahreini - Reza Monsefi - Ghoshe Abed Hodtani
No-Reference Video Quality Assessment by Deep Feature Maps Relations
Amir Hossein Bakhtiari - Azadeh Mansouri
AVID: A VARIATIONAL INFERENCE DELIBERATION FOR META-LEARNING
Alireza Javaheri - Arsham Gholamzadeh Khoee - Saeed Reza Kheradpisheh - Hadi Farahani - Mohammad Ganjtabesh
FarSick: A Persian Semantic Textual Similarity And Natural Language Inference Dataset
Zahra Ghasemi - Mohammad Ali Keyvanrad
more
Samin Hamayesh - Version 43.7.0