0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Graph-Cut-Based Semantic Optimization for Temporal Action Segmentation
Authors :
Mohanna Ansari
1
Ehsan Fazl-Ersi
2
1- Department of Computer Engineering, Ferdowsi University of Mashhad, Iran
2- Department of Computer Engineering, Ferdowsi University of Mashhad, Iran
Keywords :
Temporal action segmentation،Energy minimization،Graph-cut،Smooth Action Transition
Abstract :
Temporal action segmentation in untrimmed videos is critical for understanding human activities in applications such as robotics, surveillance, and human-computer interaction. While existing methods based on temporal convolutional networks (TCNs) and transformers effectively capture temporal dependencies and refine features, they often lack explicit mechanisms to enforce semantic consistency between action labels, leading to fragmented predictions. To address this limitation, we propose a novel framework that formulates temporal action segmentation as an energy minimization problem combining data fidelity and smoothness costs. Data costs are derived from a diffusion-based generative model (DiffAct) to capture action probabilities, while smoothness costs enforce semantic coherence by modeling valid transitions between action labels. We leverage graph-cut optimization to efficiently minimize the energy function. Experiments on the GTEA dataset demonstrate that our method, GBSO, achieves superior segmentation accuracy and temporal consistency compared to state-of-the-art approaches, improving boundary alignment and ensuring smoother semantic transitions. These results highlight the effectiveness of integrating semantic smoothness constraints into data-driven action segmentation frameworks.
Papers List
List of archived papers
Deep Learning Based High-Resolution Edge Detection for Microwave Imaging using a Variational Autoencoder
Seyed Reza Razavi Pour - Leila Ahmadi - Amir Ahmad Shishegar
Brain Age Estimation with Twin Vision Transformer using Hippocampus Information Applicable to Alzheimer Dementia Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Amirhossein Ghaemi - Habibollah Danyali - Kamran Kazemi - Alireza Ghaemi
Efficient Prediction of Cardiovascular Disease via Extra Tree Feature Selection
Mina Abroodi - Mohammad Reza Keyvanpour - Ghazaleh Kakavand Teimoory
FarSick: A Persian Semantic Textual Similarity And Natural Language Inference Dataset
Zahra Ghasemi - Mohammad Ali Keyvanrad
T-Rank: Graph Data Analytics for Urban Traffic Modeling
Alireza Safarpour - Iman Gholampour - Amirhossain Aghazadeh Fard - Seyed Mohammad Karbasi
R2-BAC: A Novel Blockchain and IoT-Based Access Control Model for Supply Chain Management
Sadegh Sohani - Farnaz Kamranfar - Haleh Amintoosi - Mohammad Allahbakhsh
MIPS-Core Application Specific Instruction-Set Processor for IDEA Cryptography − Comparison between Single-Cycle and Multi-Cycle Architectures
Ahmad Ahmadi - Reza Faghih Mirzaee
Forecasting El Niño Six Months in Advance Utilizing Augmented Convolutional Neural Network
Mohammad Naisipour - Iraj Saeedpanah - Arash Adib - Mohammad Hossein Neisi Pour
FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data
Rasoul Jafari Gohari - Laya Aliahmadipour - Ezat Valipour
Semantic Segmentation Using Region Proposals and Weakly-Supervised Learning
Maryam Taghizadeh - Abdolah Chalechale
more
Samin Hamayesh - Version 43.7.0