0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Towards Transparent and Accurate Story Point Estimation via Interpretable BERT-based Modeling
Authors :
Seyed Emad Baradaran Hosseini
1
Maryam Khodabakhsh
2
Alireza Tajary
3
Seyedehfatemeh Karimi
4
1- Master Student of Computer Engineering, Shahrood University of Technology
2- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
3- Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran
4- Department of Engineering Ferdowsi University of Mashhad Mashhad, Iran
Keywords :
Agile Software Development،Story Point Estimation،Natural Language Processing،BERT Classifier،Interpretable
Abstract :
This study proposes a novel approach for estimating story points in agile software projects by leveraging advanced natural language processing (NLP) models combined with interpretability techniques. Task descriptions are first transformed into semantic embedding vectors, and then classified into four categories—Small, Medium, Large, and Huge—using a BERT-based classifier. To enhance model interpretability, the CLS embedding vectors are extracted, dimensionally reduced, and clustered via K-Means to clearly reveal class boundaries and overlaps. Experimental results demonstrate a high overall accuracy of 88.71% and an average F1-score exceeding 0.87, significantly outperforming baseline methods. Analysis of confusion matrices and semantic clustering indicates challenges in distinguishing between the small and medium classes, which could be alleviated by incorporating richer contextual features. The proposed framework, by providing interpretable insights alongside robust accuracy, represents an important step towards increasing transparency and trustworthiness in intelligent story point estimation systems for agile projects. Finally, recommendations for future work include employing more advanced language models, optimizing model performance, and expanding training datasets for improved generalizability.
Papers List
List of archived papers
AIRSPAN-X: Federated XGBoost with Sequential Anomaly Detection for Explainable Urban Air Quality Prediction
Saghar Shafaati - S. Hossein Erfani
Impact of Oversampling Methods on Imbalanced Dataset for Software Fault Prediction
Alireza Abiri - Alireza Tajary - Mansoor Fateh
A Comprehensive Dataset of Real-scene Images for Text Detection and Recognition in Persian
Iman Souzanchi - Ramin Rahimi - Mohammad Ali Majidi Anvari - Atefeh Baniasadi - Ashkan Sadeghi - Mohammad Reza Mohammadi
Identifying novel disease genes based on protein complexes and biological features
Mahshad Hashemi - Eghbal Mansoori
Optimizing Text-Based Protocol Clustering in Reverse Engineering with Auto-Encoders and Fine-Tuned Parameters
Shiva Mahmoudzadeh - Mohaddese Nemati - Mehdi Teimouri
Adaptive Prioritization in Experience Replay Using Feedback from Multiple Learning Signals
Seyed Hossein Mostafavi - Mohammad Bagher Naghibi Sistani
Enhanced Principal-curve based Classifiers for Time-series Label Prediction
Seyed Aref Hakimzadeh - Koorush Ziarati
HiCAP: Hierarchical Clustering-based Attention Pooling for Graph Representation Learning
Parsa Haddadian - Rooholah Abedian - Ali Moeini
Semi-automatic Detection of Persian Stopwords using FastText Library
Mohammad Dehghani - Mohammad Manthouri
Hybrid Flow-Rule Placement Method of Proactive and Reactive in SDNs
Mohammadreza Khoobbakht - Mohammadreza Noei - Mohammadreza Parvizimosaed
more
Samin Hamayesh - Version 43.7.0