Chapter

Title

Automatic Task Classification of Software Projects for Planning and Simulation

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2025

Chapter type

chapter in monograph / paper

Publication language

English

Abstract

EN Background: Information about project tasks stored in Issue tracking systems (ITS) can be used for project analytics or process simulation. However, such issues must be classified beforehand. Considering the number of tasks stored in ITS, this task shall be done automatically. Aims: Our research aims to build an automatic recurring Jira issue classification model based on types and textual descriptions to enable the practical application of the model for software project planning and management. Method: We study a dataset from six industrial projects containing 9.6K tasks and augment it with an additional dataset of 91K task descriptions from other industrial projects to up-sample minority classes during training. We labeled the data using a semi-manual, active-learning-based method. We perform ten runs of 10-fold cross-validation for each project and evaluate classifiers using a set of state-of-the-art prediction quality metrics, i.e., Accuracy, Precision, Recall, F1-score, and MCC. Our machine-learning pipeline includes a Transformer-based sentence embedder (‘mxbai-embed-large-v1’) and an XGBoost classifier. We also study the impact of task-classification errors on project staffing issues. Results: The model automatically classifies software process tasks into 14 classes with MCCs ranging from 0.69 to 0.88. We built a confusion matrix that showed the most frequently confused task classes. We analyzed the consequences of classification errors. Conclusions: The study’s results enable the practical application of the software process model to analyze, plan, and manage software development projects.
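The abstract reports classification quality as MCC (Matthews correlation coefficient) across 14 task classes. As a minimal sketch of how a multiclass MCC can be computed from true and predicted labels, the following uses the standard multiclass generalization of the coefficient. It is not the authors' evaluation code; the function name `multiclass_mcc` and the example labels are hypothetical and are not taken from the chapter.

```python
from collections import Counter
from math import sqrt


def multiclass_mcc(y_true, y_pred):
    """Multiclass Matthews correlation coefficient (illustrative sketch).

    y_true, y_pred: equal-length sequences of hashable class labels.
    Returns a value in [-1, 1]; returns 0.0 when the coefficient is
    undefined (for example, when every prediction is the same class).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    s = len(y_true)                                         # total samples
    c = sum(1 for t, p in zip(y_true, y_pred) if t == p)    # correct predictions
    t_counts = Counter(y_true)                              # true occurrences per class
    p_counts = Counter(y_pred)                              # predictions per class
    labels = set(t_counts) | set(p_counts)

    numerator = c * s - sum(t_counts[k] * p_counts[k] for k in labels)
    spread_true = s * s - sum(v * v for v in t_counts.values())
    spread_pred = s * s - sum(v * v for v in p_counts.values())
    denominator = sqrt(spread_true * spread_pred)

    return numerator / denominator if denominator else 0.0


# Hypothetical labels, for illustration only (not the chapter's 14 classes).
y_true = ["bug", "bug", "feature", "test"]
y_pred = ["bug", "feature", "feature", "test"]
score = multiclass_mcc(y_true, y_pred)
```

Unlike accuracy, MCC accounts for how predictions are distributed across all classes, which is one reason it is often preferred for imbalanced label sets such as the minority classes mentioned in the abstract.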

Date of online publication

29.03.2025

Pages (from - to)

30 - 63

DOI

10.1007/978-3-031-84913-8_2

URL

https://link.springer.com/chapter/10.1007/978-3-031-84913-8_2

Book

Software, System, and Service Engineering : S3E 2024 Topical Area, Held as Part of FedCSIS 2024, Belgrade, Serbia, 8–11 September, 2024 and KKIO 2024, Held as Part of SEAA 2024, Paris, France, August 28-30, 2024, Revised Selected Papers

Presented on

S3E 2024 Topical Area, Held as Part of FedCSIS 2024 ; KKIO 2024, Held as Part of SEAA 2024, 28-30.08.2024, Paris, France

Ministry points / chapter

20

Ministry points / conference (CORE)

70
