Automated Code Review Comment Classification to Improve Modern Code Reviews

Mirosław Ochodek; Miroslaw Staron; Wilhelm Meding; Ola Söder

doi:10.1007/978-3-031-04115-0_3

Scientific Information System of the Poznań University of Technology

PL EN

Main page / Publications / Automated Code Review Comment Classification to Improve Modern Code Reviews

Submit a comment

Chapter

Download BibTeX

Title

Automated Code Review Comment Classification to Improve Modern Code Reviews

Authors

Mirosław Ochodek (WIiT) ^{[ 1 ][ 2.3 ][ P ]}
Miroslaw Staron
Wilhelm Meding
Ola Söder

^{[ 1 ]} Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | ^{[ P ]} employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2022

Chapter type

chapter in monograph / paper

Publication language

english

Keywords

EN

modern code reviews
machine learning
BERT

Abstract

EN Modern Code Reviews (MCRs) are a widely-used quality assurance mechanism in continuous integration and deployment. Unfortunately, in medium and large projects, the number of changes that need to be integrated, and consequently the number of comments triggered during MCRs could be overwhelming. Therefore, there is a need for quickly recognizing which comments are concerning issues that need prompt attention to guide the focus of the code authors, reviewers, and quality managers. The goal of this study is to design a method for automated classification of review comments to identify the needed change faster and with higher accuracy. We conduct a Design Science Research study on three open-source systems. We designed a method (CommentBERT) for automated classification of the code-review comments based on the BERT (Bidirectional Encoder Representations from Transformers) language model and a new taxonomy of comments. When applied to 2,672 comments from Wireshark, The Mono Framework, and Open Network Automation Platform (ONAP) projects, the method achieved accuracy, measured using Matthews Correlation Coefficient, of 0.46–0.82 (Wireshark), 0.12–0.8 (ONAP), and 0.48–0.85 (Mono). Based on the results, we conclude that the proposed method seems promising and could be potentially used to build machine-learning-based tools to support MCRs as long as there is a sufficient number of historical code-review comments to train the model.

Date of online publication

12.04.2022

Pages (from - to)

23 - 40

DOI

10.1007/978-3-031-04115-0_3

URL

https://link.springer.com/chapter/10.1007/978-3-031-04115-0_3

Book

Software Quality: The Next Big Thing in Software Engineering and Quality : 14th International Conference on Software Quality, SWQD 2022, Vienna, Austria, May 17–19, 2022 : Proceedings

Presented on

14th International Conference on Software Quality SWQD 2022, 17-19.05.2022, Vienna, Austria