Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Automated Code Review Comment Classification to Improve Modern Code Reviews

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2022

Chapter type

chapter in monograph / paper

Publication language

english

Keywords
EN
  • modern code reviews
  • machine learning
  • BERT
Abstract

EN Modern Code Reviews (MCRs) are a widely-used quality assurance mechanism in continuous integration and deployment. Unfortunately, in medium and large projects, the number of changes that need to be integrated, and consequently the number of comments triggered during MCRs could be overwhelming. Therefore, there is a need for quickly recognizing which comments are concerning issues that need prompt attention to guide the focus of the code authors, reviewers, and quality managers. The goal of this study is to design a method for automated classification of review comments to identify the needed change faster and with higher accuracy. We conduct a Design Science Research study on three open-source systems. We designed a method (CommentBERT) for automated classification of the code-review comments based on the BERT (Bidirectional Encoder Representations from Transformers) language model and a new taxonomy of comments. When applied to 2,672 comments from Wireshark, The Mono Framework, and Open Network Automation Platform (ONAP) projects, the method achieved accuracy, measured using Matthews Correlation Coefficient, of 0.46–0.82 (Wireshark), 0.12–0.8 (ONAP), and 0.48–0.85 (Mono). Based on the results, we conclude that the proposed method seems promising and could be potentially used to build machine-learning-based tools to support MCRs as long as there is a sufficient number of historical code-review comments to train the model.

Date of online publication

12.04.2022

Pages (from - to)

23 - 40

DOI

10.1007/978-3-031-04115-0_3

URL

https://link.springer.com/chapter/10.1007/978-3-031-04115-0_3

Book

Software Quality: The Next Big Thing in Software Engineering and Quality : 14th International Conference on Software Quality, SWQD 2022, Vienna, Austria, May 17–19, 2022 : Proceedings

Presented on

14th International Conference on Software Quality SWQD 2022, 17-19.05.2022, Vienna, Austria

Ministry points / chapter

20

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.